ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction Paper • 2412.12888 • Published Dec 17, 2024
EliGen: Entity-Level Controlled Image Generation with Regional Attention Paper • 2501.01097 • Published Jan 2
Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs Paper • 2504.17432 • Published 14 days ago • 38