T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published 6 days ago • 39
TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching Paper • 2505.00562 • Published 7 days ago • 3
Improving Editability in Image Generation with Layer-wise Memory Paper • 2505.01079 • Published 6 days ago • 23
PixelHacker: Image Inpainting with Structural and Semantic Consistency Paper • 2504.20438 • Published 9 days ago • 37
COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning Paper • 2504.21850 • Published 7 days ago • 24
UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities Paper • 2504.20734 • Published 9 days ago • 60
YoChameleon: Personalized Vision and Language Generation Paper • 2504.20998 • Published 8 days ago • 11