Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation • Paper • 2504.17207 • Published 4 days ago • 26
Step1X-Edit: A Practical Framework for General Image Editing • Paper • 2504.17761 • Published 4 days ago • 75
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 11 items • Updated 28 days ago • 450
Llama 3.2 • Collection • This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 600