FlowReasoner: Reinforcing Query-Level Meta-Agents Paper β’ 2504.15257 β’ Published 7 days ago β’ 45
LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Paper β’ 2504.16078 β’ Published 6 days ago β’ 19
view article Article Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition π€ 14 days ago β’ 40
DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning Paper β’ 2503.15265 β’ Published Mar 19 β’ 47
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published Jan 22 β’ 391
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper β’ 2411.06176 β’ Published Nov 9, 2024 β’ 46
Gemma 2: Improving Open Language Models at a Practical Size Paper β’ 2408.00118 β’ Published Jul 31, 2024 β’ 77
MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model Paper β’ 2408.10198 β’ Published Aug 19, 2024 β’ 35
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper β’ 2406.19280 β’ Published Jun 27, 2024 β’ 65
X-Mesh: Towards Fast and Accurate Text-driven 3D Stylization via Dynamic Textual Guidance Paper β’ 2303.15764 β’ Published Mar 28, 2023 β’ 2
DETRs Beat YOLOs on Real-time Object Detection Paper β’ 2304.08069 β’ Published Apr 17, 2023 β’ 13
CAT3D: Create Anything in 3D with Multi-View Diffusion Models Paper β’ 2405.10314 β’ Published May 16, 2024 β’ 49