Submitted by ksshumab · 57 · Predictive Data Selection: The Data That Predicts Is the Data That Teaches · 8 authors · 2
Submitted by lzq2021 · 40 · DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking · 9 authors · 4
Submitted by nicolas-dufour · 26 · How far can we go with ImageNet for Text-to-Image generation? · 5 authors · 2
Submitted by autumncc · 20 · ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents · 7 authors · 2
Submitted by akhaliq · 16 · Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids · 5 authors · 2
Submitted by kamahori · 13 · LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation · 4 authors · 2
Submitted by kamahori · 11 · TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval · 14 authors · 2
Submitted by hturbe · 11 · Tell me why: Visual foundation models as self-explainable classifiers · 4 authors · 2
Submitted by adaamko · 11 · LettuceDetect: A Hallucination Detection Framework for RAG Applications · 2 authors · 2
Submitted by Yifan-Zhong · 9 · DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping · 7 authors · 2
Submitted by BestWishYsh · 5 · MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing · 6 authors · 2
Submitted by akhaliq · 2 · HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models · 8 authors · 2