Reinforcement Learning for Reasoning in Large Language Models with One Training Example • Paper • 2504.20571 • Published 8 days ago • 88
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play • Paper • 2505.02707 • Published 2 days ago • 65
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs • Paper • 2504.18415 • Published 12 days ago • 41
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning • Paper • 2504.17192 • Published 14 days ago • 105
Describe Anything: Detailed Localized Image and Video Captioning • Paper • 2504.16072 • Published 15 days ago • 60
QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models • Paper • 2502.12346 • Published Feb 17 • 1
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients • Paper • 2407.08296 • Published Jul 11, 2024 • 34
When are 1.58 bits enough? A Bottom-up Exploration of BitNet Quantization • Paper • 2411.05882 • Published Nov 8, 2024 • 1
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits • Paper • 2402.17764 • Published Feb 27, 2024 • 615
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUs • Paper • 2410.16144 • Published Oct 21, 2024 • 5
BitNet: Scaling 1-bit Transformers for Large Language Models • Paper • 2310.11453 • Published Oct 17, 2023 • 102
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? • Paper • 2504.13837 • Published 19 days ago • 119
FANformer: Improving Large Language Models Through Effective Periodicity Modeling • Paper • 2502.21309 • Published Feb 28 • 1
Cobra: Efficient Line Art COlorization with BRoAder References • Paper • 2504.12240 • Published 21 days ago • 27