Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance Paper • 2502.12459 • Published Feb 18
LongAttn: Selecting Long-context Training Data via Token-level Attention Paper • 2502.16860 • Published Feb 24 • 1
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision Paper • 2502.20790 • Published Feb 28
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance Paper • 2502.12459 • Published Feb 18
TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation Paper • 2503.04872 • Published Mar 6 • 15
Expand VSR Benchmark for VLLM to Expertize in Spatial Rules Paper • 2412.18224 • Published Dec 24, 2024
LongAttn: Selecting Long-context Training Data via Token-level Attention Paper • 2502.16860 • Published Feb 24 • 1
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision Paper • 2502.20790 • Published Feb 28