OpenReasoning

AI & ML interests

None defined yet.

Recent Activity

zhaoguangxiang authored a paper about 2 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

zhaoguangxiang authored a paper about 2 months ago

LongAttn: Selecting Long-context Training Data via Token-level Attention

zhaoguangxiang authored a paper about 2 months ago

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

View all activity

OpenReasoning's activity

zhaoguangxiang

authored 3 papers about 2 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18

LongAttn: Selecting Long-context Training Data via Token-level Attention

Paper • 2502.16860 • Published Feb 24 • 1

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Paper • 2502.20790 • Published Feb 28

Husserl233

authored a paper about 2 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

yuhanwuuu

authored a paper about 2 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

zhaoguangxiang

authored a paper about 2 months ago

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

lincharliesun

authored 5 papers about 2 months ago

Stress Testing Generalization: How Minor Modifications Undermine Large Language Model Performance

Paper • 2502.12459 • Published Feb 18

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Paper • 2503.04872 • Published Mar 6 • 15

Expand VSR Benchmark for VLLM to Expertize in Spatial Rules

Paper • 2412.18224 • Published Dec 24, 2024

LongAttn: Selecting Long-context Training Data via Token-level Attention

Paper • 2502.16860 • Published Feb 24 • 1

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Paper • 2502.20790 • Published Feb 28