Sen Yang

ringos

https://ringos.github.io/

AI & ML interests

None yet

ringos's activity

upvoted a paper 16 days ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published 17 days ago • 80

liked a dataset 3 months ago

di-zhang-fdu/AIME_1983_2024

Viewer • Updated Mar 3 • 933 • 2.14k • 30

upvoted a paper 3 months ago

MoM: Linear Sequence Modeling with Mixture-of-Memories

Paper • 2502.13685 • Published Feb 19 • 36

liked a dataset 3 months ago

open-r1/OpenR1-Math-220k

Viewer • Updated Feb 18 • 450k • 28.9k • 572

upvoted a paper 4 months ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published Jan 22 • 61

updated a dataset 4 months ago

ringos/output_Llama-3.1-8B-simpleqa-0_1000-m_generation-n_128-t_1.0-k_50-p_0.95-l_128

Updated Dec 25, 2024 • 22

liked a Space 5 months ago

Scaling test-time compute 📈

Space • Enhance math problem solving by scaling test-time compute • 557

updated 5 datasets 5 months ago

ringos/output_Llama-3.1-8B-simpleqa-0_-1-m_generation-n_128-t_1.0-k_50-p_0.95-l_128

Updated Dec 17, 2024 • 76

ringos/mistral_nemo_base-mmlu-val

Viewer • Updated Dec 16, 2024 • 18.7k • 78

ringos/llama-3_1-8b-mmlu-val

Viewer • Updated Dec 16, 2024 • 18.7k • 67

ringos/output_Mistral-Nemo-Base-2407-simpleqa-0_1000-m_generation-n_32-t_1.0-k_40-p_0.9-l_128

Viewer • Updated Dec 2, 2024 • 216 • 58

ringos/simple_qa

Viewer • Updated Dec 2, 2024 • 4.33k • 40

liked 2 datasets 6 months ago

HuggingFaceTB/smoltalk

Viewer • Updated Feb 10 • 2.2M • 7.41k • 333

MingZhong/crosseval

Viewer • Updated Oct 1, 2024 • 1.4k • 36 • 6

updated 3 datasets 6 months ago