9 1

Ritvik Rastogi

Ritvik19

https://ritvik19.github.io

AI & ML interests

Machine Learning Deep Learning, Natural Language Processing, Computer Vision

Recent Activity

commented on a paper 7 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

commented on a paper 7 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

commented on a paper 9 days ago

Process Reward Models That Think

View all activity

Organizations

Ritvik19's activity

commented 2 papers 7 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 9 days ago • 88 •

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 9 days ago • 88 •

commented a paper 9 days ago

Process Reward Models That Think

Paper • 2504.16828 • Published 15 days ago • 16 •

commented a paper 13 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 20 days ago • 119 •

commented 2 papers 14 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 20 days ago • 119 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 20 days ago • 119 •

commented 2 papers 15 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 20 days ago • 119 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 20 days ago • 119 •

commented 2 papers 20 days ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published 23 days ago • 12 •

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published 23 days ago • 12 •

commented a paper 21 days ago

From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models

Paper • 2504.06214 • Published 30 days ago •

commented 2 papers about 1 month ago

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Paper • 2503.20641 • Published Mar 26 • 8 •

Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging

Paper • 2503.20641 • Published Mar 26 • 8 •

New activity in open-acc/README 6 months ago

[24/ 11] What are you working on this week! 💪

#2 opened 6 months ago by

reach-vb

New activity in Ritvik19/openhermes-danube2-sft-qlora 12 months ago

Adding Evaluation Results

#1 opened 12 months ago by

leaderboard-pr-bot

New activity in Ritvik19/Sudoku-Dataset over 1 year ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by

parquet-converter