Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2501.19393

RL+reason model

about 5 hours ago

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28
Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27 • 30
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 121
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Paper • 2412.12098 • Published Dec 16, 2024 • 5

Video Generation

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 28 days ago • 123
s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120
LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 229
Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 59
Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 115
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 121

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

Papers Storm 🌪️

A curated collection of research papers referenced in Panoram'IA program, offering a comprehensive resource for further exploration.

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published Dec 12, 2024 • 97
Generative World Explorer

Paper • 2411.11844 • Published Nov 18, 2024 • 78
Video Depth without Video Models

Paper • 2411.19189 • Published Nov 28, 2024 • 39
Mobile Video Diffusion

Paper • 2412.07583 • Published Dec 10, 2024 • 20

LLM and Reasoning Papers

Papers dump of LLM Reasoning domain

about 1 month ago

Internal Consistency and Self-Feedback in Large Language Models: A Survey

Paper • 2407.14507 • Published Jul 19, 2024 • 47
Large Language Models are Zero-Shot Reasoners

Paper • 2205.11916 • Published May 24, 2022 • 1
Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 10
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Paper • 2201.11903 • Published Jan 28, 2022 • 13

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 114
PaSa: An LLM Agent for Comprehensive Academic Paper Search

Paper • 2501.10120 • Published Jan 17 • 49
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong

Paper • 2501.09775 • Published Jan 16 • 34
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Paper • 2501.10132 • Published Jan 17 • 20

test time compute

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

Previous
1
2
3
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs