Asankhaya Sharma

codelion

http://asankhaya.github.io/

AI & ML interests

AI/ML, Dev Tools and Application Security

Recent Activity

liked a Space about 19 hours ago

codelion/videoanalysis

published a Space about 19 hours ago

codelion/videoanalysis

codelion's activity

upvoted a paper about 21 hours ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 1 day ago • 61

upvoted a paper 5 days ago

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published 7 days ago • 42

upvoted 5 papers 7 days ago

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Paper • 2504.21233 • Published 8 days ago • 37

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published 7 days ago • 41

UniversalRAG: Retrieval-Augmented Generation over Multiple Corpora with Diverse Modalities and Granularities

Paper • 2504.20734 • Published 9 days ago • 60

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 9 days ago • 88

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published 9 days ago • 50

upvoted a paper 10 days ago

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published 16 days ago • 155

upvoted a paper 13 days ago

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published 13 days ago • 86

upvoted 2 papers 14 days ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published 16 days ago • 73

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

Paper • 2504.16078 • Published 15 days ago • 20

upvoted 2 papers 15 days ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 15 days ago • 102

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 19 days ago • 119

upvoted 3 papers 2 months ago

LettuceDetect: A Hallucination Detection Framework for RAG Applications

Paper • 2502.17125 • Published Feb 24 • 11

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 74

The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer

Paper • 2502.15631 • Published Feb 21 • 9

upvoted a paper 3 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

upvoted a collection 3 months ago

The Ultimate Collection of Code Classifiers

Collection

🔥 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items • Updated 2 days ago • 11

upvoted 2 papers 3 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 156

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17 • 45