SAMBIT CHAKRABORTY's picture

64 9

SAMBIT CHAKRABORTY

sambitchakhf03

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

upvoted a paper 5 days ago

DeepCritic: Deliberate Critique with Large Language Models

upvoted a paper 6 days ago

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

View all activity

Organizations

sambitchakhf03's activity

upvoted a paper 2 days ago

Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

Paper • 2505.00234 • Published 7 days ago • 20

upvoted a paper 5 days ago

DeepCritic: Deliberate Critique with Large Language Models

Paper • 2505.00662 • Published 6 days ago • 46

upvoted a paper 6 days ago

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published 7 days ago • 41

upvoted 3 papers 13 days ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published 16 days ago • 73

OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published 16 days ago • 33

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 15 days ago • 102

upvoted a paper 20 days ago

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Paper • 2504.09643 • Published 24 days ago • 34

upvoted 3 papers 25 days ago

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Paper • 2504.06958 • Published 28 days ago • 11

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 27 days ago • 27

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 83

upvoted a paper 27 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 29 days ago • 73

upvoted 6 papers about 1 month ago

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published Apr 2 • 19

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Paper • 2504.01956 • Published Apr 2 • 40

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published Apr 3 • 56

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 273

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published Apr 3 • 78

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 149

liked a model about 1 month ago

sambitchakhf03/chatbox-llm-merged

Text Generation • Updated Aug 15, 2023 • 53 • 1

upvoted 2 papers about 2 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 162

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 94