fdsqefsgergd

T-representer

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago

Teaching Models to Understand (but not Generate) High-risk Data

upvoted a paper about 6 hours ago

SWE-smith: Scaling Data for Software Engineering Agents

upvoted a paper about 15 hours ago

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

Organizations

None yet

T-representer's activity

upvoted 2 papers about 6 hours ago

Teaching Models to Understand (but not Generate) High-risk Data

Paper • 2505.03052 • Published 2 days ago • 2

SWE-smith: Scaling Data for Software Engineering Agents

Paper • 2504.21798 • Published 7 days ago • 5

upvoted 2 papers about 15 hours ago

HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation

Paper • 2504.21650 • Published 8 days ago • 7

VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Paper • 2505.03739 • Published 1 day ago • 6

upvoted 8 papers about 20 hours ago

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

Paper • 2505.03730 • Published 1 day ago • 21

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 2 days ago • 65

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published 2 days ago • 68

InfoVids: Reimagining the Viewer Experience with Alternative Visualization-Presenter Relationships

Paper • 2505.03164 • Published 2 days ago • 5

upvoted 8 papers 1 day ago

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL

Paper • 2505.02391 • Published 3 days ago • 21

Practical Efficiency of Muon for Pretraining

Paper • 2505.02222 • Published 3 days ago • 34

SkillMimic-V2: Learning Robust and Generalizable Interaction Skills from Sparse and Noisy Demonstrations

Paper • 2505.02094 • Published 4 days ago • 15

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published 10 days ago • 28

Think on your Feet: Adaptive Thinking via Reinforcement Learning for Social Agents

Paper • 2505.02156 • Published 3 days ago • 16

SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing

Paper • 2505.02370 • Published 3 days ago • 10

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities

Paper • 2505.01043 • Published 6 days ago • 9

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published 5 days ago • 28