Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZhangYuanhan 's Collections
LMM RL
good papers
Vision Language General

LMM RL

updated Mar 13
Upvote
-

  • Token-Efficient Long Video Understanding for Multimodal LLMs

    Paper • 2503.04130 • Published Mar 6 • 94

  • Temporal Preference Optimization for Long-Form Video Understanding

    Paper • 2501.13919 • Published Jan 23 • 22

  • MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

    Paper • 2503.07365 • Published Mar 10 • 61

    Note KL in RL is unnecessary.

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs