Yinan He

yinanhe

AI & ML interests

computer vision

Organizations

None yet

yinanhe's activity

authored a paper 23 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 24 days ago • 255
authored a paper 28 days ago

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Paper • 2504.06958 • Published 29 days ago • 11
authored a paper about 1 month ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27 • 34
authored a paper 4 months ago

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

Paper • 2412.19326 • Published Dec 26, 2024 • 18
authored a paper about 1 year ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 26