Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Min-Hung Chen's picture
5 33 4

Min-Hung Chen

cmhungsteve
huckiyang's profile picture Mi6paulino's profile picture BK-Lee's profile picture
·
https://minhungchen.netlify.app/
  • CMHungSteven
  • cmhungsteve
  • chensteven
  • cmhungsteve.bsky.social

AI & ML interests

Multimodal AI, Transfer Learning, Unsupervised Learning, Video Understanding, Vision Transformer, Computer Vision, Deep Learning

Recent Activity

liked a Space about 2 months ago
nvidia/multilingual-voice-4B-demo
upvoted a paper 2 months ago
Token-Efficient Long Video Understanding for Multimodal LLMs
upvoted a paper 2 months ago
Visual-RFT: Visual Reinforcement Fine-Tuning
View all activity

Organizations

NVIDIA's profile picture Georgia Tech (Georgia Institute of Technology)'s profile picture

cmhungsteve's activity

commented a paper 3 months ago

V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models

Paper • 2502.09980 • Published Feb 14 • 4 •
2
commented a paper 4 months ago

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Paper • 2501.08326 • Published Jan 14 • 35 •
2
commented a paper 6 months ago

EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Paper • 2410.21271 • Published Oct 28, 2024 • 7 •
2
commented a paper about 1 year ago

DoRA: Weight-Decomposed Low-Rank Adaptation

Paper • 2402.09353 • Published Feb 14, 2024 • 27 •
7
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs