Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Zhaoyang Liu's picture
5 4 22

Zhaoyang Liu

zyliu
21world's profile picture
·
  • liu-zhy

AI & ML interests

Video understanding, 3D Perception, Autonomous driving, Foundation models, AIGC

Recent Activity

updated a model 10 days ago
shzhonghe/qwen2_5vl_2
published a model 10 days ago
shzhonghe/qwen2_5vl_2
upvoted a paper 15 days ago
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
View all activity

Organizations

OpenGVLab's profile picture MyGroup's profile picture Zhonghe & SHAILAB's profile picture

zyliu's activity

upvoted a paper 15 days ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published 17 days ago • 73
upvoted a paper 24 days ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 24 days ago • 255
upvoted a paper 5 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 157
upvoted a paper over 1 year ago

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper • 2310.17796 • Published Oct 26, 2023 • 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs