Yanwei Li's picture

2 5 5

Yanwei Li

YanweiLi

·

AI & ML interests

None yet

Recent Activity

authored a paper 21 days ago

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

authored a paper 3 months ago

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

authored a paper 5 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

View all activity

Organizations

None yet

YanweiLi's activity

authored a paper 21 days ago

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published 24 days ago • 28

authored a paper 3 months ago

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Paper • 2502.09621 • Published Feb 13 • 28

authored a paper 5 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 49

upvoted a paper 5 months ago

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 49

authored a paper 9 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

upvoted a paper 9 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6, 2024 • 61

authored a paper 11 months ago

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Paper • 2405.21075 • Published May 31, 2024 • 24

updated 2 models about 1 year ago

YanweiLi/MGM-8B

Text Generation • Updated May 4, 2024 • 8 • 1

YanweiLi/MGM-8B-HD

Text Generation • Updated May 4, 2024 • 7 • 6

updated a collection about 1 year ago

MGM

Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" • 13 items • Updated May 3, 2024 • 47

updated 8 models about 1 year ago

YanweiLi/MGM-8x7B

Text Generation • Updated Apr 21, 2024 • 9 • 7

YanweiLi/MGM-8x7B-HD

Text Generation • Updated Apr 21, 2024 • 7 • 9

YanweiLi/MGM-7B

Text Generation • Updated Apr 21, 2024 • 790 • 8

YanweiLi/MGM-13B-HD

Text Generation • Updated Apr 21, 2024 • 8 • 13

YanweiLi/MGM-34B-HD

Text Generation • Updated Apr 21, 2024 • 10 • 21

YanweiLi/MGM-34B

Text Generation • Updated Apr 21, 2024 • 6 • 9

YanweiLi/MGM-2B

Text Generation • Updated Apr 21, 2024 • 56 • 20

YanweiLi/MGM-7B-HD

Text Generation • Updated Apr 21, 2024 • 13 • 29