LongVU - a Vision-CAIR Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Vision-CAIR 's Collections

LongVU

LongVU

updated Oct 31, 2024

Vision-CAIR/LongVU_Qwen2_7B

Video-Text-to-Text • Updated Feb 28 • 178 • 71
Vision-CAIR/LongVU_Llama3_2_3B

Video-Text-to-Text • Updated Feb 28 • 1.32k • 7
Vision-CAIR/LongVU_Llama3_2_3B_img

Updated Feb 28 • 1 • 6
Vision-CAIR/LongVU_Qwen2_7B_img

Updated Feb 28 • 5 • 5
Vision-CAIR/LongVU_Llama3_2_1B

Video-Text-to-Text • Updated Feb 28 • 516 • 11
Runtime error

79

79

LongVU

🌖

Generate responses to video or image inputs
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Paper • 2410.17434 • Published Oct 22, 2024 • 29

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs