VideoChat-R1 Collection VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning • 3 items • Updated 17 days ago • 5
Chat with Kimi-VL (Image, Agent, Video, PDF) 🚀 Chat with Kimi-VL-A3B-Instruct using text, images, and videos • Running on Zero • 41
Article • From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate • Jun 13, 2024 • 54
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation Paper • 2503.19622 • Published Mar 25 • 31
OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B Video-Text-to-Text • Updated Mar 16 • 159 • 4