Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
VITA-MLLM 's Collections
VITA-Audio
Long-VITA

VITA-Audio

updated about 8 hours ago
Upvote
1

  • VITA-MLLM/VITA-Audio-Boost

    Updated 11 days ago • 8 • 1

  • VITA-MLLM/VITA-Audio-Balance

    Updated 11 days ago • 10 • 1

  • VITA-MLLM/VITA-Audio-Plus-Vanilla

    Updated 2 days ago • 78 • 2

  • VITA-MLLM/VITA-Audio-Data

    Preview • Updated about 10 hours ago • 3

  • VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Paper • 2505.03739 • Published 2 days ago • 6
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs