gabrielmbmb's Collections

Audio Papers

updated Mar 7

A collection of audio-related papers that I want to read.


  • LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

    Paper • 2502.20583 • Published Feb 27 • 13

  • Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

    Paper • 2410.15316 • Published Oct 20, 2024 • 11

  • Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

    Paper • 2503.01710 • Published Mar 3 • 5