ESPnet

non-profit

https://github.com/espnet/espnet

espnet

Activity Feed Request to join this org

AI & ML interests

voice-conversion speech-separation speech-enhancement speech-translation speech-synthesis speech-recognition spoken-language-understanding

Recent Activity

TangRain updated a model about 17 hours ago

espnet/mixdata_svs_visinger2_spkemb_lang_pretrained

TangRain published a model about 17 hours ago

espnet/mixdata_svs_visinger2_spkemb_lang_pretrained

Fhrozen updated a Space 2 days ago

espnet/TheESPnetLeaderBoard

View all activity

espnet's activity

TangRain

updated a model about 17 hours ago

espnet/mixdata_svs_visinger2_spkemb_lang_pretrained

Updated about 17 hours ago

TangRain

published a model about 17 hours ago

espnet/mixdata_svs_visinger2_spkemb_lang_pretrained

Updated about 17 hours ago

Fhrozen

updated a Space 2 days ago

TheESPnetLeaderBoard

Official ESPnet Leaderboard

lijialudew

updated a model 2 days ago

espnet/proyecto_nahuatl

Automatic Speech Recognition • Updated 2 days ago

lijialudew

published a model 2 days ago

espnet/proyecto_nahuatl

Automatic Speech Recognition • Updated 2 days ago

wanchichen

updated a model 5 days ago

espnet/owls_18b_360K_intermediates

Updated 5 days ago

wanchichen

updated 6 models 6 days ago

espnet/owls_18B_180K

Automatic Speech Recognition • Updated 6 days ago • 14 • 1

espnet/owls_2B_180K

Automatic Speech Recognition • Updated 6 days ago • 35

espnet/owls_1B_180K

Automatic Speech Recognition • Updated 6 days ago • 5 • 2

espnet/owls_025B_180K

Automatic Speech Recognition • Updated 6 days ago • 15

espnet/owls_05B_180K

Automatic Speech Recognition • Updated 6 days ago • 14

espnet/owls_4B_180K

Automatic Speech Recognition • Updated 6 days ago • 62 • 5

wanchichen

updated a collection 6 days ago

OWLS: Scaling Laws for Speech Recognition and Translation

🦉 A suite of Whisper-style models from 250M to 18B parameters. Trained on up to 360K hours of data. 16k sampling rate. • 8 items • Updated 6 days ago • 5

wanchichen

updated 2 models 6 days ago

espnet/owls_18B_360K

Automatic Speech Recognition • Updated 6 days ago • 1

espnet/owls_9B_180K

Automatic Speech Recognition • Updated 6 days ago • 9

wanchichen

published 2 models 6 days ago

espnet/owls_18b_360K_intermediates

Updated 5 days ago

espnet/owls_18B_360K

Automatic Speech Recognition • Updated 6 days ago • 1

Fhrozen

published a Space 10 days ago

TheESPnetLeaderBoard

Official ESPnet Leaderboard

wanchichen

updated a model 28 days ago

espnet/owls_2B_180K_intermediates

Updated 28 days ago • 6

BrianatCambridge

authored a paper 3 months ago

video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model

Paper • 2502.11775 • Published Feb 17 • 8