YTC SDT team

company

https://huggingface.co/datasets/Gorgarik/YTC

Activity Feed

AI & ML interests

Yes

Recent Activity

nithinraok authored a paper about 2 months ago

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

nithinraok authored a paper about 2 months ago

TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context

nithinraok authored a paper about 2 months ago

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

View all activity

YTCenj's activity

nithinraok

authored 10 papers about 2 months ago

The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System

Paper • 2310.12378 • Published Oct 18, 2023

TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context

Paper • 2110.04410 • Published Oct 8, 2021

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach

Paper • 2309.05248 • Published Sep 11, 2023

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations

Paper • 2407.03495 • Published Jul 3, 2024

Spectral Codecs: Spectrogram-Based Audio Codecs for High Quality Speech Synthesis

Paper • 2406.05298 • Published Jun 7, 2024

Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens

Paper • 2409.06656 • Published Sep 10, 2024

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks

Paper • 2408.13106 • Published Aug 23, 2024 • 1

Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation

Paper • 2310.12371 • Published Oct 18, 2023

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

Paper • 2406.19674 • Published Jun 28, 2024

Training and Inference Efficiency of Encoder-Decoder Speech Models

Paper • 2503.05931 • Published Mar 7 • 3

mgaido91

authored 10 papers 2 months ago

When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP

Paper • 2303.16166 • Published Mar 28, 2023 • 1

Direct Models for Simultaneous Translation and Automatic Subtitling: FBK@IWSLT2023

Paper • 2309.15554 • Published Sep 27, 2023 • 1

Integrating Language Models into Direct Speech Translation: An Inference-Time Solution to Control Gender Inflection

Paper • 2310.15752 • Published Oct 24, 2023 • 1

Dealing with training and test segmentation mismatch: FBK@IWSLT2021

Paper • 2106.12607 • Published Jun 23, 2021 • 1

Speechformer: Reducing Information Loss in Direct Speech Translation

Paper • 2109.04574 • Published Sep 9, 2021 • 1

Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?

Paper • 2402.12025 • Published Feb 19, 2024 • 1

How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena

Paper • 2402.13208 • Published Feb 20, 2024

Does Simultaneous Speech Translation need Simultaneous Models?

Paper • 2204.03783 • Published Apr 8, 2022 • 1

Efficient yet Competitive Speech Translation: FBK@IWSLT2022

Paper • 2205.02629 • Published May 5, 2022 • 1

Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation

Paper • 2206.05807 • Published Jun 12, 2022 • 1

AI & ML interests

Recent Activity

Team members 8

YTCenj's activity