5 78 22

Ha-Yeong Choi

Ha0

https://scholar.google.com/citations?user=Jw3X6UgAAAAJ&hl=ko

hayeong0

AI & ML interests

Speech Synthesis, Voice Conversion, Generative Models

Recent Activity

liked a model about 11 hours ago

lmms-lab/Aero-1-Audio

upvoted a paper 5 days ago

Spatial Speech Translation: Translating Across Space With Binaural Hearables

upvoted a paper 5 days ago

A Survey of Interactive Generative Video

View all activity

Organizations

None yet

Ha0's activity

liked a model about 11 hours ago

lmms-lab/Aero-1-Audio

Text Generation • Updated 10 days ago • 861 • 63

upvoted 2 papers 5 days ago

Spatial Speech Translation: Translating Across Space With Binaural Hearables

Paper • 2504.18715 • Published 12 days ago • 7

A Survey of Interactive Generative Video

Paper • 2504.21853 • Published 7 days ago • 42

upvoted a paper 23 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published about 1 month ago • 129

upvoted a collection about 2 months ago

Orpheus TTS

Collection

TTS Towards Human-Sounding Speech • 2 items • Updated Mar 18 • 64

liked 2 datasets about 2 months ago

google/fleurs

Updated Aug 25, 2024 • 29.2k • 288

google/xtreme_s

Updated Sep 10, 2024 • 4.1k • 62

upvoted a paper 2 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 87

upvoted 3 papers 3 months ago

upvoted a paper 4 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 88

liked a model 4 months ago

kyutai/helium-1-preview-2b

Text Generation • Updated 7 days ago • 12.5k • 143

upvoted 3 papers 5 months ago

Causal Diffusion Transformers for Generative Modeling

Paper • 2412.12095 • Published Dec 16, 2024 • 23

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published Dec 5, 2024 • 16

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 85

liked a model 5 months ago

google-bert/bert-large-uncased

Fill-Mask • Updated Feb 19, 2024 • 1M • 132

upvoted 2 papers 5 months ago

Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis

Paper • 2412.01819 • Published Dec 2, 2024 • 36

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 31

liked a Space 5 months ago

2.09k

Anycoder

🏢

Select and view code snippets for different providers