Hinglish/Hindi CoT Collection Hinglish and Hindi CoTs; authors:-> fhai50032 and adi-kmt • 9 items • Updated 9 days ago • 2
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper • 2502.08127 • Published Feb 12 • 56
Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing Paper • 2502.12962 • Published Feb 18 • 1
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 24 days ago • 255
indic-evals Collection Translated versions of popular LLM benchmarks. • 4 items • Updated Oct 23, 2024 • 5
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated 28 days ago • 77
SANA-Sprint Collection 🏃SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated 22 days ago • 35
SuperBPE Collection SuperBPE tokenizers and models trained with them • 8 items • Updated 29 days ago • 14
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated 22 days ago • 4
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 229