Collections
Collections including paper arxiv:2412.13663
- ReZero: Enhancing LLM search ability by trying one-more-time
  Paper • 2504.11001 • Published • 14
- FonTS: Text Rendering with Typography and Style Controls
  Paper • 2412.00136 • Published
- GenEx: Generating an Explorable World
  Paper • 2412.09624 • Published • 97
- Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
  Paper • 2412.13663 • Published • 149

- answerdotai/ModernBERT-base
  Fill-Mask • Updated • 467k • 841
- answerdotai/ModernBERT-large
  Fill-Mask • Updated • 90.7k • 391
- Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
  Paper • 2412.13663 • Published • 149
- lightonai/modernbert-embed-large
  Sentence Similarity • Updated • 7.51k • 23

- MiniMax-01: Scaling Foundation Models with Lightning Attention
  Paper • 2501.08313 • Published • 290
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
  Paper • 2501.04519 • Published • 276
- Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
  Paper • 2412.13663 • Published • 149
- Apollo: An Exploration of Video Understanding in Large Multimodal Models
  Paper • 2412.10360 • Published • 146

- Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
  Paper • 2412.13663 • Published • 149
- Qwen2.5 Technical Report
  Paper • 2412.15115 • Published • 367
- Are Your LLMs Capable of Stable Reasoning?
  Paper • 2412.13147 • Published • 95
- Byte Latent Transformer: Patches Scale Better Than Tokens
  Paper • 2412.09871 • Published • 102