Collections including paper arxiv:2402.17764
-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 148 -
Orion-14B: Open-source Multilingual Large Language Models
Paper • 2401.12246 • Published • 13 -
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 59 -
MM-LLMs: Recent Advances in MultiModal Large Language Models
Paper • 2401.13601 • Published • 49
-
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper • 2309.11495 • Published • 39 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 78 -
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
Paper • 2309.09400 • Published • 85 -
Language Modeling Is Compression
Paper • 2309.10668 • Published • 83
-
Qwen2.5 Technical Report
Paper • 2412.15115 • Published • 367 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 615 -
meta-llama/Llama-4-Scout-17B-16E-Instruct
Image-Text-to-Text • Updated • 829k • 878 -
keras-io/GauGAN-Image-generation
Updated • 34 • 4
-
HiDream-ai/HiDream-I1-Full
Text-to-Image • Updated • 40k • 826 -
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated • 3.91M • 11.3k • 470 -
6.17k
DeepSite
🐳Generate any application with DeepSeek
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 615
-
meta-llama/Llama-4-Scout-17B-16E-Instruct
Image-Text-to-Text • Updated • 829k • 878 -
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer • Updated • 3.91M • 11.3k • 470 -
6.17k
DeepSite
🐳Generate any application with DeepSeek
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 615