Llama Nemotron Collection Open, Production-ready Enterprise Models • 5 items • Updated 3 days ago • 50
NanoBEIR 🍺 with BM25 Rankings Collection NanoBEIR by Zeta Alpha, extended with BM25 scores. These datasets are used in the Sentence Transformers Cross Encoder NanoBEIR Evaluator. • 13 items • Updated Feb 25 • 2
Nomic Embed Multimodal Collection Multimodal models allowing you to search over interleaved text, PDFs, charts, and images! • 15 items • Updated Apr 7 • 20
Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 218
EuroBERT Collection Scaling Multilingual Encoders for European Languages • 4 items • Updated Mar 10 • 11
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7 • 78
SaulLM-54B & SaulLM-141B: Scaling Up Domain Adaptation for the Legal Domain Paper • 2407.19584 • Published Jul 28, 2024 • 66
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 758
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 27