OpenEvals's Collections

Research collaborations

updated Apr 2

A small overview of our research collaborations through the years.

  • GAIA: a benchmark for General AI Assistants

    Paper • 2311.12983 • Published Nov 21, 2023 • 207

  • Zephyr: Direct Distillation of LM Alignment

    Paper • 2310.16944 • Published Oct 25, 2023 • 122

  • SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

    Paper • 2502.02737 • Published Feb 4 • 229

  • Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

    Paper • 2412.03304 • Published Dec 4, 2024 • 19

  • The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

    Paper • 2404.05904 • Published Apr 8, 2024 • 9