Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OliP 's Collections
NewGen small LMs
Leading Leaderboards
2024 Papers of the year
2023 (and before) Papers of the Year
LLM Deployment
Vision-Language
Long-Context
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
Applications
Coding

Leading Leaderboards

updated Nov 6, 2024
Upvote
-

  • Running on CPU Upgrade
    13k
    13k

    Open LLM Leaderboard

    🏆

    Track, rank and evaluate open LLMs and chatbots


  • Running on CPU Upgrade
    5.6k
    5.6k

    MTEB Leaderboard

    🥇

    Embedding Leaderboard


  • Running
    4.36k
    4.36k

    Chatbot Arena Leaderboard

    🏆

    Display chatbot performance leaderboard


  • Running
    203
    203

    BigCodeBench Leaderboard

    🥇

    Explore and analyze code evaluation data


  • Running on CPU Upgrade
    739
    739

    Open VLM Leaderboard

    🌎

    VLMEvalKit Evaluation Results Collection


  • Running
    134
    134

    Vidore Leaderboard

    🥇

    Browse and submit visual document retrieval benchmark results


  • Running on CPU Upgrade
    796
    796

    Open ASR Leaderboard

    🏆

    Request evaluation for new speech models


  • Running
    102
    102

    Berkeley Function Calling Leaderboard

    🏃


  • Runtime error
    21
    21

    LiveBench

    🥇


  • Running
    22
    22

    JudgeBench Leaderboard

    🏆

    Display and filter leaderboard results for LLM judges

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs