Running on CPU Upgrade 70 70 AIR-Bench Leaderboard π₯ Explore benchmark results for QA and long doc models
Running on CPU Upgrade 116 116 Open Chinese LLM Leaderboard π Display and filter LLM benchmark results
Running on CPU Upgrade 13k 13k Open LLM Leaderboard π Track, rank and evaluate open LLMs and chatbots