Running 4 4 CompassJudger Subjective Evaluation Learderboard 🌎 CompassJudger Subjective Evaluation Learderboard
Running on CPU Upgrade 70 70 AIR-Bench Leaderboard 🥇 Explore benchmark results for QA and long doc models