Space: 3rdn4/evaluation (duplicated from OpenHands/evaluation)
Build error
7eb2653
evaluation/utils
6 contributors
History: 13 commits
Xingyao Wang
fix fine-grained report; support visualization while running
7eb2653
12 months ago
File          Scan   Size      Last commit message                                             Updated
__init__.py   Safe   5.57 kB   support visualization of new swebench-eval                      12 months ago
mint.py       Safe   3.48 kB   Create visualization for MINT benchmark & upload results (#2)   12 months ago
swe_bench.py         7.48 kB   fix fine-grained report; support visualization while running    12 months ago