Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromย
OpenHands/evaluation
3rdn4
/
openhands_official_evaluation
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
03f74db
openhands_official_evaluation
/
utils
Ctrl+K
Ctrl+K
6 contributors
History:
14 commits
Xingyao Wang
change test_result to bool
1ae8615
12 months ago
__init__.py
Safe
5.57 kB
support visualization of new swebench-eval
12 months ago
mint.py
Safe
3.48 kB
Create visualization for MINT benchmark & upload results (#2)
12 months ago
swe_bench.py
Safe
7.81 kB
change test_result to bool
12 months ago