Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromΒ
OpenHands/evaluation
3rdn4
/
openhands_official_evaluation
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
03f74db
openhands_official_evaluation
Ctrl+K
Ctrl+K
6 contributors
History:
56 commits
Xingyao Wang
add result for codeact 1.6
03f74db
12 months ago
outputs
add result for codeact 1.6
12 months ago
pages
only show swe bench on visualizer
12 months ago
utils
change test_result to bool
12 months ago
.gitattributes
Safe
1.61 kB
initial results
about 1 year ago
.gitignore
Safe
85 Bytes
add result for codeact 1.6
12 months ago
0_π_OpenDevin_Benchmark.py
Safe
4.15 kB
Create visualization for MINT benchmark & upload results (#2)
12 months ago
README.md
Safe
277 Bytes
Update README.md
about 1 year ago
requirements.txt
Safe
52 Bytes
update visualizer on multi-page
about 1 year ago