Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Duplicated fromΒ
OpenHands/evaluation
3rdn4
/
evaluation
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
72c2e93
evaluation
Ctrl+K
Ctrl+K
6 contributors
History:
17 commits
Xingyao Wang
add results for gpt-4o
72c2e93
about 1 year ago
outputs
add results for gpt-4o
about 1 year ago
pages
support multi-page
about 1 year ago
utils
change to only load merged
about 1 year ago
.gitattributes
Safe
1.61 kB
initial results
about 1 year ago
.gitignore
Safe
14 Bytes
update gitignore
about 1 year ago
0_π_OpenDevin_Benchmark.py
Safe
2.85 kB
add absolute number of solved
about 1 year ago
README.md
Safe
277 Bytes
Update README.md
about 1 year ago
requirements.txt
Safe
43 Bytes
add benchmark code
about 1 year ago