Space: ryanhoangt/OpenHands-evaluation (duplicated from OpenHands/evaluation)
6 contributors
History: 70 commits
Latest commit: Xingyao Wang, "update results using new ver of swebench" (091b42e), 11 months ago
Name                        Scan  Size       Last commit message                                         Updated
outputs                     -     -          update results using new ver of swebench                    11 months ago
pages                       -     -          fix visualizer to only display eval_report when it exists   11 months ago
utils                       -     -          support loading report with new format                      11 months ago
.gitattributes              Safe  1.61 kB    initial results                                             about 1 year ago
.gitignore                  Safe  109 Bytes  update gitignore                                            11 months ago
0_π_OpenDevin_Benchmark.py  Safe  4.9 kB     set n error/stuck/cost to 0 for CodeAct exp run below v1.5  11 months ago
README.md                   Safe  277 Bytes  Update README.md                                            about 1 year ago
requirements.txt            Safe  52 Bytes   update visualizer on multi-page                             about 1 year ago