Format information in README
Browse files
README.md
CHANGED
@@ -1,19 +1,25 @@
|
|
1 |
-
|
2 |
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
[](https://discord.gg/Rq7t8wnsuA) 
|
7 |
-
[](https://twitter.com/WecoAI) 
|
8 |
|
9 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
|
11 |
-
AIDE is
|
12 |
-
|
13 |
-
METR's [RE-Bench](https://arxiv.org/pdf/2411.15114) shows that AIDE is not only capable at machine learning tasks but generalizes to the AI R&D tasks such as optimizing low level Triton kernels and finetuning GPT-2 for QA, even surpassing the performance of human experts.
|
14 |
|
15 |
In our own benchmark composed of over 60 Kaggle data science competitions, AIDE demonstrated impressive performance, surpassing 50% of Kaggle participants on average.
|
16 |
|
|
|
|
|
|
|
|
|
17 |
More specifically, AIDE has the following features:
|
18 |
|
19 |
1. **Instruct with Natural Language**: Describe your problem or additional requirements and expert insights, all in natural language.
|
@@ -253,7 +259,7 @@ By repeatedly applying these steps, AIDE navigates the vast space of possible so
|
|
253 |
|
254 |
If you use AIDE in your work, please cite the following paper:
|
255 |
```bibtex
|
256 |
-
@
|
257 |
title={AIDE: AI-Driven Exploration in the Space of Code},
|
258 |
author={Zhengyao Jiang and Dominik Schmidt and Dhruv Srikanth and Dixing Xu and Ian Kaplan and Deniss Jacenko and Yuxiang Wu},
|
259 |
year={2025},
|
|
|
1 |
+
<h1 align="center">AIDE: The Machine Learning Engineer Agent</h1>
|
2 |
|
3 |
+
<p align="center">
|
4 |
+
📑 <a href="https://arxiv.org/abs/2502.13138">Paper</a>   |   📝 <a href="https://www.weco.ai/blog/technical-report">Blog</a>   |   🌐 <a href="https://www.aide.ml">Project</a>
|
5 |
+
</p>
|
|
|
|
|
6 |
|
7 |
+
<p align="center">
|
8 |
+
<a href="https://www.python.org/downloads/release/python-3100/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python 3.10+"></a>
|
9 |
+
<a href="https://pypi.org/project/aideml/"><img src="https://img.shields.io/pypi/v/aideml?color=blue" alt="PyPI"></a> 
|
10 |
+
<a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License: MIT"></a> 
|
11 |
+
<a href="https://discord.gg/Rq7t8wnsuA"><img src="https://dcbadge.vercel.app/api/server/Rq7t8wnsuA?compact=true&style=flat" alt="Discord"></a> 
|
12 |
+
<a href="https://twitter.com/WecoAI"><img src="https://img.shields.io/twitter/follow/WecoAI?style=social" alt="Twitter Follow"></a> 
|
13 |
+
</p>
|
14 |
|
15 |
+
AIDE is an LLM agent that generates solutions for machine learning tasks just from natural language descriptions of the task.
|
|
|
|
|
16 |
|
17 |
In our own benchmark composed of over 60 Kaggle data science competitions, AIDE demonstrated impressive performance, surpassing 50% of Kaggle participants on average.
|
18 |
|
19 |
+
OpenAI's [MLE-bench](https://arxiv.org/pdf/2410.07095), a benchmark composed of 75 Kaggle machine learning tasks, shows that AIDE achieved four times more medals compared to the runner-up agent architecture.
|
20 |
+
|
21 |
+
METR's [RE-Bench](https://arxiv.org/pdf/2411.15114) shows that AIDE is not only capable at machine learning tasks but generalizes to the AI R&D tasks such as optimizing low level Triton kernels and finetuning GPT-2 for QA, even surpassing the performance of human experts.
|
22 |
+
|
23 |
More specifically, AIDE has the following features:
|
24 |
|
25 |
1. **Instruct with Natural Language**: Describe your problem or additional requirements and expert insights, all in natural language.
|
|
|
259 |
|
260 |
If you use AIDE in your work, please cite the following paper:
|
261 |
```bibtex
|
262 |
+
@article{aide2025,
|
263 |
title={AIDE: AI-Driven Exploration in the Space of Code},
|
264 |
author={Zhengyao Jiang and Dominik Schmidt and Dhruv Srikanth and Dixing Xu and Ian Kaplan and Deniss Jacenko and Yuxiang Wu},
|
265 |
year={2025},
|