dSrikanth commited on
Commit
83d9156
·
1 Parent(s): cf15688

Format information in README

Browse files
Files changed (1) hide show
  1. README.md +17 -11
README.md CHANGED
@@ -1,19 +1,25 @@
1
- # AIDE: the Machine Learning Engineer Agent
2
 
3
- [![License: MIT](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT) 
4
- [![PyPI](https://img.shields.io/pypi/v/aideml?color=blue)](https://pypi.org/project/aideml/) 
5
- [![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/release/python-3100/)
6
- [![Discord](https://dcbadge.vercel.app/api/server/Rq7t8wnsuA?compact=true&style=flat)](https://discord.gg/Rq7t8wnsuA) 
7
- [![Twitter Follow](https://img.shields.io/twitter/follow/WecoAI?style=social)](https://twitter.com/WecoAI) 
8
 
9
- AIDE is an LLM agent that generates solutions for machine learning tasks just from natural language descriptions of the task. This repository implements the AIDE agent described in our paper - [AIDE: AI-Driven Exploration in the Space of Code](https://arxiv.org/pdf/2502.13138). We recommend to check out the [project page](https://www.aide.ml) and [technical report](https://www.weco.ai/blog/technical-report) for a quick summary of the method and results.
 
 
 
 
 
 
10
 
11
- AIDE is the state-of-the-art agent on OpenAI's [MLE-bench](https://arxiv.org/pdf/2410.07095), a benchmark composed of 75 Kaggle machine learning tasks, where we achieved four times more medals compared to the runner-up agent architecture.
12
-
13
- METR's [RE-Bench](https://arxiv.org/pdf/2411.15114) shows that AIDE is not only capable at machine learning tasks but generalizes to the AI R&D tasks such as optimizing low level Triton kernels and finetuning GPT-2 for QA, even surpassing the performance of human experts.
14
 
15
  In our own benchmark composed of over 60 Kaggle data science competitions, AIDE demonstrated impressive performance, surpassing 50% of Kaggle participants on average.
16
 
 
 
 
 
17
  More specifically, AIDE has the following features:
18
 
19
  1. **Instruct with Natural Language**: Describe your problem or additional requirements and expert insights, all in natural language.
@@ -253,7 +259,7 @@ By repeatedly applying these steps, AIDE navigates the vast space of possible so
253
 
254
  If you use AIDE in your work, please cite the following paper:
255
  ```bibtex
256
- @misc{aide2025,
257
  title={AIDE: AI-Driven Exploration in the Space of Code},
258
  author={Zhengyao Jiang and Dominik Schmidt and Dhruv Srikanth and Dixing Xu and Ian Kaplan and Deniss Jacenko and Yuxiang Wu},
259
  year={2025},
 
1
+ <h1 align="center">AIDE: The Machine Learning Engineer Agent</h1>
2
 
3
+ <p align="center">
4
+ 📑 <a href="https://arxiv.org/abs/2502.13138">Paper</a>&nbsp&nbsp | &nbsp&nbsp📝 <a href="https://www.weco.ai/blog/technical-report">Blog</a>&nbsp&nbsp | &nbsp&nbsp🌐 <a href="https://www.aide.ml">Project</a>
5
+ </p>
 
 
6
 
7
+ <p align="center">
8
+ <a href="https://www.python.org/downloads/release/python-3100/"><img src="https://img.shields.io/badge/python-3.10+-blue.svg" alt="Python 3.10+"></a>
9
+ <a href="https://pypi.org/project/aideml/"><img src="https://img.shields.io/pypi/v/aideml?color=blue" alt="PyPI"></a>&ensp;
10
+ <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-green.svg" alt="License: MIT"></a>&ensp;
11
+ <a href="https://discord.gg/Rq7t8wnsuA"><img src="https://dcbadge.vercel.app/api/server/Rq7t8wnsuA?compact=true&style=flat" alt="Discord"></a>&ensp;
12
+ <a href="https://twitter.com/WecoAI"><img src="https://img.shields.io/twitter/follow/WecoAI?style=social" alt="Twitter Follow"></a>&ensp;
13
+ </p>
14
 
15
+ AIDE is an LLM agent that generates solutions for machine learning tasks just from natural language descriptions of the task.
 
 
16
 
17
  In our own benchmark composed of over 60 Kaggle data science competitions, AIDE demonstrated impressive performance, surpassing 50% of Kaggle participants on average.
18
 
19
+ OpenAI's [MLE-bench](https://arxiv.org/pdf/2410.07095), a benchmark composed of 75 Kaggle machine learning tasks, shows that AIDE achieved four times more medals compared to the runner-up agent architecture.
20
+
21
+ METR's [RE-Bench](https://arxiv.org/pdf/2411.15114) shows that AIDE is not only capable at machine learning tasks but generalizes to the AI R&D tasks such as optimizing low level Triton kernels and finetuning GPT-2 for QA, even surpassing the performance of human experts.
22
+
23
  More specifically, AIDE has the following features:
24
 
25
  1. **Instruct with Natural Language**: Describe your problem or additional requirements and expert insights, all in natural language.
 
259
 
260
  If you use AIDE in your work, please cite the following paper:
261
  ```bibtex
262
+ @article{aide2025,
263
  title={AIDE: AI-Driven Exploration in the Space of Code},
264
  author={Zhengyao Jiang and Dominik Schmidt and Dhruv Srikanth and Dixing Xu and Ian Kaplan and Deniss Jacenko and Yuxiang Wu},
265
  year={2025},