
AIDE: Autonomous AI for Data Science

Welcome to the official repository for AIDE, an AI system that automatically solves data science tasks at a human level and performs even better with human input. We believe that giving developers and researchers direct access to AIDE, running locally on their own compute with their own LLM keys, is the most straightforward way to make it useful. That is why we will open-source it; the tentative timeline is before the end of April. For now, this repository serves as a gallery showcasing its solutions to the 60+ Kaggle competitions we tested.

About AIDE

AIDE is an AI-powered data science assistant that can autonomously understand task requirements, then design and implement solutions. By combining large language models with innovative agent architectures, such as the Solution Space Tree Search algorithm, AIDE has achieved human-level performance on a wide range of data science tasks, outperforming over 50% of human data scientists on Kaggle competitions.

Gallery

| Domain | Task | Top% | Solution Link | Competition Link |
|---|---|---|---|---|
| Urban Planning | Forecast city bikeshare system usage | 0.05 | link | link |
| Physics | Predict critical heat flux | 0.56 | link | link |
| Genomics | Classify bacteria species from genomic data | 0.0 | link | link |
| Agriculture | Predict blueberry yield | 0.58 | link | link |
| Healthcare | Predict disease prognosis | 0.0 | link | link |
| Economics | Predict monthly microbusiness density in a given area | 0.35 | link | link |
| Cryptography | Decrypt Shakespearean text | 0.91 | link | link |
| Data Science Education | Predict passenger survival on Titanic | 0.78 | link | link |
| Software Engineering | Predict defects in C programs given various attributes about the code | 0.0 | link | link |
| Real Estate | Predict the final price of homes | 0.05 | link | link |
| Real Estate | Predict house sale price | 0.36 | link | link |
| Entertainment Analytics | Predict movie worldwide box office revenue | 0.62 | link | link |
| Entertainment Analytics | Predict scoring probability in next 10 seconds of a Rocket League match | 0.21 | link | link |
| Environmental Science | Predict air pollution levels | 0.12 | link | link |
| Environmental Science | Classify forest categories using cartographic variables | 0.55 | link | link |
| Computer Vision | Predict the probability of machine failure | 0.32 | link | link |
| Computer Vision | Identify handwritten digits | 0.14 | link | link |
| Manufacturing | Predict missing values in dataset | 0.7 | link | link |
| Manufacturing | Predict product failures | 0.48 | link | link |
| Manufacturing | Cluster control data into different control states | 0.96 | link | link |
| Natural Language Processing | Classify toxic online comments | 0.78 | link | link |
| Natural Language Processing | Predict passenger transport to an alternate dimension | 0.59 | link | link |
| Natural Language Processing | Classify sentence sentiment | 0.42 | link | link |
| Natural Language Processing | Predict whether a tweet is about a real disaster | 0.48 | link | link |
| Business Analytics | Predict total sales for each product and store in the next month | 0.87 | link | link |
| Business Analytics | Predict book sales for 2021 | 0.66 | link | link |
| Business Analytics | Predict insurance claim amount | 0.8 | link | link |
| Business Analytics | Minimize penalty cost in scheduling families to Santa's workshop | 1.0 | link | link |
| Business Analytics | Predict yearly sales for learning modules | 0.26 | link | link |
| Business Analytics | Binary classification of manufacturing machine state | 0.6 | link | link |
| Business Analytics | Forecast retail store sales | 0.36 | link | link |
| Business Analytics | Predict reservation cancellation | 0.54 | link | link |
| Finance | Predict the probability of an insurance claim | 0.13 | link | link |
| Finance | Predict loan loss | 0.0 | link | link |
| Finance | Predict a continuous target | 0.42 | link | link |
| Finance | Predict customer churn | 0.24 | link | link |
| Finance | Predict median house value | 0.58 | link | link |
| Finance | Predict closing price movements for Nasdaq-listed stocks | 0.99 | link | link |
| Finance | Predict taxi fare | 1.0 | link | link |
| Finance | Predict insurance claim probability | 0.62 | link | link |
| Biotech | Predict cat in dat | 0.66 | link | link |
| Biotech | Predict the biological response of molecules | 0.62 | link | link |
| Biotech | Predict medical conditions | 0.92 | link | link |
| Biotech | Predict wine quality | 0.61 | link | link |
| Biotech | Predict binary target without overfitting | 0.98 | link | link |
| Biotech | Predict concrete strength | 0.86 | link | link |
| Biotech | Predict crab age | 0.46 | link | link |
| Biotech | Predict enzyme characteristics | 0.1 | link | link |
| Biotech | Classify activity state from sensor data | 0.51 | link | link |
| Biotech | Predict horse health outcomes | 0.86 | link | link |
| Biotech | Predict the Mohs hardness of a mineral | 0.64 | link | link |
| Biotech | Predict cirrhosis patient outcomes | 0.51 | link | link |
| Biotech | Predict obesity risk | 0.62 | link | link |
| Biotech | Classify presence of feature in data | 0.66 | link | link |
| Biotech | Predict patient's smoking status | 0.4 | link | link |
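
For readers who want to summarize the Top% column themselves, here is a minimal Python sketch. It assumes that Top% is a leaderboard fraction between 0 and 1 where lower is better (so 0.05 would mean the top 5%); the gallery does not state this explicitly. The helper name `share_below` and the 0.5 threshold are illustrative choices, and the six rows are a small sample copied from the gallery, so the numbers it prints describe that sample only, not the full set of competitions.

```python
from statistics import median

# A handful of (task, top_fraction) pairs copied from the gallery.
# Assumption: Top% is a fraction in [0, 1] and lower means a better rank.
rows = [
    ("Forecast city bikeshare system usage", 0.05),
    ("Predicting Critical Heat Flux", 0.56),
    ("Classify bacteria species from genomic data", 0.0),
    ("Predict blueberry yield", 0.58),
    ("Predict disease prognosis", 0.0),
    ("Identify handwritten digits", 0.14),
]


def share_below(rows, threshold=0.5):
    """Return the fraction of tasks whose Top% is strictly below `threshold`."""
    hits = [task for task, top in rows if top < threshold]
    return len(hits) / len(rows)


share = share_below(rows)                # tasks ranked in the upper half
med = median(top for _, top in rows)     # median Top% over the sample
print(f"share below 0.5: {share:.2f}, median Top%: {med:.3f}")
```

To run the same summary over the whole gallery, extend `rows` with the remaining entries.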