π Transform months of research into minutes of insight
English | δΈζ
ARIA is an automated research assistant framework for scientific data analysis, visualization, and report generation.
aria/
βββ .claude/commands/ # Claude AI command files
β βββ academic/ # Academic workflow commands
β βββ git/ # Git operation commands
β βββ python/ # Python environment commands
βββ data/
β βββ raw/ # Original data
β βββ processed/ # Preprocessed data
β βββ output/ # Experiment outputs
β βββ models/ # Trained models
β βββ results/ # Experimental results
β βββ figures/ # Visualization charts
β βββ logs/ # Execution logs
βββ docs/ # Project documentation
βββ src/ # Source code
βββ scripts/ # Execution scripts
βββ GETTING_STARTED.md # Quick start guide
βββ INSTALL.md # Installation guide
βββ README.md # This file
- Place raw data in the
data/raw/folder - Manually create
docs/01-basic-information.mdto describe project background, objectives, and data overview
- Raw Data Analysis: Use
@raw-data-analysis.mdcommand to analyze raw data, generatingdocs/02-raw-data-analysis.md - Data Preprocessing: Use
@preprocess.mdcommand to design preprocessing plan (docs/03-preprocess-plan.md), execute preprocessing, analyze processed data (docs/04-processed-data-analysis.md)
- Research Plan: Use
@research-plan.mdcommand to develop research plan including feature engineering, model selection, evaluation metrics, generatingdocs/05-research-plan.md
- Code Development: Use
@code-implementation.mdcommand to implement research plan, creating necessary Python modules - Generate
docs/06-implementation-docs.md(implementation documentation) anddocs/07-execution-instructions.md(execution guide) - Code quality check: Use mypy and ruff to ensure code quality
- Run Experiments: Use
@run-experiments.mdcommand to execute experiment scripts - Outputs saved to corresponding subfolders in
output/directory
- Results Analysis: Use
@experiment-analysis.mdcommand to analyze experiment outputs - Generate individual analyses in
docs/08-experiment-results/directory - Generate
docs/09-experiment-report.mdcomprehensive experiment report
- Academic Paper: Use
@research-report.mdcommand to generate high-impact journal format paper - Generate
docs/10-manuscript.md(main text),docs/10-manuscript-supplement.md(supplementary materials),docs/10-cover-letter.md(cover letter)
- Gradio Interface: If models are trained, use
@gradio-app.mdcommand to create model inference interface - Generate
docs/11-model-deployment-guide.mddeployment guide
All academic command files are located in .claude/commands/academic/ directory:
@raw-data-analysis.md- Analyze raw data@preprocess.md- Data preprocessing@research-plan.md- Research plan design@code-implementation.md- Code implementation@run-experiments.md- Experiment execution@experiment-analysis.md- Results analysis@research-report.md- Academic paper generation@gradio-app.md- Model deployment interface@convert2docs.md- Convert markdown to DOCX (optional)
Python command files are located in .claude/commands/python/ directory:
@setup-environment.md- Automated environment setup- Checks and installs Git, Python 3.12+, and UV package manager
- Sets up project dependencies with
uv sync - Includes Tsinghua mirror fallback for network issues
- Provides complete environment verification
Git command files are located in .claude/commands/git/ directory:
@git-commit.md- Intelligent Git commits- Automatically creates structured commit messages
- When there are many modified files, automatically commits in batches (max 10 files per commit)
- Generates meaningful commit descriptions based on code changes
Complete example projects demonstrating ARIA workflow across different research tasks:
- π aria-example-buston - Boston housing price prediction (regression) | OpenML Dataset
- π aria-example-diamonds - Diamond price prediction (regression) | OpenML Dataset
- π aria-example-kc1 - Software defect prediction (classification) | OpenML Dataset
- π§© aria-example-sat11 - SAT solver performance prediction (regression) | OpenML Dataset
All datasets are from OpenML, an open machine learning platform. Each project includes complete documentation, production-ready code, trained models, and academic manuscripts.
New to ARIA? Start here: Getting Started Guide π
The complete guide covers:
- Installing Git and AI code editor (Cursor/VSCode/Lingma IDE)
- Automated environment setup with
@setup-environment.md - Step-by-step workflow from data to paper
- Troubleshooting and best practices
- Setup Environment: Use
@setup-environment.mdto configure Python and dependencies - Prepare Data: Place raw data in
data/raw/ - Create Project Description: Write
docs/01-basic-information.md - Execute Workflow: Use Claude AI to execute academic commands in sequence
- Version Control: Use
@git-commit.mdfor intelligent commits
π Full Documentation: See GETTING_STARTED.md and INSTALL.md
Using UV for Python dependency management:
uv add <package-name> # Add new dependency
uv sync # Synchronize dependencies- Type checking:
mypy src/ - Code style:
ruff check src/ - Code formatting:
ruff format src/
- π Data-Driven: Complete workflow from raw data to academic paper
- π€ AI-Assisted: Claude AI commands automate each research phase
- π Academic Standards: Generate Nature/Science standard papers
- π― Quality Assurance: Integrated code quality checking tools
- π Model Deployment: Support for Gradio interface rapid deployment
- π¦ Version Control: Intelligent Git commit management
This project is dual-licensed:
- GNU AGPL-3.0 - for open source use (personal, research, education, open source projects)
- Commercial License - for proprietary/commercial use (contact for licensing)
See the LICENSE file for details.
Contributions are welcome! Please feel free to submit Issues and Pull Requests.
For questions, suggestions, or collaboration opportunities:
- GitHub Issues: Create an issue
- Email: biao00luo@gmail.com
- Project: ARIA on GitHub
