Skip to content

AutoResearch Lab: AI research paper generator using multi-agent collaboration. Scrapes arXiv papers, generates 2500-3000 word research papers with proper citations in under 1 minute. Free, open-source, works with any domain.

License

Notifications You must be signed in to change notification settings

vikashmehta292511/autoresearch-lab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AutoResearch Lab

Multi-Agent System for Autonomous Research Paper Generation

This is my capstone project for the Kaggle 5‑Day AI Agents Intensive Course. It demonstrates how I applied agent workflows, tools, memory and evaluation to build a practical, real‑world AI agent. It generates professional research papers on ANY topic using AI agents + web scraping.

Kaggle 5‑Day AI Agents Intensive Python 3.8+ Gemini API License: MIT


Features

  • 5 Specialized Agents working together
  • Web Scraping from arXiv for real papers
  • 2500-3000 Words publication-ready papers
  • Complete Pipeline from problem → hypothesis → methodology → analysis → paper
  • FREE uses Google Gemini API key
  • Saves to Repository in output/ folder

Architecture

User Input (Domain)
        ↓
[Problem Finder Agent] ← Scrapes arXiv papers
        ↓
[Hypothesis Generator Agent]
        ↓
[Experiment Designer Agent]
        ↓
[Data Analyst Agent]
        ↓
[Paper Writer Agent] ← Uses Gemini AI
        ↓
Output → research_paper.md

Quick Start

Prerequisites

  • Python 3.8+
  • Google Gemini API key

Installation

  1. Clone repository
git clone https://github.com/yourusername/autoresearch-lab.git
cd autoresearch-lab
  1. Install dependencies
pip install -r requirements.txt
  1. Get FREE Gemini API Key
  1. Create .env file
echo "GEMINI_API_KEY=your-key-here" > .env
  1. Run
python main.py
  1. Enter domain
Enter research domain: your research domain

Output Structure

output/research_20241201_143022/
├── research_paper.md          # paper
├── metadata.json             # Run information
└── pipeline_history.json     # Complete agent outputs

What Each Agent Does

1. Problem Finder Agent

  • Scrapes arXiv for recent papers
  • Analyzes 10 papers in the domain
  • Identifies research gaps
  • Extracts keywords

2. Hypothesis Generator Agent

  • Creates testable hypothesis
  • Identifies variables (IV, DV, CV)
  • Defines null/alternative hypotheses
  • Establishes research framework

3. Experiment Designer Agent

  • Designs research methodology
  • Specifies sample size
  • Defines experimental procedure
  • Outlines analysis plan

4. Data Analyst Agent

  • Simulates statistical analysis
  • Calculates p-values and effect sizes
  • Interprets results
  • Draws conclusions

5. Paper Writer Agent

  • Uses Gemini AI for 3000-word generation
  • Integrates all agent outputs
  • Follows academic structure
  • Includes 12-15 citations
  • Professional formatting

Configuration

Environment Variables

Create .env file:

GEMINI_API_KEY=your-gemini-api-key-here

Customize Paper Length

Edit agents/paper_writer.py :

"Length: 3500-4000 words"  # Change as needed

Customize Scraping

Edit agents/problem_finder.py :

max_results=20  # Fetch more papers

Cost

FREE!

  • Gemini 2.5 Flash:
  • arXiv API: FREE

Usage limits:

  • Gemini: 60 requests/minute (free tier)
  • arXiv: Unlimited

Performance

  • Generation Time: 30-60 seconds
  • Word Count: 2500-3000 words
  • Papers Scraped: 5-10 per run
  • Citations: 12-15 per paper

Project Structure

autoresearch-lab/
├── main.py                    # Entry point
├── requirements.txt           # Dependencies
├── .env                      # API keys
├── README.md                 # This file
│
├── agents/
│   ├── __init__.py
│   ├── problem_finder.py     # With arXiv scraping
│   ├── hypothesis_generator.py
│   ├── experiment_designer.py
│   ├── data_analyst.py
│   └── paper_writer.py       # Gemini 3000 words
│
├── utils/
│   ├── __init__.py
│   ├── memory_store.py       # Context management
│   └── logger.py             # Logging
│
└── output/                    # Generated papers
    └── research_YYYYMMDD_HHMMSS/
        ├── research_paper.md
        ├── metadata.json
        └── pipeline_history.json

Example Domains

You can try these domains:

  • Quantum Machine Learning
  • Climate Change Mitigation Strategies
  • CRISPR Gene Editing Ethics
  • Blockchain Scalability Solutions
  • Deep Learning for Medical Diagnosis
  • Renewable Energy Optimization
  • Space Debris Management
  • Cybersecurity in IoT Devices

Acknowledgments

  • Google Gemini - AI generation
  • arXiv - Research papers
  • Python community - Libraries and tools

Built as the Capstone Project for the 5‑Day AI Agents Intensive Course with Google (Kaggle‑Mentors). Thanks to the Kaggle community, mentors, and Google researchers for guidance and resources.


Made for researchers and students worldwide

Generate professional research papers in 60 seconds, not weeks.

About

AutoResearch Lab: AI research paper generator using multi-agent collaboration. Scrapes arXiv papers, generates 2500-3000 word research papers with proper citations in under 1 minute. Free, open-source, works with any domain.

Topics

Resources

License

Stars

Watchers

Forks

Languages