AstroBio Explorer is our AI-powered research assistant built for the NASA Space Apps Challenge 2025(Build a Space Biology Knowledge Engine). It helps anyone explore NASAβs space biology research in a simple, conversational and interactive way.
NASA has been studying how life behaves beyond Earth for decades. From how plants grow in space to how microgravity affects the human body, the knowledge is incredible. But itβs buried inside hundreds of research papers that are hard to search and connect.
We wanted to make that exploration feel natural, like having a conversation with a scientist who knows exactly where to look. Thatβs why we built AstroBio Explorer.
AstroBio Explorer turns complex scientific literature into a friendly research companion. Hereβs what you can do inside it:
- π§ Ask research questions in plain language and get clear, well-grounded answers from NASA bioscience papers
- π See inline citations with clickable links to original publications
- π Explore related Google Scholar papers using SerpAPI
- πΌ View relevant images tied to your question in a sidebar
- π¬ Keep multiple research threads open using multi-session chat
It doesnβt just give you a block of text. It brings together papers, citations, images and related work so you can truly explore a topic.
π Watch the Demo on YouTube
We structured AstroBio Explorer into three main parts working together.
- Built with FastAPI, LangChain, and Gemini 2.5 Flash
- Parsed and chunked 608 NASA bioscience articles at the sentence level
- Generated dense and sparse embeddings and stored them in Qdrant for fast retrieval
- Used a small agent system to combine the RAG pipeline with Google Scholar and image search, merging multiple sources for each response
- Built with Next.js and Tailwind CSS
- Clean chat interface with SCSS and Haml animations for a smooth space-themed feel
- Citations appear inline inside chat bubbles
- Images are shown in a sidebar for visual context using Image Fetch API(Serp API)
- Includes a Google Scholar button to fetch related papers(Serp API)
- Supports multiple chat sessions so users can explore different topics simultaneously
- Vector DB: Qdrant, storing each chunk with metadata like paper ID, section, and URL for precise citation linking
- Context + session storage: For multi-turn conversations and persistent threads
- Finding the right chunk size to keep answers precise without slowing the system
- Placing inline citations at exactly the right spots in the generated text
- Coordinating Gemini, Qdrant and SerpAPI in a single pipeline
- Designing a UI that could handle citations, media and multiple threads without feeling cluttered
- Built a working research assistant that connects NASA data, Google Scholar and images in one conversational interface
- Added inline citations that link directly to real papers
- Enabled multi-session research chats for parallel exploration
- Designed a modular backend that can easily be extended with more tools later
- How to combine different AI components into a smooth, useful experience
- The importance of chunking and retrieval when working with real scientific literature
- How to shape LLM outputs into structured answers that are actually useful for research
- How to design frontend components that gracefully handle citations, references and visual content
- A Research Mode with longer, more detailed answers and expanded context
- Visual knowledge graphs that show how concepts connect across studies
- Hypothesis generation and exportable research briefs
- Direct integration with NASAβs open data repositories for richer exploration
- Next.js
- Tailwind CSS
- SCSS + Haml animations
- FastAPI
- LangChain
- Gemini 2.5 Flash
- Qdrant
- SerpAPI (Google Scholar + Image Fetch)
We tested AstroBio Explorer on real NASA space biology literature to measure how well it retrieves, cites and answers scientific questions.
- 181 questions
- 42 research papers
- Evaluation measured retrieval, answer quality and citation accuracy
- Overall Score: 72% (Weighted across retrieval, answer quality, and citation accuracy)
| Component | Score | Notes |
|---|---|---|
| Answer Quality | 83% | Excellent factual alignment with expert answers |
| Citation Accuracy | 70% | Good relevance and proper linking |
| Retrieval Quality | 59% | Good but room for broader document recall |
Formula:
Overall = 35% Γ Retrieval + 45% Γ Answer Quality + 20% Γ Citations
| Question Type | Score |
|---|---|
| Specific questions | 79% |
| Comparative analysis | 77% |
| Factual queries | 73% |
| Complex reasoning | 69% |
| Broad overviews | 60% |
AstroBio Explorer excels at targeted scientific questions, which is where researchers benefit most.
- Medical case studies β 89%
- Space agriculture research β 86%
- Plant imaging systems β 88%
- FAIR data systems β 88%
Create a .env file for the backend and a .env.local for the frontend.
Do not commit these files to Git. If you ever push them by mistake, rotate the keys immediately.
# LLM
GEMINI_API_KEY=your_gemini_api_key
MODEL_NAME=gemini-2.5-flash
# Vector DB
QDRANT_URL=https://your-qdrant-host:6333
QDRANT_API_KEY=your_qdrant_api_key
QDRANT_COLLECTION=space_bio_chunks
# External search
SERPAPI_API_KEY=your_serpapi_key
# CORS
ALLOW_ORIGINS=http://localhost:3000# Your API base for local dev
NEXT_PUBLIC_API_BASE_URL=http://localhost:8000
Make sure your root .gitignore includes:
.env
.env.local
**/.env
**/.env.local
cd backend
python -m venv venvmacOS or Linux:
source venv/bin/activateWindows PowerShell:
venv\Scripts\Activate.ps1pip install -r requirements.txtMake sure backend/.env exists with your keys. If you use a process manager it will load automatically. For local shells, you can also export manually if needed.
macOS or Linux:
export $(grep -v '^#' .env | xargs)Windows PowerShell:
Get-Content .env | ForEach-Object {
if ($_ -and $_ -notmatch '^\s*#') {
$name, $value = $_.Split('=',2)
[Environment]::SetEnvironmentVariable($name.Trim(), $value.Trim())
}
}python main.pyThe backend will be available at:
http://localhost:8000
cd ../frontend
npm installnpm run devThe frontend will be available at:
http://localhost:3000
.
βββ backend # FastAPI + LangChain + Gemini chatbot + RAG pipeline
βββ frontend # Next.js + Tailwind CSS UI
βββ evaluation # Evaluation scripts, metrics and results
AstroBio Explorer is more than a chatbot. Itβs a research companion designed to make space bioscience accessible and explorable. Whether youβre a scientist, a student or just curious about how life thrives beyond Earth, AstroBio Explorer lets you ask real questions and get meaningful, research-backed answers.