Retrieval-Augmented Generation system for chest X-ray images )
unzip chroma_db/db.zip
docker-compose up --buildFastAPI: http://localhost:8000/docs
- LLM
Gemma 3 4B (Optional) - Embedding Model
google/medsiglip-448 (without fine-tuning) - Vector Database
Chroma - Web Framework
FastAPI
| Metric | Value |
|---|---|
| Precision | 0.28 |
| Recall | 0.58 |
| F1-Score | 0.38 |
-
User uploads chest X-ray
→ processed insrc/app/main.py -
Image → Embedding
- Model: MedSigLIP-448
- Code:
src/embedding/model.py+img2emb.py
→ dense vector
-
Similarity Search
- Query Chroma vector store
- Code:
src/vectorstore/chroma.py+retriever.py
→ top-k similar images + metadata
-
LLM Inference
- Input: new image + context prompt
- Model: Gemma 3 4B
- Code:
src/llm/ask.py
→ diagnosis, explanation, confidence, differential diagnosis
Dataset: https://www.kaggle.com/datasets/simhadrisadaram/mimic-cxr-dataset


