RAG System with Anthropic API (Claude), FastAPI Web Interface, Paragraph-based Chunking, and PDF Upload

Features

✅ Paragraph-based chunking
✅ PDF upload support
✅ Prompt caching (up to 90% lower cost for cached input tokens)
✅ Smart relevance filtering
✅ Simple Web UI
✅ Qdrant as vector database
✅ Type-safe FastAPI routes
✅ Marked.js for Markdown rendering
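
To make the "paragraph-based chunking" feature concrete, here is a minimal sketch of one way to do it. It is not taken from the app's source: the function name `chunk_paragraphs` and the `max_chars` limit are hypothetical, and the real implementation may split or size chunks differently.

```python
import re


def chunk_paragraphs(text: str, max_chars: int = 1000) -> list[str]:
    """Split text on blank lines, then pack whole paragraphs into chunks.

    Hypothetical helper for illustration; a paragraph is never cut in half,
    so a single paragraph longer than max_chars becomes its own chunk.
    """
    # A paragraph boundary is one or more blank (or whitespace-only) lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]

    chunks: list[str] = []
    current = ""
    for para in paragraphs:
        candidate = f"{current}\n\n{para}" if current else para
        if current and len(candidate) > max_chars:
            # Adding this paragraph would exceed the limit: close the chunk.
            chunks.append(current)
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Keeping paragraphs intact tends to give the embedding model self-contained passages, at the price of uneven chunk sizes.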

Install

python -m venv rag-app

source rag-app/bin/activate

App version using sentence-transformers:

pip install fastapi uvicorn python-multipart anthropic sentence-transformers qdrant-client pypdf

App version using transformers directly:

pip install fastapi uvicorn python-multipart anthropic transformers torch qdrant-client pypdf

App version using transformers with custom progress bar:

pip install fastapi uvicorn python-multipart anthropic transformers torch qdrant-client pypdf tqdm

Run

Export the Anthropic API Key:

export ANTHROPIC_API_KEY='your-api-key'

Run app version using sentence-transformers:

uvicorn app_qdrant_fastapi:app --reload

Run app version using transformers directly:

uvicorn app_qdrant_fastapi_tf:app --reload

Run app version using transformers with custom progress bar:

uvicorn app_qdrant_fastapi_tf_prog:app --reload

Run only the command for the version you want to start.

Note

When you enable the progress bar, you will notice a warning like this on shutdown:

/home/stahlhe2/.local/share/pypoetry/python/cpython@3.12.9/lib/python3.12/multiprocessing/resource_tracker.py:255: UserWarning: resource_tracker: There appear to be 1 leaked semaphore objects to clean up at shutdown
 warnings.warn('resource_tracker: There appear to be %d '

The semaphore leak warning is a known issue with the sentence-transformers and transformers libraries when they use multiprocessing. You can safely ignore it: the OS still frees the resources when the process exits.

Note

Instead of a local on-disk Qdrant database, you can use an in-memory Qdrant instance for development. Change

self.qdrant_client = QdrantClient(path="./qdrant_db")

to

self.qdrant_client = QdrantClient(":memory:")

Alternatively, run Qdrant in a Docker container for persistence (https://qdrant.tech/documentation/quick_start/) and connect to it by URL:

docker run -p 6333:6333 -v "$(pwd)/qdrant_storage:/qdrant/storage" qdrant/qdrant

self.qdrant_client = QdrantClient(url="http://localhost:6333")
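
If you switch between these three setups often, one option is to pick the client arguments from environment variables instead of editing the source. The sketch below is an assumption, not part of the app: the variable names `QDRANT_MODE`, `QDRANT_PATH`, and `QDRANT_URL` and the helper `qdrant_client_kwargs` are hypothetical. It only relies on the `path`, `location`, and `url` arguments of `QdrantClient` shown above.

```python
import os
from typing import Mapping, Optional


def qdrant_client_kwargs(env: Optional[Mapping[str, str]] = None) -> dict:
    """Return keyword arguments for QdrantClient based on QDRANT_MODE.

    Hypothetical helper: "local" (default) uses an on-disk database,
    "memory" an in-memory instance, "server" a running Qdrant container.
    """
    env = os.environ if env is None else env
    mode = env.get("QDRANT_MODE", "local")
    if mode == "local":
        return {"path": env.get("QDRANT_PATH", "./qdrant_db")}
    if mode == "memory":
        return {"location": ":memory:"}
    if mode == "server":
        return {"url": env.get("QDRANT_URL", "http://localhost:6333")}
    raise ValueError(f"Unknown QDRANT_MODE: {mode!r}")


# In the app (requires qdrant-client to be installed):
# from qdrant_client import QdrantClient
# self.qdrant_client = QdrantClient(**qdrant_client_kwargs())
```

With this in place, `QDRANT_MODE=memory uvicorn app_qdrant_fastapi:app --reload` would start the app against a throwaway in-memory instance.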

Open the browser:

Open http://localhost:8000 in your browser

Documentation

The Swagger/OpenAPI docs are available at

http://localhost:8000/docs

Optimizations

You can optimize the startup time by skipping the example documents. Just comment out this line:

rag.add_documents(example_docs)
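
Rather than commenting the line in and out, you could guard it with an environment flag. This is only a sketch under assumptions: the variable name `RAG_LOAD_EXAMPLES` and the helper `load_examples_enabled` are hypothetical and do not exist in the app.

```python
import os
from typing import Mapping, Optional

# Values that switch the example documents off (case-insensitive).
FALSY = {"0", "false", "no", "off"}


def load_examples_enabled(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return False only if RAG_LOAD_EXAMPLES is set to a falsy value."""
    env = os.environ if env is None else env
    return env.get("RAG_LOAD_EXAMPLES", "1").strip().lower() not in FALSY


# In the app, the existing call would then become:
# if load_examples_enabled():
#     rag.add_documents(example_docs)
```

Starting the server with `RAG_LOAD_EXAMPLES=0` would then skip the example documents, while the default behaviour stays unchanged.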
