This is a FastAPI and Ray backend for QueryLake.
This repo now recommends using uv for Python environments + lockfiles.
You must have CUDA installed (and an appropriate NVIDIA driver) to run local models. We recommend the Lambda Stack.
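Before going further, you can verify that a GPU and driver are visible. This is a quick sanity check, not QueryLake-specific; `nvcc` is only present if the CUDA toolkit itself is installed:

```bash
# Confirm the driver can see the GPU(s)
nvidia-smi

# Report the CUDA toolkit version (only works if the toolkit is installed)
nvcc --version
```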
Create a virtual environment and sync the locked dependencies:

```bash
uv venv --python 3.12
uv sync
```
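By default, `uv venv` places the environment in `.venv`. You can either activate it or prefix commands with `uv run`:

```bash
# Option 1: activate the environment
source .venv/bin/activate

# Option 2: run commands through uv without activating
uv run python --version
```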
Optional extras:

- `uv sync --extra cli` (enables the `setup.py` CLI helpers)
- `uv sync --extra inference-hf` (local HF/torch inference helpers)
- `uv sync --extra ocr` (OCR stack: Marker/Surya + OCRmyPDF)
- `uv sync --extra dev` (pytest tooling)
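If you want several extras at once, uv accepts repeated `--extra` flags. For example (an illustrative combination, not a required setup):

```bash
# Sync the OCR stack and the test tooling in one go
uv sync --extra ocr --extra dev
```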
Note: We intentionally keep vLLM as a separate runtime in production (run it as an upstream service and let QueryLake talk to it over HTTP). Use the `vllm` extra only for experiments.
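As a sketch of that upstream pattern (the model name and port below are placeholders chosen for illustration, and `vllm serve` assumes a recent vLLM release), vLLM's OpenAI-compatible server runs as its own process and is reachable over plain HTTP:

```bash
# Start vLLM's OpenAI-compatible server on its own port
vllm serve mistralai/Mistral-7B-Instruct-v0.3 --port 8001

# Any HTTP client (QueryLake included) can then talk to it
curl http://localhost:8001/v1/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "mistralai/Mistral-7B-Instruct-v0.3", "prompt": "Hello", "max_tokens": 16}'
```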
This is no longer the recommended path, but it is kept for compatibility:

```bash
conda create --name QueryLake python=3.10
conda activate QueryLake
pip install -r requirements.txt
```

One of the installed dependencies is exllamav2, which occasionally fails to build. To install it safely, build it from source by cloning the repo and doing the following:

```bash
git clone https://github.com/turboderp/exllamav2
cd exllamav2
pip install -r requirements.txt
pip install .
cd ../
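# Optional sanity check (our addition, assuming the build above succeeded):
# confirm the module imports before deleting the source tree
python -c "import exllamav2"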
rm -rf exllamav2
```

We currently support Tesseract for OCR. This requires installing Tesseract via apt like so:

```bash
sudo apt install tesseract-ocr
sudo apt install libtesseract-dev
```
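To confirm the install, Tesseract can report its version and the language packs it sees:

```bash
tesseract --version
tesseract --list-langs
```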
The database is a ParadeDB container. To initialize it, you must have Docker and Docker Compose installed (use these instructions). Once these are installed, you can run the following to start or completely reset the database:

```bash
./restart_database.sh
```
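To verify the database container came up, list the running containers (the container name is whatever docker-compose assigns, so we don't filter by name here):

```bash
# The ParadeDB container should appear in this list
docker ps
```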
To set up your models, run the setup.py CLI like so and follow the instructions:

```bash
python setup.py
```

We recommend using the presets for now, as custom model additions are under development.
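If you are using uv, the `setup.py` CLI helpers live behind the `cli` extra (see the extras list above), so a uv-managed invocation looks like:

```bash
uv sync --extra cli
uv run python setup.py
```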
We recommend starting a Ray head node first. This launches the Ray dashboard and may make it easier to connect Serve deployments in the future. You can do so as follows:

```bash
ray start --head --port=6379 --dashboard-host 0.0.0.0
```
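Once the head node is running, `ray status` reports the cluster state (and `ray stop` shuts the node down again):

```bash
ray status
```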
To start the server, run:

```bash
serve run server:deployment
```

Server settings are generated in config.json. The file can be modified to your preferred settings.
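As a quick smoke test (this assumes Ray Serve's default HTTP port of 8000 and that the FastAPI app exposes its OpenAPI docs at `/docs`; both may differ depending on your config.json), you can probe the endpoint:

```bash
# Expect a 200 if the server is up and serving docs at /docs
curl -s -o /dev/null -w "%{http_code}\n" http://localhost:8000/docs
```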