A Gradio web interface for text generation (Gemma 3 4B as the default model).
- NVIDIA GPU (8GB+ VRAM)
- Python 3.11+
- CUDA 12.1+
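A quick way to sanity-check the interpreter against the Python requirement above (a minimal sketch; the helper function and its name are illustrative, not part of the project):

```python
import sys

def meets_minimum(version, minimum=(3, 11)):
    """Return True if a (major, minor) version tuple satisfies the minimum."""
    return tuple(version[:2]) >= minimum

# Check the interpreter running this script.
print(meets_minimum(sys.version_info))
```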
- Clone the repository:

```shell
git clone https://github.com/vpakarinen2/llm-text-gradio-webui.git
cd llm-text-gradio-webui
```
- Create/activate virtual environment:

```shell
python -m venv .venv

# Windows
.venv\Scripts\activate

# Linux/macOS
source .venv/bin/activate
```
- Install PyTorch with CUDA:

```shell
pip install torch --index-url https://download.pytorch.org/whl/cu121
```
- Install dependencies:

```shell
pip install -r requirements.txt --no-deps
```
- Create a `.env` file:

```env
EMBED_MODEL_ID=sentence-transformers/all-MiniLM-L6-v2
GRADIO_ANALYTICS_ENABLED=False
MAX_NEW_TOKENS=128
DEVICE=cuda
```
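A server typically reads these values at startup (often via `python-dotenv`). A minimal sketch of how the settings above might be parsed with only the standard library — the variable names come from the file above, but the loader function and its defaults are assumptions:

```python
import os

def load_settings(env=os.environ):
    """Read the .env-style settings, falling back to the documented defaults."""
    return {
        "embed_model_id": env.get(
            "EMBED_MODEL_ID", "sentence-transformers/all-MiniLM-L6-v2"
        ),
        # Gradio expects a boolean; treat anything but "true" as False.
        "gradio_analytics_enabled": env.get(
            "GRADIO_ANALYTICS_ENABLED", "False"
        ).lower() == "true",
        "max_new_tokens": int(env.get("MAX_NEW_TOKENS", "128")),
        "device": env.get("DEVICE", "cuda"),
    }
```

Passing a plain dict instead of `os.environ` makes the loader easy to unit-test.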
- Create a Hugging Face access token (Settings → Access Tokens).
- Log in:

```shell
huggingface-cli login
```
- Run the server:

```shell
python -m app.server
```
Ville Pakarinen (@vpakarinen2)

