Overview

The Jane Austen Literary Assistant is an LLM-powered chatbot that uses retrieval-augmented generation (RAG) to answer questions about Jane Austen's works. The system indexes the texts listed under Features and retrieves relevant passages to ground its answers to user queries about plots, characters, themes, and literary analysis.
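
To make the retrieve-then-generate flow concrete, here is a minimal, self-contained sketch. It is not the project's code: a bag-of-words cosine similarity stands in for the embedding search that a vector database such as Chroma performs, the three sample passages are placeholder summaries, and the function names (`retrieve`, `build_prompt`) are hypothetical. In the real system the assembled prompt would be sent to an OpenAI model rather than printed.

```python
import math
import re
from collections import Counter

# Placeholder passages standing in for chunks stored in the vector database.
CHUNKS = [
    "Elizabeth Bennet refuses Mr. Darcy's first proposal at Hunsford.",
    "Emma Woodhouse tries to arrange a match for her friend Harriet Smith.",
    "Anne Elliot meets Captain Wentworth again eight years after their broken engagement.",
]


def tokenize(text: str) -> Counter:
    """Lowercase the text and count its word tokens."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def retrieve(question: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the question."""
    q = tokenize(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, tokenize(c)), reverse=True)
    return ranked[:k]


def build_prompt(question: str, context: list[str]) -> str:
    """Combine retrieved passages and the question into one prompt string."""
    joined = "\n".join(f"- {c}" for c in context)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{joined}\n\nQuestion: {question}"
    )


if __name__ == "__main__":
    question = "Why does Elizabeth refuse Darcy's proposal?"
    print(build_prompt(question, retrieve(question, CHUNKS, k=1)))
```

Keeping retrieval and prompt assembly as separate functions mirrors the two halves of RAG: the first can be swapped for a real vector search without touching the second.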

Live App: Jane Austen Literary Assistant

Features

Interactive question-answering interface
Pre-loaded demo questions for quick exploration
Comprehensive coverage of Jane Austen's major works: Pride and Prejudice (1813), Sense and Sensibility (1811), Emma (1815), Mansfield Park (1814), Persuasion (1817), Northanger Abbey (1817), Lady Susan (1871), Love and Friendship.

Technical Stack

Backend: Python, Flask
Frontend: HTML, CSS, Bootstrap
AI/ML: LangChain, OpenAI GPT
Vector Database: Chroma
Document Processing: LangChain Text Splitters
Containerization: Docker
Deployment: Render

Project Structure

JaneAustenChatBot/
├── LICENSE
├── README.md
├── data/
│   ├── processed/      # Processed text chunks
│   ├── vector_db/      # Vector embeddings database
│   └── raw/            # Original text files
├── src/
│   ├── __init__.py
│   ├── base.py         # Utility functions
│   ├── data_ingestion.py  # Text processing
│   └── data_preprocessing.py  # Vector database creation
├── templates/
│   └── index.html      # Web interface
├── requirements.txt
├── Dockerfile       
└── app.py            # Main Flask application

Setup and Installation

1. Clone the repository:

git clone https://github.com/aphdinh/JaneAustenChatBot.git
cd JaneAustenChatBot

2. Create and activate a virtual environment:

pyenv virtualenv 3.11.0 janeaustenchatbot
pyenv activate janeaustenchatbot

3. Install dependencies:

pip install -r requirements.txt

4. Set up environment variables:

Create a .env file in the root directory with your personal API keys or other secrets:

OPENAI_API_KEY=your_api_key_here
LANGCHAIN_API_KEY=your_api_key_here
LANGCHAIN_TRACING_V2=true
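
Before starting the app, it can help to confirm that the keys are actually set. The sketch below is one possible check using only the standard library; it is an assumption, not the project's code, and the project may instead load `.env` through a package such as python-dotenv. The helper names `parse_env_file` and `missing_keys` are hypothetical.

```python
import os
from pathlib import Path

# Keys taken from the .env example; treating both as required is an assumption.
REQUIRED_KEYS = ("OPENAI_API_KEY", "LANGCHAIN_API_KEY")


def parse_env_file(text: str) -> dict[str, str]:
    """Parse KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(env, required=REQUIRED_KEYS) -> list[str]:
    """Return the required keys that are absent or empty in env."""
    return [k for k in required if not env.get(k)]


if __name__ == "__main__":
    env_path = Path(".env")
    file_values = parse_env_file(env_path.read_text()) if env_path.exists() else {}
    # Real environment variables take precedence over values from the file.
    merged = {**file_values, **os.environ}
    missing = missing_keys(merged)
    print("Missing keys:", ", ".join(missing) if missing else "none")
```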

5. Process the text files and create a vector database (with Chroma):

python src/data_ingestion.py
python src/data_preprocessing.py
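
The processing step turns each raw text into chunks that are later embedded. As an illustration only, the sketch below shows fixed-size character chunking with overlap in plain Python. The project itself relies on LangChain text splitters; the function name `split_text` and the chunk size and overlap values are assumptions, not the project's settings.

```python
def split_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into chunks of at most chunk_size characters.

    Consecutive chunks share `overlap` characters, so text near a chunk
    boundary appears in both neighbouring chunks.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current chunk reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


if __name__ == "__main__":
    sample = (
        "It is a truth universally acknowledged, that a single man in "
        "possession of a good fortune, must be in want of a wife."
    )
    for chunk in split_text(sample, chunk_size=40, overlap=10):
        print(repr(chunk))
```

Overlap trades a little storage for better recall: a passage that straddles a boundary can still be retrieved from either neighbouring chunk.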

6. Run the application:

python app.py

Deployment

Using Docker Locally

Docker ensures that the chatbot runs in a consistent environment across different machines. The following two steps build an image from the Dockerfile and run it as a container for local testing.

Build the Docker image:

docker build -t janeaustenchatbot .

Run the container:

docker run -p 5000:5000 janeaustenchatbot

Deploying on Render

Create a new web service on Render:
- Select the GitHub repository.
- Choose a runtime (Python 3.11.0).
- Add the necessary environment variables (OPENAI_API_KEY, etc.).

Set up a start command:

python app.py

Deploy and access the app!
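
A hosted web service generally has to listen on all network interfaces and on the port the platform assigns, while the local Docker example maps port 5000. The sketch below shows one way to resolve both from the environment. It is an assumption about how `app.py` could be configured, not a description of it: the `PORT` variable name, the `0.0.0.0` host, and the `resolve_bind` helper are illustrative.

```python
import os


def resolve_bind(env=None, default_port: int = 5000) -> tuple[str, int]:
    """Return the (host, port) pair the web server should listen on."""
    env = os.environ if env is None else env
    raw = env.get("PORT", "")
    # Fall back to the default when PORT is unset or not a plain number.
    port = int(raw) if raw.strip().isdigit() else default_port
    # 0.0.0.0 accepts connections from outside the container.
    return "0.0.0.0", port


if __name__ == "__main__":
    host, port = resolve_bind()
    print(f"Listening on {host}:{port}")
    # With Flask, these values would typically be passed to
    # app.run(host=host, port=port).
```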

Acknowledgments

We thank Project Gutenberg for providing access to Jane Austen’s literary works. This project is powered by OpenAI’s GPT models, enabling nuanced and context-aware responses. The frontend design draws inspiration from Regency-era aesthetics, reflecting the historical charm of Austen’s time.
