An experimental platform for technical users to explore various AI concepts and tools, featuring a comprehensive suite of labs including Model Playground, Web Research, Data Processing, Knowledge Base management, Embeddings generation, and Retrieval-Augmented Generation (RAG) pipelines.
**Model Playground**: Interactive environment for:
- Text generation and classification
- Named entity recognition
- Document summarization
- Multi-language translation
**Web Research Lab**: Automated research assistant with:
- Web content crawling and analysis
- Multi-source information synthesis
- Citation generation with credibility scoring
- Source evaluation and bias detection
**Data Processing Lab**: Advanced text-processing pipeline for:
- Document cleaning and normalization
- Semantic document chunking
- Format conversion with metadata preservation
- Automated attribute extraction
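As an illustration of the chunking step, here is a minimal word-window chunker; the function name and parameters are illustrative, not this lab's actual API:

```python
def chunk_text(text: str, max_words: int = 100, overlap: int = 20) -> list[str]:
    """Split text into overlapping windows of at most max_words words."""
    words = text.split()
    step = max_words - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        if start + max_words >= len(words):  # last window reached the end
            break
    return chunks
```

The overlap keeps content that straddles a chunk boundary visible to both neighboring chunks, which helps retrieval later in the pipeline.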
**Knowledge Base Lab**: Document management system featuring:
- Version-controlled storage
- Automated content categorization
- Semantic search capabilities
- Custom taxonomies and cross-referencing
**Embeddings Lab**: Vector representation workspace offering:
- Multiple embedding model options
- Interactive visualization tools
- Vector database management
- Index optimization and monitoring
**RAG Pipeline**: End-to-end system combining:
- Semantic search and context-aware retrieval
- Optimized embedding generation
- LLM integration for response generation
- Configurable pre/post-processing
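The stages above can be sketched end to end in toy form; bag-of-words vectors stand in for real embeddings, and the LLM call is reduced to prompt assembly (every name here is illustrative, not this repo's API):

```python
import math
import re
from collections import Counter

def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words term-frequency vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Rank documents by similarity to the query and keep the top k."""
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def build_prompt(query: str, context: list[str]) -> str:
    """Assemble the retrieved context into a grounded LLM prompt."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"

docs = [
    "ChromaDB stores embeddings on local disk.",
    "Streamlit renders the frontend.",
    "The REST API is served by FastAPI.",
]
top = retrieve("Where are embeddings stored?", docs, k=1)
prompt = build_prompt("Where are embeddings stored?", top)
```

In the real pipeline the assembled prompt would be sent to the LLM, with the configurable pre/post-processing hooks wrapping the retrieve and generate steps.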
Prerequisites:

- Python 3.10 or 3.11
- OpenAI API key
- Virtual environment (recommended)
Installation:

- Clone the repository:
```bash
git clone https://github.com/DavoCoder/ai-lab.git
cd ai-lab
```

- Create and activate a virtual environment:
```bash
python -m venv venv
source venv/bin/activate   # On Unix/macOS
# or
venv\Scripts\activate      # On Windows
```

- Install dependencies:
```bash
pip install -r requirements.txt
```

- Set up environment variables:
- Create a .env file in the root folder
- Use the .env_example file as a template for the required variables
- Fill in each variable with your local paths
| Environment Variable | Description | Supported Files |
|---|---|---|
| CHROMA_PERSIST_DIR_PATH | Local directory where the ChromaDB store is created | - |
| KNOWLEDGE_ARTICLES_DIR_PATH | Local directory containing the documents to embed into ChromaDB | .txt |
| METADATA_FILE_PATH | Local file storing content hashes used to detect changes in the knowledge-base directory and its files | - |
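For reference, a filled-in .env might look like the following; the paths are placeholders, and OPENAI_API_KEY is assumed from the prerequisites — .env_example remains the authoritative list:

```env
CHROMA_PERSIST_DIR_PATH=/absolute/path/to/chroma_db
KNOWLEDGE_ARTICLES_DIR_PATH=/absolute/path/to/knowledge_articles
METADATA_FILE_PATH=/absolute/path/to/metadata.json
OPENAI_API_KEY=your-openai-api-key
```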
- Run the Streamlit app:

```bash
streamlit run app.py
```

- Run the REST API:

```bash
uvicorn rest_api.rag_processor_api:app --reload
```

Access the API documentation at http://localhost:8000/docs.
```
ai-lab/
├── config/                      # Configuration
├── data_processing/             # Data processing
├── embeddings/                  # Embedding models
├── file_handler/                # File handling
├── knowledge_base/              # Document processing
├── nlp_processing/              # NLP processing models
├── query_pre_processing/        # Query enhancement
├── rag/                         # RAG processing
├── response_post_processing/    # Response post-processing
├── rest_api/                    # REST API (FastAPI)
├── retrival_optimization/       # Retrieval optimization
├── toxicity_detection/          # Toxicity detection
├── ui/                          # UI
├── vector_databases/            # Vector storage
├── web_research/                # Web research
├── app.py                       # Streamlit frontend
├── embeddings_generation.py     # Embeddings generation
└── requirements.txt             # Project dependencies
```
To contribute:

- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Open a Pull Request
Licensed under the Apache License, Version 2.0.
Acknowledgments:

- OpenAI for LLM support
- LangChain for the framework
Built by DavoCoder