A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.
-
Updated
Oct 30, 2025 - Python
A modular framework for building and deploying Retrieval-Augmented Generation (RAG) systems with built-in evaluation and monitoring.
🤖 Production-ready samples for building multi-modal AI agents that understand images, documents, videos, and text using Amazon Bedrock and Strands Agents. Features Claude integration, MCP tools, streaming responses, and enterprise-grade architecture.
Understand and build embedding models, focusing on word and sentence embeddings, dual encoder architectures. Learn to train embedding models using contrastive loss, implement them in semantic search and RAG systems.
This project implements a Retrieval-Augmented Generation (RAG) based chatbot designed to handle university-related queries using natural language understanding. It combines semantic search with generative AI to provide precise, context-aware answers to students, faculty, and visitors.
Turn any LLM into a self-extending knowledge agent powered by a graph-structured memory - complete with PDF-to-graph ingestion, budget-aware optimisation, and dual-engine orchestration.
RAG Gateway Service 🚪🤖: FastAPI gateway that auto-detects query topics using OpenAI embeddings 🧠🔍 and routes requests to topic-specific RAG agents 🎯, with fallback support and Docker-ready 🚀🐳.
The course provides a comprehensive guide to optimizing retrieval systems in large-scale RAG applications. It covers tokenization, vector quantization, and search optimization techniques to enhance search quality, reduce memory usage, and balance performance in vector search systems.
Experimenting with different kinds of RAGs Systems
This project processes and retrieves information from PDF file or PDF collection. It leverages Qdrant as a vector database for similarity searches and employs a Retrieval-Augmented Generation (RAG).
This repository covers extensive tutorials on how to integrate LangSmith with LangChain with LangGraph to incorporate observability, monitoring, alerting, evaluation, etc. within complex LLM workflows and applications.
Advanced Retrieval-Augmented Generation system supporting multimodal document processing (text, tables, images) with multiple reasoning strategies and comprehensive evaluation framework.
Production-grade RAG system for Singapore government documents with OpenAI integration
Implements a Retrieval-Augmented Generation (RAG) system.
Training Data Generator for SPLADE Model Fine-tuning
AI_Security_Engineers_Roadmap
⚡ Generate dynamic CRUD and Auth routes effortlessly with FastAPI Auto Routes for SQLModel—no repetitive boilerplate needed.
Add a description, image, and links to the rag-systems topic page so that developers can more easily learn about it.
To associate your repository with the rag-systems topic, visit your repo's landing page and select "manage topics."