🤖 Polymind Bot

A powerful, multi-modal Telegram bot leveraging cutting-edge AI technologies including Gemini, DeepSeek, OpenRouter, and 50+ AI models for comprehensive conversational assistance, media processing, and collaborative features with MCP (Model Context Protocol) integration.

📑 Table of Contents

🤖 Telegram Gemini Bot

✨ Key Features

🧠 AI & Language Models

54+ AI Models: Hierarchical model selection across Gemini, DeepSeek, OpenRouter (Llama, Claude, GPT, Qwen, Mistral, etc.)
Tool-Calling Models: Specialized models with function calling capabilities for enhanced interactions
Intelligent Model Switching: Context-aware automatic model selection based on task type
Multi-Modal AI: Combined text, image, document, and voice processing in single requests
Conversation Memory: Persistent context across sessions with model-specific history
Smart Fallback System: Automatic failover between AI providers for reliability

🔧 MCP (Model Context Protocol) Integration

External Tool Integration: Connect to various MCP servers for enhanced capabilities
Context7 Documentation: Access to up-to-date documentation and code examples
Exa Search: Web search capabilities for real-time information
Sequential Thinking: Advanced reasoning and problem-solving tools
Fetch MCP: Web content fetching and analysis
Dynamic Tool Discovery: Automatic discovery and loading of available tools

🎨 Visual & Media Processing

Mermaid Diagram Rendering: Automatic detection and conversion of text-based diagrams to images
Image Generation: Advanced image creation via Together AI and Imagen3 with custom prompts
Video Generation: Text-to-video capabilities for creative content
Image Analysis: Intelligent visual content analysis and description
Document Processing: PDF, DOCX analysis with semantic search and content extraction

🎙️ Voice & Speech

Advanced Voice Recognition: engines ( Faster-Whisper)
Voice Activity Detection: Automatic silence filtering and speech enhancement
Confidence Scoring: Reliability metrics for transcription accuracy

👥 Group Chat & Collaboration

Group Intelligence: Shared memory and context across group conversations
Collaborative Workspaces: Team knowledge management and note sharing
Discussion Threading: Structured conversations with topic tracking
Group Analytics: Usage statistics and conversation insights
Role-Based Access: Customizable permissions for different group members
Real-Time Collaboration: Live typing indicators and activity streams

🔧 Technical Excellence

Production-Ready: Optimized for high-traffic deployments with webhook support
Rate Limiting: Intelligent request management and flood protection
Advanced Formatting: Rich markdown with tables, spoilers, LaTeX, and code highlighting
Smart Message Chunking: Automatic splitting of long responses within Telegram limits
Error Recovery: Comprehensive error handling with graceful degradation
Performance Monitoring: Built-in logging, analytics, and debugging tools

🔧 Prerequisites

Python 3.11+ with asyncio support
Node.js 20.x+ for Mermaid diagram rendering
MongoDB instance (local or MongoDB Atlas)
Required APIs:
- Telegram Bot Token (via @BotFather)
- Google Gemini API key
- OpenRouter API key (optional, for 50+ additional models)
- DeepSeek API key (optional, for DeepSeek models)
- Together AI API key (for image/video generation)
- MCP API keys (for external tool integration)
System Dependencies:
- FFmpeg (for audio/video processing)
- @mermaid-js/mermaid-cli (auto-installed in Docker)

🚀 Installation

🛠️ Development Setup

# Clone the repository
git clone https://github.com/Remy2404/Polymind.git
cd Polymind

# Install Python dependencies using uv (recommended)
uv sync

# Alternative: Install with pip
# pip install -r requirements.txt

# Install Node.js dependencies for Mermaid rendering
npm install -g @mermaid-js/mermaid-cli

# Verify installation
mmdc --version  # Should show Mermaid CLI version

⚡ Quick Start

# Start development server with hot reload
uv run python app.py

# Or start with uvicorn directly
uv run uvicorn app:app --host 0.0.0.0 --port 8000 --reload

# Start with production optimizations
uv run python app.py

⚙️ Configuration

Create a comprehensive .env file in the project root:

# 🤖 Core Bot Configuration
TELEGRAM_BOT_TOKEN=your_telegram_bot_token
MONGODB_URI=mongodb://localhost:27017  # or MongoDB Atlas URI

# 🧠 AI Model APIs
# Links to get your API keys:
# - Gemini: https://aistudio.google.com/
# - Together AI: https://www.together.ai/
# - OpenRouter: https://openrouter.ai/

GEMINI_API_KEY=your_gemini_api_key
OPENROUTER_API_KEY=your_openrouter_api_key
TOGETHER_API_KEY=your_together_api_key

# 🔧 MCP Integration
# - Smithery: https://smithery.ai/
MCP_API_KEY=your_mcp_api_key

# 🌐 Web Configuration
WEBHOOK_URL=https://your-domain.com
PORT=8000

Important

for WEBHOOK_URL use ngrok for local testing:

# https://ngrok.com/
ngrok http 8000

💡 Usage

🚀 Starting the Bot

# Start the bot
uv run python app.py

# Start with hot reload for development
uv run uvicorn app:app --host 0.0.0.0 --port 8000 --reload

# Start with production optimizations
uv run python app.py

Production Deployment

# Using Gunicorn with multiple workers
gunicorn app:app -w 4 -k uvicorn.workers.UvicornWorker --bind 0.0.0.0:8000

# Using Docker (recommended for production)
docker-compose up -d

# Using uv for production
uv run python app.py

🌟 Key Features in Action

🎨 Mermaid Diagram Generation

Simply ask the bot to create diagrams:

👤 "Create a flowchart showing the user registration process"
🤖 [Automatically renders a beautiful diagram as an image]

Supports all Mermaid diagram types: flowcharts, sequence, class, ER, Gantt, etc.
Intelligent syntax cleaning and error handling
Fallback to code display if rendering fails

🧠 Multi-Model AI Conversations

👤 /switchmodel
🤖 Shows hierarchical model selection:
    📂 🧠 Gemini Models (3)
    📂 🔮 DeepSeek Models (5)  
    📂 🦙 Meta Llama Models (8)
    📂 🌟 Qwen Models (6)
    📂 ...and 40+ more models

🏢 Group Collaboration

Add the bot to any group chat:

/groupsettings - Configure collaboration features
/groupcontext - View shared group memory
/groupthreads - Manage discussion topics
/groupstats - Group usage analytics

📄 Document Processing

Upload any PDF or DOCX file:

Intelligent content extraction and analysis
Semantic search within documents
AI-powered summarization and Q&A
Export conversations to formatted documents

🎯 Specialized Use Cases

For Developers

👤 "Explain this Python code and suggest improvements"
🤖 [Provides detailed code analysis with suggestions]

👤 "Create a class diagram for a user authentication system"  
🤖 [Generates professional UML diagram]

For Content Creators

👤 /genimg "A futuristic city at sunset with flying cars"
🤖 [Creates high-quality AI-generated image]

#### For Teams and Groups
```bash
👤 "Summarize our last discussion about the project timeline"
🤖 [Provides intelligent summary of group conversations]

👤 /groupthreads
🤖 [Shows organized discussion topics and threads]

For Document Export & Creation

👤 /exportdoc
🤖 Choose what to export:
    📜 Export Conversation
    ✏️ Provide Custom Text
    
👤 [Send custom text like "# My Report\n\nThis is my **important** document"]
🤖 [Converts to professional PDF/DOCX with proper formatting]

👤 /gendoc
🤖 [AI generates complete documents based on your requirements]

📋 Commands

Command	Description	Usage Example
`/start`	Initialize the bot and get welcome	`/start`
`/help`	List all available commands	`/help`
`/genimg`	Generate an image from text prompt	`/genimg sunset over mountains`
`/reset`	Clear conversation history	`/reset`
`/switchmodel`	Hierarchical AI model selection	`/switchmodel`
`/listmodels`	List all available AI models	`/listmodels`
`/currentmodel`	Show current AI model	`/currentmodel`
`/exportdoc`	Export chat to PDF/DOCX	`/exportdoc`
`/gendoc`	Generate AI-powered documents	`/gendoc`

👥 Group Chat Commands

Command	Description	Usage Example
`/groupstats`	Show group usage statistics	`/groupstats`
`/groupsettings`	Configure group settings	`/groupsettings`
`/groupcontext`	View shared group memory	`/groupcontext`
`/groupthreads`	Manage discussion topics	`/groupthreads`
`/cleanthreads`	Clean up inactive conversation threads	`/cleanthreads`

🔧 MCP (Model Context Protocol) Commands

Command	Description	Usage Example
`/mcpstatus`	Show MCP integration status	`/mcpstatus`
`/mcptoggle`	Enable/disable MCP for your account	`/mcptoggle`
`/mcptools`	List available MCP tools	`/mcptools`
`/mcphelp`	Show MCP help and usage guide	`/mcphelp`

🌟 Special Features

🎨 Automatic Mermaid Rendering: Just ask for diagrams and they'll be rendered as images
🎙️ Voice Messages: Send voice notes for transcription and response
📁 File Upload: Drag and drop up to 5 files (PDFs, images, videos, documents) for AI analysis. All files are analyzed together in the same chat context for comprehensive results
💬 Group Chat: Add bot to groups with @mention support
🔄 Model Memory: Each AI model maintains separate conversation history
📄 Rich Export: Export conversations with formatting, images, and metadata
🛠️ Tool-Calling Models: Access to AI models with function calling capabilities
🔧 MCP Integration: Connect to external tools and services for enhanced functionality

Docker Deployment

Build and run:

docker build -t telegram-gemini-bot .
docker run -d -p 8000:8000 --env-file .env telegram-gemini-bot

With Docker Compose:

docker-compose up -d

Contributing

Contributions are welcome. Fork the repo, create a feature branch, commit your changes, and open a pull request.

License

This project is licensed under the MIT License. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 447 Commits
.github		.github
.specify		.specify
assets		assets
migration		migration
scripts		scripts
src		src
tests		tests
.dockerignore		.dockerignore
.env.template		.env.template
.gitignore		.gitignore
.python-version		.python-version
AGENTS.md		AGENTS.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Procfile		Procfile
README.md		README.md
SECURITY.md		SECURITY.md
app.py		app.py
docker-compose.yml		docker-compose.yml
frontend_backend_integration_flow.mmd		frontend_backend_integration_flow.mmd
mcp.json		mcp.json
mcp.json.template		mcp.json.template
model_config_diagram.mmd		model_config_diagram.mmd
models.json		models.json
project_structure_diagram.mmd		project_structure_diagram.mmd
puppeteer-config.json		puppeteer-config.json
pyproject.toml		pyproject.toml
remove_commnet.py		remove_commnet.py
run.ps1		run.ps1
runtime.txt		runtime.txt
temp_models.json		temp_models.json
uv.lock		uv.lock

License

Remy2404/Polymind

Folders and files

Latest commit

History

Repository files navigation

🤖 Polymind Bot

📑 Table of Contents

✨ Key Features

🧠 AI & Language Models

🔧 MCP (Model Context Protocol) Integration

🎨 Visual & Media Processing

🎙️ Voice & Speech

👥 Group Chat & Collaboration

🔧 Technical Excellence

🔧 Prerequisites

🚀 Installation

🛠️ Development Setup

⚡ Quick Start

⚙️ Configuration

💡 Usage

🚀 Starting the Bot

Production Deployment

🌟 Key Features in Action

🎨 Mermaid Diagram Generation

🧠 Multi-Model AI Conversations

🏢 Group Collaboration

📄 Document Processing

🎯 Specialized Use Cases

For Developers

For Content Creators

For Document Export & Creation

📋 Commands

👥 Group Chat Commands

🔧 MCP (Model Context Protocol) Commands

🌟 Special Features

Docker Deployment

Contributing

License

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages