๐ก๏ธ Privacy-First โข ๐ Lightning-Fast โข ๐ค AI-Powered โข ๐ฑ Cross-Platform
Complete offline transcription with speaker identification, AI summaries, and export capabilities
|
๐ก๏ธ 100% Privacy-First - All AI processing happens offline |
๐ Students - Record lectures, meetings, interviews |
# ๐ฅ Step 1: Clone the repo
git clone https://github.com/OP-88/Verba.devops.git
cd Verba.devops
# ๐ Step 2: Backend setup
cd backend
python -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install -r requirements.txt
# โก Step 3: Start backend server
uvicorn src.run_fastapi_audio_fixed:app --reload --host 0.0.0.0 --port 8000
# ๐จ Step 4: Frontend setup (new terminal)
cd frontend
export VITE_API_URL=http://localhost:8000 # Windows: set VITE_API_URL=http://localhost:8000
npm install && npm run dev๐ Visit http://localhost:8080 and start transcribing! ๐
๐ Try the Live Demo โข ๐ Read the Docs โข ๐ Report Issues
- โก Real-Time Transcription - Live microphone recording with instant text conversion
- ๐ File Upload Support - Process WAV, MP3, M4A, and more audio formats
- ๐ Smart VAD - Enhanced voice activity detection with Silero VAD
- ๐ฏ Noise Reduction - Advanced audio preprocessing for clarity
- ๐ Audio Visualization - Real-time waveform and level monitoring
- ๐๏ธ Speaker Diarization - Automatic "who said what" identification using pyannote.audio
- ๐ Speaker Statistics - Speaking time analysis and dominant speaker detection
- ๐ท๏ธ Smart Labeling - Automatic speaker assignment to transcript segments
- ๐ Segment Merging - Intelligent combining of short speech segments
- ๐ Auto-Summarization - T5-powered summaries with key points extraction
- ๐ฏ Action Items - Automatic detection of tasks and follow-ups
- ๐ Sentiment Analysis - Meeting tone and mood detection
- ๐ฌ Smart Chat - AI assistant for transcript queries (hybrid mode)
- ๐ Multiple Formats - Markdown, PDF, JSON, TXT, SRT with full metadata
- โ๏ธ Customizable Exports - Include/exclude metadata, speakers, summaries
- ๐ One-Click Copy - Instant clipboard access with formatting
- ๐พ Auto-Save - SQLite database with full history tracking
- Ctrl+R - Start/Stop recording
- Ctrl+P - Pause/Resume recording
- Ctrl+C - Copy transcription
- Ctrl+E - Edit transcription
- Ctrl+S - Save/Export
- Esc - Cancel current action
- ๐ Web App - Modern React interface with PWA support
- ๐ฅ๏ธ Desktop Apps - Native Tauri applications for Windows, macOS, Linux
- ๐ฑ Mobile Responsive - Touch-optimized interface for tablets and phones
- โ๏ธ Cloud Deploy - One-click Vercel deployment ready
graph TB
A[๐ค Audio Input] --> B[๐ WebRTC Stream]
B --> C[โก FastAPI Backend]
C --> D[๐ค Whisper AI]
D --> E[๐ Transcription]
E --> F[๐พ SQLite Storage]
F --> G[๐ฑ React Frontend]
G --> H[๐ค Beautiful UI]
๐ verba/
โโโ ๐ backend/ # FastAPI powerhouse
โ โโโ ๐ฏ main.py # Server magic starts here
โ โโโ ๐๏ธ models/ # Database schemas
โ โโโ ๐ฃ๏ธ routes/ # API endpoints
โ โโโ โ๏ธ services/ # Whisper AI integration
โ โโโ ๐ requirements.txt
โโโ ๐ frontend/ # React brilliance
โ โโโ ๐จ src/
โ โ โโโ ๐งฉ components/ # Reusable UI magic
โ โ โโโ ๐ pages/ # Main app screens
โ โ โโโ ๐ services/ # API communication
โ โ โโโ ๐ฏ types/ # TypeScript definitions
โ โโโ ๐ฆ package.json
โ โโโ โก vite.config.ts
โโโ ๐ docs/ # Everything you need to know
| ๐ Method | ๐ฏ Endpoint | ๐ก What It Does | โจ Magic |
|---|---|---|---|
GET |
/health |
๐ Server heartbeat | Always alive |
POST |
/transcribe |
๐ค Transform audio โ text | AI-powered |
GET |
/history |
๐ Your transcription story | Full history |
POST |
/history |
๐พ Save your gems | Instant storage |
DELETE |
/history/{id} |
๐๏ธ Clean up | One-click delete |
GET |
/export/{id} |
๐ค Download magic | Multiple formats |
| ๐ฏ Endpoint | ๐ซ Real-Time Magic |
|---|---|
/ws/transcribe |
โก Live transcription stream |
|
|
๐พ RAM: 4GB
๐ฟ Storage: 2GB free
โก CPU: Dual-core
๐ Browser: Chrome 80+ | Firefox 75+ | Safari 13+ |
๐ฅ RAM: 8GB+
๐ฟ Storage: 5GB free
โก CPU: Quad-core+
๐ฎ GPU: CUDA-compatible (optional boost!) |
gantt
title ๐ Verba Development Timeline
dateFormat YYYY-MM-DD
section ๐๏ธ Foundation
Backend API Core :active, 2024-09-15, 7d
Database Schema :active, 2024-09-16, 5d
Whisper Integration :2024-09-20, 4d
section ๐จ Frontend
React UI Base :2024-09-18, 6d
WebRTC Recording :2024-09-22, 5d
Real-time Display :2024-09-25, 4d
section โจ Polish
Export Features :2024-09-28, 3d
UI/UX Enhancement :2024-09-30, 5d
Testing & Deploy :2024-10-03, 4d
| Phase | Feature | Status | Timeline |
|---|---|---|---|
| ๐๏ธ | Core API | ๐ In Progress | Week 1 |
| ๐ค | Whisper AI | โณ Planned | Week 2 |
| ๐จ | React UI | ๐ In Progress | Week 2 |
| โก | Real-time | โณ Planned | Week 3 |
| ๐ | Export | โณ Planned | Week 4 |
# ๐ด Fork it
git clone https://github.com/YOUR-USERNAME/Verba.devops.git
# ๐ฑ Branch it
git checkout -b feature/amazing-transcription-magic
# โจ Code it
# ... your brilliant contributions ...
# ๐ Push it
git push origin feature/amazing-transcription-magic
# ๐ PR it - Open a Pull Request!|
๐จ Frontend Magic
|
โก Backend Power
|
๐ค AI Enhancement
|
Coming soon - your name could be here!
Be the first to contribute and earn your place in Verba history! ๐
| ๐ Issue | ๐ก Status | ๐ฏ Solution |
|---|---|---|
| Repository URL verification | ๐ Working | Testing clone process |
| Development environment | โก Priority | Automated setup script |
| Dependency management | ๐ Active | Version compatibility check |
|
|
This project is licensed under the MIT License - see the LICENSE file for details.
๐ Free to use, modify, and distribute! ๐
Transforming the way we interact with audio, one transcription at a time
๐ Ready to revolutionize transcription? Let's build something amazing together! ๐
Made with ๐ฅ passion and โก cutting-edge technology