Skip to content

Radix-Obsidian/Voco-ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

62 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Voco V2

CI License Discord

The Voice-Native Coding Orchestrator β€” Sub-300ms Voice-to-Action

Voco is a local-first, zero-trust voice interface for developers. Speak naturally and Voco will search your codebase, generate code, run terminal commands, and explain results β€” all with human-in-the-loop approval.

Quick Start (5 Minutes)

# 1. Clone and install
git clone https://github.com/Radix-Obsidian/Voco-ai.git
cd Voco-ai/services/mcp-gateway
npm install

# 2. Set up environment
cp .env.example .env
# Add your API keys from https://voco.ai/dashboard

# 3. Start development
npm run dev

Features & Capabilities

πŸŽ™οΈ Voice-to-Context Engine

  • Sub-300ms voice transcription via Deepgram
  • Barge-in support with Silero VAD
  • Instant voice feedback via Cartesia TTS
  • Context-aware LangGraph state machine

πŸ”’ Zero-Trust MCP Gateway

  • Tauri v2 secure sandbox
  • Human-in-the-loop terminal approval
  • Filesystem scope validation
  • Row-level security via Supabase

πŸ’‘ Enterprise Features

  • "Seat + Meter" billing ($19/mo + $0.02/turn)
  • Team workspaces & shared context
  • Audit logs & usage analytics
  • SOC 2 compliance ready

Architecture

Voco V2 is a monorepo with two independent runtimes:

services/
β”œβ”€β”€ mcp-gateway/        # Local frontend & MCP execution sandbox
β”‚   β”œβ”€β”€ Tauri v2 (Rust) # Zero-trust security layer
β”‚   β”œβ”€β”€ React 18       # Modern UI with Shadcn components
β”‚   └── TypeScript     # Type-safe codebase
β”‚
└── cognitive-engine/   # Cloud reasoning & audio engine
    β”œβ”€β”€ LangGraph      # Stateful AI orchestration
    β”œβ”€β”€ Silero-VAD     # Voice activity detection
    β”œβ”€β”€ FastAPI        # WebSocket bridge
    └── Python 3.12+   # Async runtime

MCP Gateway (services/mcp-gateway/)

The local desktop application built with Tauri. Provides:

  • Zero-trust MCP execution sandbox
  • Human-in-the-loop terminal approval
  • Filesystem scope validation
  • WebSocket bridge to cognitive engine
  • Supabase auth & RLS integration
  • Modern React UI with Shadcn/UI

Stack: Tauri v2 β€’ React 18 β€’ TypeScript β€’ Vite β€’ Tailwind CSS 3 β€’ Shadcn/UI β€’ Supabase β€’ Framer Motion

Cognitive Engine (services/cognitive-engine/)

The cloud reasoning service that handles:

  • Voice transcription (Deepgram)
  • Text-to-speech (Cartesia)
  • LangGraph state machine
  • Tool execution & validation
  • Background job queue
  • Usage metering & billing

Stack: Python 3.12+ β€’ FastAPI β€’ LangGraph β€’ Silero-VAD β€’ Supabase β€’ Stripe

Getting Started

Prerequisites

Setup

# Clone the repository
git clone https://github.com/Radix-Obsidian/Voco-ai.git
cd Voco-ai

# Install MCP Gateway dependencies
cd services/mcp-gateway
npm install

# Configure environment
cp .env.example .env
# Edit .env with your Supabase credentials

# Start the dev server
npm run dev

The app will be available at http://localhost:8080.

Environment Variables

Variable Description
VITE_SUPABASE_URL Your Supabase project URL
VITE_SUPABASE_PUBLISHABLE_KEY Your Supabase anon/public key

Project Structure

.
β”œβ”€β”€ Docs/                          # Product & technical documentation
β”‚   β”œβ”€β”€ PRD.md                     # Product Requirements Document
β”‚   β”œβ”€β”€ TDD.md                     # Technical Design Document
β”‚   β”œβ”€β”€ SDD.md                     # System Design Document
β”‚   β”œβ”€β”€ Core-Features-List.md      # Feature inventory
β”‚   └── GTM.md                     # Go-to-Market strategy
β”œβ”€β”€ services/
β”‚   └── mcp-gateway/
β”‚       β”œβ”€β”€ src/
β”‚       β”‚   β”œβ”€β”€ assets/            # Branding assets (logo, icon)
β”‚       β”‚   β”œβ”€β”€ components/
β”‚       β”‚   β”‚   β”œβ”€β”€ ui/            # Shadcn UI primitives (48 components)
β”‚       β”‚   β”‚   β”œβ”€β”€ AuthModal.tsx   # Supabase email/password + Google OAuth
β”‚       β”‚   β”‚   β”œβ”€β”€ Header.tsx      # App header with logo
β”‚       β”‚   β”‚   └── ProtectedRoute.tsx
β”‚       β”‚   β”œβ”€β”€ hooks/             # React hooks (auth, settings, projects, toast)
β”‚       β”‚   β”œβ”€β”€ integrations/
β”‚       β”‚   β”‚   └── supabase/      # Supabase client & generated types
β”‚       β”‚   β”œβ”€β”€ lib/               # Utilities (cn helper)
β”‚       β”‚   └── pages/             # Landing, AppPage, NotFound
β”‚       β”œβ”€β”€ tailwind.config.ts
β”‚       β”œβ”€β”€ vite.config.ts
β”‚       └── components.json        # Shadcn configuration
β”œβ”€β”€ .windsurfrules                 # AI agent architectural guardrails
└── README.md

Documentation

Detailed documentation lives in the Docs/ directory:

  • PRD.md β€” Product requirements and feature parity with V1
  • TDD.md β€” LangGraph & streaming audio architecture
  • SDD.md β€” Zero-trust MCP & human-in-the-loop sandbox design
  • Core-Features-List.md β€” Complete feature inventory
  • GTM.md β€” Go-to-market strategy

Community

Contributing

We welcome contributions! Please read our Contributing Guide before submitting a pull request.

  1. Fork the repository
  2. Create a feature branch from main
  3. Make your changes and add tests
  4. Submit a pull request with a clear description

License

Licensed under the Apache License 2.0.

About

Voice to Code Magic

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Sponsor this project

Packages

 
 
 

Contributors