Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across vLLM, SGLang, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history, tokenization caching, Responses API, embeddings, WASM plugins, MCP, and multi-tenant auth.
Updated Mar 16, 2026 - Rust