🥭 Semango

🥭 Semango is a hybrid search engine that combines lexical (BM25) and semantic (vector) search. Index your codebase and docs and search with natural language queries.

Website & docs: https://semango.org

Features

Hybrid Search: Combines BM25 lexical search (Bleve) with vector similarity (FAISS)
Multi-format Ingestion: Markdown/text, code (plain text), PDFs, CSV/JSON/JSONL, Parquet, SQLite, Excel
Incremental Indexing: Only re-indexes files that have changed (by mtime + size), making subsequent runs near-instant
Embedding Providers: OpenAI-compatible API or local ONNX models
GPU Acceleration: Built-in support for CUDA-accelerated local embeddings
MCP Server: Model Context Protocol support for LLM tool-use (stdio and SSE transports)
Web UI: Embedded React-based search interface with dark mode
REST API: Lightweight HTTP API for programmatic access
Graceful Shutdown: Clean Ctrl+C handling across all commands
Single Binary: Self-contained executable with embedded UI assets

Installation

Download Binary

# macOS / Linux
curl -L "https://github.com/omarkamali/semango/releases/latest/download/semango_$(uname -s | tr '[:upper:]' '[:lower:]')_$(uname -m).tar.gz" | tar xz
sudo mv semango /usr/local/bin/

Docker

docker pull ghcr.io/omarkamali/semango:latest
docker run -p 8181:8181 -v "$(pwd):/data" ghcr.io/omarkamali/semango:latest

Build from Source

Requires Go 1.23+, Node.js 20+, and CGO dependencies (FAISS, OpenBLAS).

git clone https://github.com/omarkamali/semango.git
cd semango
make build

Quick Start

# 1. Initialize configuration
semango init

# 2. Index your content
semango index

# 3. Start the server (includes web UI + auto-reconciliation)
semango server

Open http://localhost:8181 for the web UI, or query the API:

curl -X POST http://localhost:8181/search \
  -H "Authorization: Bearer your-secret-token" \
  -H "Content-Type: application/json" \
  -d '{"query": "how does authentication work", "top_k": 5}'
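
The same request can be issued from a script. The sketch below uses only Python's standard library; the endpoint, headers, and JSON fields come from the curl example, while the helper names `build_search_request` and `search` and the placeholder token are illustrative assumptions. The response schema is not described in this README, so the sketch simply returns the decoded JSON.

```python
import json
import urllib.request


def build_search_request(query, top_k=5, token="your-secret-token",
                         base_url="http://localhost:8181"):
    """Build a POST /search request mirroring the curl example (hypothetical helper)."""
    payload = json.dumps({"query": query, "top_k": top_k}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/search",
        data=payload,
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def search(query, **kwargs):
    """Send the request to a running `semango server` and return the decoded JSON body."""
    req = build_search_request(query, **kwargs)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example (requires a running server):
# results = search("how does authentication work", top_k=5)
```

Keeping request construction separate from the network call makes the payload easy to inspect or log before anything is sent.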

CLI Commands

| Command | Description |
| --- | --- |
| `semango init` | Create a default `semango.yml` config file |
| `semango index` | Index files (incremental; skips unchanged files) |
| `semango index stats` | Show indexing statistics (document counts) |
| `semango search <query>` | Search from the command line |
| `semango server` | Start the HTTP server with UI and periodic reconciliation |
| `semango mcp stdio` | Start an MCP server over stdin/stdout |
| `semango mcp sse` | Start an MCP server over HTTP/SSE |
| `semango models list` | List available local ONNX models |
| `semango version` | Print version information |

Tip: Running `semango index` a second time with no file changes completes almost instantly, because only modified files are re-embedded and re-indexed.
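
Change detection by modification time and size can be sketched as follows. This illustrates the general technique only, not Semango's actual code: the snapshot layout (path mapped to an `(mtime_ns, size)` pair) and the function names `scan`, `changed_files`, and `removed_files` are assumptions.

```python
import os


def scan(paths):
    """Map each path to its (mtime_ns, size) fingerprint without reading file contents."""
    snapshot = {}
    for path in paths:
        st = os.stat(path)
        snapshot[path] = (st.st_mtime_ns, st.st_size)
    return snapshot


def changed_files(current, previous):
    """Return paths that are new or whose fingerprint differs from the previous run."""
    return sorted(p for p, fp in current.items() if previous.get(p) != fp)


def removed_files(current, previous):
    """Return paths that were indexed before but no longer exist."""
    return sorted(p for p in previous if p not in current)
```

Comparing metadata is much cheaper than hashing file contents, which is why a no-change run can finish quickly. The trade-off is that an edit preserving both size and mtime would go unnoticed.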


## Configuration

Semango uses `semango.yml` for configuration. Key options:

```yaml
embedding:
  provider: openai          # or "local" for ONNX models
  model: text-embedding-3-large
  # Optional: base_url, api_key, api_key_env, base_url_env

lexical:
  enabled: true
  index_path: ./semango/index/bleve

hybrid:
  vector_weight: 0.7
  lexical_weight: 0.3
  fusion: linear            # or "rrf"

files:
  include:
    - '**/*.md'
    - '**/*.go'
    - '**/*.pdf'
  exclude:
    - .git/**
    - node_modules/**

server:
  port: 8181
  auth:
    type: token
    token_env: SEMANGO_TOKENS
```

See docs/SEMANGO_GUIDE.md for the complete configuration reference.
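
To illustrate how the `hybrid` settings interact, here is a sketch of the two fusion modes named in the config. It assumes min-max normalization of each score list before the weighted sum, and the conventional constant k = 60 for reciprocal rank fusion. Semango's actual normalization and constants are not stated in this README, and the function names are illustrative.

```python
def min_max(scores):
    """Rescale a {doc_id: score} map to [0, 1]; a constant map becomes all 1.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def linear_fusion(vector, lexical, vector_weight=0.7, lexical_weight=0.3):
    """Weighted sum of normalized scores; a doc missing from one list scores 0 there."""
    vec_n, lex_n = min_max(vector), min_max(lexical)
    return {
        doc: vector_weight * vec_n.get(doc, 0.0) + lexical_weight * lex_n.get(doc, 0.0)
        for doc in set(vec_n) | set(lex_n)
    }


def rrf_fusion(vector, lexical, k=60):
    """Reciprocal rank fusion: sum 1 / (k + rank) over each ranking (rank starts at 1)."""
    fused = {}
    for scores in (vector, lexical):
        ranked = sorted(scores, key=scores.get, reverse=True)
        for rank, doc in enumerate(ranked, start=1):
            fused[doc] = fused.get(doc, 0.0) + 1.0 / (k + rank)
    return fused


# Sample scores (made up for illustration): "a" is a strong vector-only hit,
# "b" appears in both result lists.
vector_hits = {"a": 0.9, "b": 0.5, "c": 0.1}
lexical_hits = {"b": 12.0, "c": 3.0}
```

With these sample scores, linear fusion ranks `a` first because the vector weight dominates, while RRF ranks `b` first because it appears near the top of both lists. Linear fusion depends on score magnitudes; RRF depends only on rank order.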

Environment Variables

| Variable | Description | Default / Notes |
| --- | --- | --- |
| `SEMANGO_TOKENS` | Comma-separated list of valid API tokens | Parsed but not enforced yet |
| `OPENAI_API_KEY` | API key used when `provider: openai` | Used if `api_key` or `api_key_env` is not set |
| `OPENAI_BASE_URL` | Base URL for OpenAI-compatible APIs | Defaults to the official OpenAI endpoint |
| `SEMANGO_ENV_FILE` | Path to the `.env` file to load | Defaults to `.env` |
| `SEMANGO_MODEL_DIR` | Cache directory for local ONNX models | Defaults to `~/.cache/semango` |

To have Semango read the API key or base URL from differently named environment variables, set `api_key_env` or `base_url_env` in your `semango.yml`.
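
One plausible lookup order for these settings is sketched below. The precedence shown (inline config value, then the custom environment variable, then the default variable) is an assumption for illustration; this README does not spell out the exact order, and `resolve_setting` is a hypothetical helper rather than part of Semango.

```python
import os


def resolve_setting(embedding_cfg, key, default_env, env=None):
    """Resolve `key` (e.g. "api_key") for the embedding provider.

    Assumed precedence (not confirmed by the README):
      1. embedding_cfg[key]                     inline value in semango.yml
      2. env[embedding_cfg[key + "_env"]]       custom variable name from config
      3. env[default_env]                       e.g. OPENAI_API_KEY
    Returns None when nothing is set, leaving the provider default in effect.
    """
    env = os.environ if env is None else env
    if embedding_cfg.get(key):
        return embedding_cfg[key]
    custom_env = embedding_cfg.get(f"{key}_env")
    if custom_env and env.get(custom_env):
        return env[custom_env]
    return env.get(default_env) or None


# Example: read the key from a custom variable named in the config.
# resolve_setting({"api_key_env": "MY_KEY"}, "api_key", "OPENAI_API_KEY")
```

Passing `env` explicitly keeps the function easy to test without touching the real process environment.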

MCP (Model Context Protocol)

Semango can act as an MCP server, letting LLMs use your indexed content as a search tool:

# stdio transport (for Claude Desktop, etc.)
semango mcp stdio --config semango.yml

# SSE transport (HTTP-based, for remote clients)
semango mcp sse --host 0.0.0.0 --port 8080 --config semango.yml

Documentation

https://semango.org/guide/
Local Embeddings — Using local ONNX models
Tabular Data — Ingesting CSV/JSON/JSONL/Parquet/SQLite/Excel files
MCP Integration — Using Semango with LLMs

Online: https://semango.org

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

License

MIT License.

Author

Built by Omar Kamali (https://omarkamali.com) — Omneity Labs (https://omneitylabs.com)
