Pack entire codebases into a single file for AI consumption.
Compress source code to its structure (signatures, types, imports — no function bodies), pack files by importance, and fit any project into any context window.
```sh
flat | pbcopy
```

That's it. .gitignore respected, secrets stripped, binaries excluded — automatically. Works on macOS, Linux, and Windows with native binaries.
```sh
# Install once
cargo install flat-cli

# Use it
flat | pbcopy                                 # everything to clipboard
flat --format markdown | pbcopy               # markdown for AI chat
flat --compress --tokens 100k | pbcopy        # fit into 100k context
flat --tokenizer claude --tokens 10k | pbcopy # accurate Claude token count
```

Three features that compose together:
- `--compress` — Structural compression using tree-sitter (16 languages)
  - Keeps: imports, type definitions, function signatures, docstrings, database schemas
  - Strips: function bodies, loop contents, variable assignments, query implementations
  - Result: 30-80% token reduction while preserving API surface and structure
- `--tokens N` — Context window budgeting
  - Scores files by importance (README=100, entry points=90, tests=30, fixtures=5)
  - Packs greedily: high-value files first, low-value files dropped
  - Fits any codebase into any context window
- `--format` — Choose your output
  - `markdown` (default) — human-readable, optimized for AI consumption (Claude, ChatGPT, etc.)
  - `xml` — structured XML, for programmatic parsing and tool integration
  - `json` — machine-readable format for scripts and automation
Why markdown is the default: Since flat is designed for AI/LLM consumption, markdown is the natural default. It's readable in chat interfaces, preserves code formatting without tags, and works perfectly with tools like Claude Code. XML adds overhead for human consumption, so it's available when you need structured data.
The power move:

```sh
flat --compress --tokenizer claude --tokens 100k --format markdown | pbcopy
```

This gives you:
- Compressed code (structure, no bodies)
- Accurate token count (Claude tokenizer, not heuristic)
- Bounded output (fits 100k context exactly)
- Human-readable markdown (perfect for copy-paste into AI)
| Feature | v0.4.0 | v0.5.0 | v0.6.0 | Details |
|---|---|---|---|---|
| Tree-sitter compression | ✓ | ✓ | ✓ | 16 languages, 30-80% reduction |
| Real tokenizers | ✓ | ✓ | ✓ | Claude/GPT-4/GPT-3.5 (accurate) or heuristic (fast) |
| Token budgeting | ✓ | ✓ | ✓ | Fit any codebase into any context window |
| Priority scoring | ✓ | ✓ | ✓ | README=100, entry points=90, tests=30, fixtures=5 |
| Output formats | ✓ | ✓ | ✓ | XML, Markdown, JSON (structured, human-readable) |
| Custom templates | | ✓ | ✓ | Handlebars templates for custom output formats |
| GitHub URLs | | ✓ | ✓ | Clone and analyze repos with `--github owner/repo` |
| MCP server mode | | ✓ | ✓ | JSON-RPC server for Claude Desktop integration |
| Multi-platform | ✓ | ✓ | ✓ | macOS, Linux, Windows with native binaries |
| Safety by default | ✓ | ✓ | ✓ | Secrets excluded, binaries skipped, .gitignore respected |
| 16 languages | ✓ | ✓ | ✓ | Rust, TS/JS, Python, Go, Java, C#, C, C++, Ruby, PHP, Solidity, Elixir, SQL, Bash |
| Fast | ✓ | ✓ | ✓ | 25k files in <3 seconds |
| Parallel processing | | | ✓ | 3-6x speedup (default in v0.6.0, `--no-parallel` to disable) |
| Incremental caching | | | ✓ | Instant re-runs for unchanged files (`--no-cache` to disable) |
| Watch mode | | | ✓ | Auto-regenerate on file changes (`--watch`) |
| Test coverage | ✓ | ✓ | ✓ | 350+ tests (unit + integration) |
Choose how your output is formatted with `--template`:
| Template | Use case | Output style |
|---|---|---|
| minimal | Quick analysis | Code only, clean headers |
| claude-review | Code reviews | Formatted for feedback, includes stats |
| openai-docs | Documentation | Academic format with sections |
Examples:

```sh
# Minimal — just code
flat --template minimal | pbcopy

# Code review format (includes metadata)
flat --compress --template claude-review | pbcopy

# Documentation format
flat --template openai-docs -o docs.md
```

Custom templates:

```sh
# Use a custom Handlebars template
flat --template ~/.config/flat/my-template.hbs
```

Template variables available: `{{files}}`, `{{statistics}}`, `{{repo_name}}`, `{{timestamp}}`
Three new features for faster iteration:
Parallel processing is enabled by default in v0.6.0 for 3-6x speedup on multi-core systems.
```sh
flat               # parallel enabled (default)
flat --compress    # works with all features
flat --no-parallel # disable parallel (fall back to sequential)
```

The `--parallel` flag is kept for backward compatibility but has no effect (parallel is always on by default). Use `--no-parallel` to disable and fall back to sequential processing.
Output is byte-for-byte identical to sequential mode (deterministic). If parallel processing fails for any reason, automatically falls back to sequential processing.
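One common way to get deterministic output from parallel work (a Python sketch of the idea, not flat's Rust internals) is to run the per-file work in a pool but collect results in input order, so the output matches a sequential run exactly:

```python
from concurrent.futures import ThreadPoolExecutor

def process_file(path):
    # Stand-in for the per-file work (parsing, compression, ...)
    return f"processed:{path}"

def flatten(paths, parallel=True):
    """Process files concurrently but emit results in input order,
    so output is identical to a sequential run."""
    if not parallel:
        return [process_file(p) for p in paths]
    with ThreadPoolExecutor() as pool:
        # Executor.map yields results in the order of the inputs,
        # regardless of which task finishes first.
        return list(pool.map(process_file, paths))

paths = ["src/main.rs", "src/lib.rs", "README.md"]
assert flatten(paths, parallel=True) == flatten(paths, parallel=False)
```

Because results are ordered by input rather than completion time, nondeterministic scheduling never leaks into the output.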
Cache results per file, invalidate only changed files:

```sh
flat --cache    # enables caching (default)
flat --no-cache # disable caching
```

Cache is stored in `~/.cache/flat/{repo-hash}/cache.bin`. Second run is 10-100x faster for unchanged projects.
Invalidation triggers:
- File modification time changes
- File size changes
- Compression config changes
- Cache is automatically pruned of stale entries
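The invalidation rules above amount to a cheap fingerprint per file. A minimal sketch (the key shape and function names are illustrative, not flat's actual cache format):

```python
import os

def cache_key(path, config_fingerprint):
    """A cached result is reused only if the file's mtime, size,
    and the compression config are all unchanged."""
    st = os.stat(path)
    return (st.st_mtime_ns, st.st_size, config_fingerprint)

cache = {}  # path -> (key, result)

def process_cached(path, config_fingerprint, process):
    key = cache_key(path, config_fingerprint)
    hit = cache.get(path)
    if hit is not None and hit[0] == key:
        return hit[1]          # unchanged file: reuse cached result
    result = process(path)     # changed (or new) file: recompute
    cache[path] = (key, result)
    return result
```

Note that a config change invalidates every entry at once, since the fingerprint is part of every key.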
Auto-regenerate output on any file change:

```sh
flat --watch                    # monitor for changes
flat --cache --watch --compress # fastest iteration
```

Debounces rapid changes (500ms) to avoid excessive rebuilds. Press Ctrl+C to exit.
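The debouncing behavior can be sketched as follows (a minimal Python model of the idea, not flat's watcher implementation): rebuild only once a quiet period has elapsed since the last change event.

```python
class Debouncer:
    """Coalesce bursts of file-change events: trigger a rebuild only
    after `quiet` seconds with no further events (500 ms in flat)."""
    def __init__(self, quiet=0.5):
        self.quiet = quiet
        self.last_event = None

    def on_event(self, now):
        # Each new event restarts the quiet-period timer.
        self.last_event = now

    def should_rebuild(self, now):
        if self.last_event is None:
            return False
        if now - self.last_event >= self.quiet:
            self.last_event = None  # consume the pending rebuild
            return True
        return False
```

A burst of saves within 500 ms therefore produces a single rebuild rather than one per save.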
Examples:

```sh
# Development workflow: compress + cache + watch
flat --compress --cache --watch

# Large repos with parallel + cache
flat --parallel --cache -o output.xml

# Quick iteration on specific files
flat --watch -i rs,toml
```

```sh
cargo install flat-cli
```

Works on macOS, Linux, Windows. Binary name: `flat`.
```sh
brew install zkoranges/tap/flat
```

Download from GitHub Releases:

- `flat-x86_64-apple-darwin.tar.gz` (macOS Intel)
- `flat-aarch64-apple-darwin.tar.gz` (macOS Apple Silicon)
- `flat-x86_64-unknown-linux-gnu.tar.gz` (Linux)
- `flat-x86_64-pc-windows-msvc.zip` (Windows)
- macOS 10.15+, Linux (glibc 2.17+), Windows 10+
- No external dependencies (fully static linked)
- Rust 1.70+ (if building from source)
```sh
flat                          # all files in current directory
flat src/                     # all files in src/
flat src/ --include rs,toml   # only .rs and .toml files
flat src/ --exclude test,spec # skip test/spec files
flat src/ --match '*_test.go' # glob patterns (repeatable)
flat src/ --max-size 10M      # files up to 10 MiB
```

```sh
flat --compress                     # signatures only (30-60% smaller)
flat --compress --full-match '*.rs' # compress most files, keep *.rs in full
```

```sh
flat --format markdown | pbcopy        # Markdown (default, human-readable)
flat --format xml                      # XML (structured, programmatic)
flat --format json                     # JSON (programmatic use)
flat --template claude-review | pbcopy # Custom template
flat -o snapshot.xml                   # write to file instead of stdout
```

```sh
flat --github owner/repo                                      # clone and analyze a repo
flat --github https://github.com/owner/repo                   # full URL
flat --github owner/repo/tree/branch                          # specific branch
flat --github owner/repo --compress --tokens 50k              # with other flags
flat --github owner/private-repo --github-token $GITHUB_TOKEN # private repos
```

Run flat as a JSON-RPC server for Claude Desktop:

```sh
flat --serve # start MCP server (stdin/stdout JSON-RPC)
```

The MCP server exposes these tools to Claude:
- analyze_repo - Flatten and analyze a repository
- list_files - List files in a repository
- get_statistics - Get repository statistics
Setup for Claude Desktop:

- Save this to `~/.config/Claude/claude_desktop_config.json`:

  ```json
  {
    "mcpServers": {
      "flat": {
        "command": "flat",
        "args": ["--serve"]
      }
    }
  }
  ```

- Restart Claude Desktop
- Use flat tools directly in Claude conversations

Protocol: JSON-RPC 2.0 (stdio communication)
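As a concrete illustration, invoking one of the tools above over the stdio transport could look like this (the envelope follows the MCP `tools/call` convention; the `arguments` keys shown are assumptions for illustration, not documented parameters):

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "analyze_repo",
    "arguments": { "path": "." }
  }
}
```

The server replies with a JSON-RPC response carrying the same `id`.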
```sh
flat --format json > codebase.json    # structured JSON for tools
flat --format json | jq '.statistics' # query with jq
```

```sh
flat --tokens 100k            # fit output into 100k tokens
flat --compress --tokens 100k # compress to fit budget
flat --tokens 100k --dry-run  # preview what fits
flat --tokens 100k --stats    # summary only
```

```sh
flat --tokenizer heuristic # fast estimate (default, conservative)
flat --tokenizer claude    # Claude 3/4 tokenizer (exact)
flat --tokenizer gpt-4     # GPT-4 tokenizer (exact)
flat --tokenizer gpt-3.5   # GPT-3.5 tokenizer (exact)
```

Tokenizer comparison:
| Tokenizer | Speed | Accuracy | Use case |
|---|---|---|---|
| `heuristic` | Instant | ~20-30% overestimate | Quick previews, safety margin |
| `claude` | ~1s | Exact | Pasting into Claude |
| `gpt-4` | ~1s | Exact | OpenAI API (GPT-4) |
| `gpt-3.5` | ~1s | Exact | OpenAI API (GPT-3.5) |
The heuristic intentionally overestimates so you stay within budget. Real tokenizers give exact counts when precision matters.
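A conservative heuristic of this kind can be sketched in a few lines (the constants below are an assumption for illustration — flat's actual formula is not documented here): estimate roughly four characters per token, then inflate the estimate so the true count rarely exceeds it.

```python
def heuristic_tokens(text):
    """Conservative token estimate (illustrative constants, not flat's
    actual formula): ~4 chars per token for English/code, inflated by
    25% so the real count is almost never higher than the estimate."""
    return int(len(text) / 4 * 1.25) + 1
```

Overestimating is the safe direction for budgeting: output packed under the estimate is guaranteed to fit when the real count comes in lower.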
Uses tree-sitter to parse and selectively strip:

```text
Kept                            Stripped
─────────────────────────────   ──────────────────────
imports, require(), use         function/method bodies
type definitions, interfaces    loop contents
struct/class declarations       if/else branches
function signatures             variable assignments
decorators, attributes            inside functions
docstrings, comments
module-level constants
enums, preprocessor directives
```
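To make the kept/stripped split concrete, here is a sketch of the same idea for Python source — flat uses tree-sitter, but the stdlib `ast` module demonstrates the transformation: keep imports, signatures, and docstrings, and replace function bodies with a `...` placeholder.

```python
import ast

def compress_python(source):
    """Sketch of structural compression for Python (illustrative;
    flat's tree-sitter engine handles 16 languages): keep imports,
    signatures, and docstrings; replace bodies with `...`."""
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef)):
            body = []
            if ast.get_docstring(node) is not None:
                body.append(node.body[0])             # keep the docstring
            body.append(ast.Expr(ast.Constant(...)))  # `...` placeholder body
            node.body = body
    return ast.unparse(tree)

src = '''
import os

def read_config(path):
    """Load config from disk."""
    with open(path) as f:
        return f.read()
'''
print(compress_python(src))
```

The signature and docstring survive, while the `with open(...)` body disappears — which is exactly the API-surface-only view the table below describes per language.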
| Language | Keeps | Body placeholder |
|---|---|---|
| Rust | use/mod/extern crate, attributes, macros, structs, enums, trait/impl signatures | `{ ... }` |
| TS/JS (JSX/TSX) | imports, interfaces, type aliases, enums, class member signatures, exports | `{ ... }` |
| Python | imports, docstrings, decorators, class variables, module constants | `...` |
| Go | package, imports, type/const/var declarations | `{ ... }` |
| Java | package, imports, class/interface/enum declarations, fields, constants | `{ ... }` |
| C# | using, namespaces, class/struct/record/interface, properties, events | `{ ... }` |
| C | #include/#define/preprocessor, typedefs, struct/enum/union | `{ ... }` |
| C++ | preprocessor, templates, namespaces, classes with members, using/aliases | `{ ... }` |
| Ruby | require, assignments, class/module structure | `...\nend` |
| PHP | `<?php`, use/namespace, class/interface/trait/enum, properties | `{ ... }` |
| Solidity | pragma, imports, contract/interface/library, event/error/struct/enum declarations | `{ ... }` |
| Elixir | defmodule, use/import/alias/require, module attributes, typespecs | `...\nend` |
| SQL | CREATE TABLE/INDEX/VIEW/SCHEMA (full DDL), comments, simple queries | stored procedures, complex queries |
| Bash | shebangs, exports/variables, short functions, comments | long function bodies, complex logic |
Files in other languages pass through in full. If tree-sitter can't parse a file, the original is included with a stderr warning.
| Codebase | Files | Full | Compressed | Reduction |
|---|---|---|---|---|
| Express | 6 | 61 KB | 28 KB | 54% |
| Flask | 24 | 339 KB | 214 KB | 37% |
| Next.js `packages/next/src` | 1,605 | 8.0 MB | 5.6 MB | 31% |
Files are ranked by importance for --tokens budgeting:
| Priority | Score | Examples |
|---|---|---|
| README | 100 | README.md, README.rst |
| Entry points | 90 | main.rs, index.ts, app.py |
| Config | 80 | Cargo.toml, package.json, tsconfig.json |
| Source | 70* | handler.rs, utils.ts (decreases with nesting depth) |
| Tests | 30 | *_test.go, test_*.py |
| Fixtures | 5 | tests/fixtures/*, __snapshots__/* |
When a token budget is specified, high-priority files are included first. Low-priority files are dropped first.
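The packing strategy can be sketched as a simple greedy loop (an illustrative model, not flat's exact algorithm): sort by priority score, then include whole files while they still fit the budget.

```python
def pack(files, budget):
    """Greedy packing sketch: sort by priority score (descending),
    include whole files while they fit the token budget.
    `files` is a list of (name, score, tokens) tuples."""
    included, used = [], 0
    for name, score, tokens in sorted(files, key=lambda f: -f[1]):
        if used + tokens <= budget:
            included.append(name)
            used += tokens
    return included

files = [
    ("README.md", 100, 400),
    ("main.rs", 90, 900),
    ("handler.rs", 70, 700),
    ("util_test.rs", 30, 600),
]
print(pack(files, budget=1500))
```

With a 1,500-token budget, the README and entry point fit, while the lower-priority source file and test are dropped.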
Secrets are always excluded — no flag needed:
| Pattern | Examples |
|---|---|
| Environment | .env, .env.local, .env.production |
| Keys | *.key, *.pem, *.p12, *.pfx |
| SSH | id_rsa, id_dsa, id_ecdsa, id_ed25519 |
| Credentials | credentials.json, serviceAccount.json |
Binary files are automatically skipped (images, media, archives, executables, compiled artifacts). All .gitignore patterns are respected.
Use --dry-run to preview before sharing code with external services.
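Filename-based exclusion of this kind can be sketched with glob matching (a minimal illustration using the patterns from the table above — flat's actual matcher and full pattern list may differ):

```python
import fnmatch

# Patterns from the table above (illustrative subset)
SECRET_PATTERNS = [
    ".env", ".env.*", "*.key", "*.pem", "*.p12", "*.pfx",
    "id_rsa", "id_dsa", "id_ecdsa", "id_ed25519",
    "credentials.json", "serviceAccount.json",
]

def is_secret(filename):
    """True if the file name matches any known secret pattern."""
    return any(fnmatch.fnmatch(filename, p) for p in SECRET_PATTERNS)
```

Matching on the base name (rather than file contents) keeps the check fast enough to run on every file unconditionally.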
Processes large codebases efficiently with v0.6.0's parallel processing and caching:
Real-world performance on massive codebases:
| Scenario | Time | Speedup | Notes |
|---|---|---|---|
| Basic flatten | 7.0s | 1.25x | vs v0.5.0 (8.8s) |
| Compression (sequential) | 112.1s | — | baseline for comparison |
| Compression + `--parallel` | 22.8s | 4.66x | Main use case |
| Parallel + cache (cold) | 10.1s | 10.5x | First cache build |
| Parallel + cache (warm) | 7.05s | 15.05x | Iterative development |
Key takeaway: On large repositories, v0.6.0 achieves 4.66x speedup for analysis and 15x speedup for iterative development with cache.
On typical projects (95-1,000 files):
```sh
$ time flat /path/to/nextjs --compress --stats
# 25,000+ files processed in ~3 seconds
Included:   24,327
Compressed: 19,771 files
Skipped:    894

real    0m2.883s
```

- Parallel processing (enabled by default in v0.6.0) — uses all CPU cores, 3-6x faster on multi-core systems. Use `--no-parallel` to disable.
- `--cache` (enabled by default) — incremental caching stores results, 30% faster on repeated runs
- `--no-cache` — disables caching for reproducible CI/CD pipelines
Without --tokens, compression streams file-by-file (constant memory). With --tokens, all candidate files are buffered for scoring — but even that is fast thanks to parallelization.
```sh
# The basics
flat | pbcopy                         # everything, to clipboard
flat --include rs,toml | pbcopy       # just Rust files
flat --stats                          # preview before copying

# Output formats
flat --format markdown | pbcopy       # markdown for AI chat
flat --format xml -o snapshot.xml     # XML for tools
flat --format json | jq '.statistics' # JSON for programmatic use

# Custom templates
flat --template minimal | pbcopy       # clean code-only output
flat --template claude-review | pbcopy # code review format
flat --template openai-docs -o docs.md # documentation format

# GitHub repos (clone & analyze)
flat --github owner/repo | pbcopy                          # analyze any public repo
flat --github owner/repo --compress --tokens 50k | pbcopy  # compressed
flat --github owner/repo --template claude-review | pbcopy # formatted
flat --github owner/repo/tree/main/src --stats             # specific branch/path

# Compression
flat --compress | pbcopy                        # structural overview
flat --compress --full-match 'main.rs' | pbcopy # overview + one file in full

# Token budgets
flat --compress --tokens 100k | pbcopy         # fit into 100k context
flat --compress --tokens 8k --dry-run          # preview what fits
flat --tokenizer claude --tokens 100k | pbcopy # exact Claude token count
flat --tokenizer gpt-4 --tokens 128k | pbcopy  # exact GPT-4 token count

# Targeted analysis
flat src/api --include ts --exclude spec       # just the API layer
flat --match '*_test.go' | pbcopy              # only test files
flat src/ --compress --full-match 'handler.rs' # debug one file in context

# The full pipeline (local repo)
flat src/ --compress --tokens 100k \
  --tokenizer claude --template claude-review | pbcopy

# The full pipeline (GitHub repo)
flat --github openai/whisper --compress \
  --tokens 50k --template openai-docs | pbcopy

# Save to file
flat --compress -o snapshot.xml     # compressed snapshot
flat --format json -o codebase.json # JSON snapshot
```

```text
src/
├── main.rs       CLI entry point, argument parsing
├── lib.rs        Public API
├── walker.rs     Directory traversal, two-pass budget allocation
├── compress.rs   Tree-sitter compression engine (16 languages)
├── priority.rs   File importance scoring
├── tokens.rs     Token estimation and real tokenizer support (Claude/GPT-4)
├── filters.rs    Secret detection, binary file skipping
├── output.rs     Output formatting (XML/Markdown) and statistics
├── parse.rs      Number parsing (k/M/G suffixes)
└── config.rs     Configuration and CLI options
```
Quality metrics:
- 280+ tests (unit + integration)
- Zero clippy warnings
- Full tree-sitter support for 16 languages
- Validated against Flask, FastAPI, Express, Next.js
```sh
cargo test --all && cargo clippy --all-targets -- -D warnings
```

Contributions welcome! Areas of interest:
- Additional language support (via tree-sitter)
- Tokenizer optimizations
- Performance improvements (v0.6.0 — caching, watch mode)
- IDE extensions (v1.0)
See LICENSE for details.
- Real tokenizer support (Claude/GPT-4/GPT-3.5)
- Markdown output format
- Multi-platform binaries (macOS/Linux/Windows)
- Published to crates.io
- MCP server mode (`flat --serve`)
- GitHub URL support (`--github org/repo`)
- Template system (minimal, claude-review, openai-docs)
- JSON output format
- 280+ tests, all passing
- Parallel processing (3-6x speedup, enabled by default)
- Incremental caching (10-100x faster iterative runs)
- Watch mode for auto-regeneration on file changes
- Hierarchical summarization (18:1 compression)
- Semantic code understanding
- IDE extensions
MIT — see LICENSE.