Raven.Bench — RavenDB Knee-Finding Benchmark

Raven.Bench is a focused load generator for RavenDB that ramps concurrency in a closed loop to automatically detect the “knee” — the point where throughput gains flatten while latency and/or errors start climbing. It’s designed to answer: “How far can we push this setup reliably?” and to plug neatly into stress/perf benches and CI pipelines.

This README is written for engineers who don’t need all the internals — just a clear way to build, run, and integrate results.

Highlights

Closed-loop ramp: C = start .. end x factor until knee.
Two transports: raw HTTP and client (official RavenDB .NET client).
Compression modes: identity, gzip, zstd (via client), brotli/deflate (raw).
Workload mixes: --reads/--writes/--updates by weights or percents.
Key distributions: uniform, zipfian, latest.
Metrics per step: throughput, p50/p90/p95/p99 (raw and RTT-normalized), error rate, bytes in/out, client CPU, network utilization, and server metrics (when available).
Knee rule (default): Δthroughput < 5% AND Δp95 > 20% (or errors > 0.5%).
Structured outputs: JSON summary and per-step CSV.

Note: v0 implements closed-loop only and very limited read scenarios (it was designed to study full writes bottlenecks).

Why It Works

Closed-loop control keeps request rate bounded by actual completions, preventing client-side overload that hides the real limit.
The ramp grows concurrency geometrically (e.g., x2) to quickly traverse the performance curve.
A startup calibration measures lightweight endpoints to estimate baseline RTT, enabling RTT-normalized latencies that separate network/stack overhead from load-induced queuing.
Knee detection flags the last “safe” step before saturating a bottleneck (network/client CPU/server queues/etc.).

What It’s Not

Not a functional test tool; it’s a controlled throughput/latency probe.
Not an OLTP simulator; payloads are synthetic and simple by design.

Getting Started

Requirements
- .NET 8 SDK
- A reachable RavenDB server and database (dev or prod-like). Examples below use http://localhost:8080 and database ycsb.
- dotnet build RavenBench.sln -c Release

Usage

Basic shape
- Raven.Bench closed --url <server> --database <db> [options]
Common options
- --warmup <duration> and --duration <duration>: per-step timing, e.g., 20s, 60s.
- --distribution <uniform|zipfian|latest>: key selection strategy (default: uniform).
- --doc-size <bytes|KB|MB>: payload size for writes/updates (default: 1KB).
- --profile <mixed|writes|reads|query-by-id|bulk-writes|stackoverflow-reads|stackoverflow-queries|query-users-by-name>: required. Selects which workload to run.
- --query-profile <equality|range|text-prefix|text-search|text-search-rare|text-search-common|text-search-mixed>: query type for query workloads (default: equality). Only valid with query profiles.
- --reads/--writes/--updates <weight|percent>: only valid with --profile mixed; values normalize to 100%.
- --dataset <stackoverflow>: Auto-downloads and imports dataset before benchmark (optional, required for StackOverflow profiles).
- --dataset-profile <small|half|full>: Predefined dataset sizes with automatic database naming:
  - small: ~5GB data → database StackOverflow-5GB
  - half: ~20GB data → database StackOverflow-20GB
  - full: ~50GB data → database StackOverflow-50GB
- --dataset-size <N>: Custom dataset size: N post dump files (~1GB each + 2GB users). Auto-generates database name like StackOverflow-12GB for N=10. Overridden by --dataset-profile.
- --dataset-skip-if-exists: Skip dataset import if data already exists (default: true).
- --dataset-cache-dir <path>: Directory for caching downloaded dataset files.
- --http-version <auto|1.1|2.0|3.0> and --strict-http-version: version negotiation and enforcement.
- --transport <raw|client> and --compression <identity|gzip|zstd|br|deflate>:
- raw uses HTTP directly; identity/gzip/br/deflate supported.
- client uses RavenDB .NET client; identity/gzip/zstd supported (zstd recommended for realistic runs).
- --concurrency <start..endxfactor>: geometric ramp (default: 8..512x2).
  - Examples: --concurrency 8..32x2 (runs at C=8, 16, 32), --concurrency 16..128x2 (runs at C=16, 32, 64, 128)
- --preload <N>: pre-insert documents to grow keyspace before the ramp.
- --out <file.json> and --out-csv <file.csv>: write structured results.
  - --verbose: aggregate and print a summary of top error messages.
Expert options
- --max-errors <percent>: stop early if error rate exceeds this per step (default: 0.5%).
- --knee-rule dthr=<percent>,dp95=<percent>: threshold deltas for knee detection (default: 5%,20%).
- --latencies <normalized|raw|both>: which latencies to print to console.
- SNMP telemetry (opt-in server monitoring):
  - --snmp-enabled: enable SNMP collection (default: false)
  - --snmp-profile <minimal|extended>: metric profile (default: minimal, 4 metrics; extended adds IO/load/request counters, 14 metrics)
  - --snmp-port <int>: SNMP agent port (default: 161)
  - --snmp-interval <duration>: poll interval (default: 250ms)
  - --snmp-timeout <duration>: query timeout (default: 5s)
  - See docs/snmp-metric-catalog.md for metric details and troubleshooting
- --network-limited and --link-mbps <double>: annotate verdicts for known link speeds.
- --raw-endpoint <path-with-{id}>: with --transport raw, test a custom endpoint (e.g., /databases/db/docs?id={id}).
- --tp-workers/--tp-iocp <int>: adjust ThreadPool minimums (defaults are high to avoid client-side starvation).
- --expected-cores <int>: Expected CPU core count for validation.
- --notes <string>: Custom notes to include in output.

Quick Starts

Expose the hose (identity raw HTTP)
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile mixed --reads 75 --writes 25 --distribution uniform --transport raw --compression identity --concurrency 8..512x2 --duration 60s --out results.json --out-csv steps.csv
Realistic client with compression
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile mixed --transport client --compression zstd --reads 75 --writes 25 --concurrency 8..1024x2 --duration 60s --latencies both
Zipfian reads and small docs
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile mixed --reads 90 --writes 10 --distribution zipfian --doc-size 512B --duration 45s
Write-only profile
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile writes --concurrency 16..256x2 --duration 60s
Read-only with preload
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile reads --preload 100000 --distribution zipfian --concurrency 8..512x2
Query-by-id profile
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile query-by-id --preload 100000 --distribution uniform --concurrency 8..256x2
Bulk writes (100 docs per batch)
- Raven.Bench closed --url http://localhost:8080 --database ycsb --profile bulk-writes --bulk-batch-size 100 --concurrency 4..64x2
StackOverflow random reads with small dataset (auto-imports to StackOverflow-5GB)
- Raven.Bench closed --url http://localhost:8080 --profile stackoverflow-reads --dataset stackoverflow --dataset-profile small --concurrency 8..256x2 --duration 60s
Rate-based benchmark (constant RPS steps)
- Raven.Bench rate --url http://localhost:8080 --database ycsb --profile mixed --reads 75 --writes 25 --distribution uniform --transport raw --compression identity --step 200..20000x1.5 --duration 60s --out results.json --out-csv steps.csv
StackOverflow queries with half dataset (auto-imports to StackOverflow-20GB)
- Raven.Bench closed --url http://localhost:8080 --profile stackoverflow-queries --dataset stackoverflow --dataset-profile half --concurrency 8..128x2
StackOverflow with custom size (auto-imports to StackOverflow-12GB with 10 post dumps)
- Raven.Bench closed --url http://localhost:8080 --profile stackoverflow-reads --dataset stackoverflow --dataset-size 10 --concurrency 8..256x2
StackOverflow text search queries (auto-imports to StackOverflow-5GB)
- Raven.Bench closed --url http://localhost:8080 --profile stackoverflow-queries --query-profile text-search --dataset stackoverflow --dataset-profile small --concurrency 8..128x2
- Text search variants: text-search-rare (high selectivity), text-search-common (low selectivity), text-search-mixed (50/50 mix)
Users range queries (requires Users dataset)
- Raven.Bench closed --url http://localhost:8080 --profile query-users-by-name --query-profile range --dataset stackoverflow --dataset-profile small --concurrency 8..128x2
HTTP/3 (strict) or auto negotiate
- Strict HTTP/3: --http-version 3 --strict-http-version
- Negotiable HTTP/2: --http-version http2
- Auto (default): --http-version auto

Workloads and Distributions

Mix
- Requires --profile mixed and --preload N.
- Provide --reads/--writes/--updates as either weights or percents; values normalize to 100%.
- Defaults to 75% reads, 25% updates (no writes).
- Reads and updates operate on preloaded documents; writes (if specified) grow the keyspace beyond preload.
Key distributions for reads.
- uniform: equal probability across existing keys.
- zipfian: favors smaller (older) keys.
- latest: favors the most recently inserted portion of the keyspace.
Profiles (v0 + v1):
- --profile mixed: YCSB-style mix honoring --reads/--writes/--updates (defaults to 75% reads, 25% updates when omitted). Requires --preload N to seed data.
- --profile writes: single-document inserts only. Ids are sequential bench/00000001+. Payload uses YCSB-like fields sized to --doc-size.
- --profile reads: read-by-id only. Requires --preload N to seed the keyspace; distribution applies.
- --profile query-by-id: parameterized query by id only. Requires --preload N. Raw HTTP posts to /databases/<db>/queries with from @all_docs where id() = $id. Measures query endpoint overhead vs. direct reads.
- --profile bulk-writes: bulk insert batches via /bulk_docs endpoint. Use --bulk-batch-size (default: 100) and --bulk-depth (default: 1) to control batch size and parallelism. Mirrors batch-writes.lua behavior.
- --profile stackoverflow-reads (or so-reads): Random reads from StackOverflow dataset: 50% questions/{sampled-ids}, 50% users/{sampled-ids}. Automatically samples existing document IDs from the database. Requires StackOverflow dataset. Mirrors full-random-reads.lua.
- --profile stackoverflow-queries (or so-queries): Parameterized queries against questions collection. Use --query-profile to select query type (equality by ID, text-prefix, text-search variants for different selectivity). Requires StackOverflow dataset.
- --profile query-users-by-name: Parameterized queries against users collection. Use --query-profile to select query type (equality by name or range by reputation). Requires Users dataset.

HTTP Version and Compression

Version negotiation
- Default --http-version auto probes HTTP/3, then 2, then 1.1.
- Use --strict-http-version to fail if the requested version isn’t available.
Compression
- Raw transport: identity, gzip, brotli, deflate (zstd not supported by .NET decompression).
- Client transport: identity, gzip, zstd (recommended for realistic client measurements).

Outputs and Integrations

Console report
- Per-step tables with throughput, error rate, client CPU, network utilization, and server metrics (if accessible).
- SNMP metrics included when --snmp-enabled (gauge metrics: CPU, memory, load; rate metrics: IO ops/sec, requests/sec).
- Knee panel and a one-line Verdict.
JSON summary
- Use --out results.json to write a structured BenchmarkSummary with options, steps, knee, verdict, HTTP version, compression, and calibration points.
- SNMP metrics (all gauges and computed rates) included if enabled.
CSV per-step
- Use --out-csv steps.csv to write per-step metrics, including both raw and normalized latencies.
- SNMP metrics included when enabled (all profile metrics with per-second rates for counters).- CI / stress benches
- Parse JSON to extract knee concurrency and throughput, fail builds beyond thresholds, or chart historical trends.
- Example (jq): jq -r '.Knee | {C: .Concurrency, Thr: .Throughput, p95: .Raw.P95, p95n: .Normalized.P95}' results.json
- Compare CSV across runs to validate regressions after changes.

Interpreting Results

Knee indicates the last reliable step. Past knee, numbers get unstable and are flagged as such.
Verdicts
- network-limited: high link utilization near configured link speed; consider compression or faster NIC.
- client-limited (CPU): generator CPU-bound; scale out clients.
- unknown: server attribution needs more counters; see server metrics.
Normalized vs raw latencies
- Normalized subtracts an RTT baseline from startup calibration, better isolating load-induced queuing.

Troubleshooting

Connection or 404 errors during calibration
- Check --url, database name, and RavenDB version. For older servers, some endpoints may differ.
Server metrics show N/A
- Server counters come from admin/debug endpoints accessed via a RavenDB client. In secured environments, ensure proper certificates/permissions or run in dev mode.
SNMP validation fails
- Ensure RavenDB server has Monitoring.Snmp.Enabled=true in settings.json and firewall allows UDP port 161.
- Test connectivity: snmpwalk -v2c -c ravendb <server-host> .1.3.6.1.4.1.45751
- See docs/snmp-metric-catalog.md for detailed troubleshooting.
High errors early, HTTP/1.x
- Socket exhaustion can hit at low C with HTTP/1. Consider HTTP/2 or HTTP/3, or decrease step concurrency.
Identity runs hit network limit fast
- That’s expected; identity exposes the “hose.” Use client + zstd for realistic production behavior.

Development

Repo structure
- App: src/RavenBench (CLI in Cli/, runner in BenchmarkRunner.cs, transports in Transport/, metrics in Metrics/).
- Tests: tests/RavenBench.Tests.
Build and test
- dotnet build -c Release
- dotnet test
Running locally
- dotnet run --project src/RavenBench -- run --url http://localhost:8080 --database ycsb --profile mixed --reads 75 --writes 25

Notes and Limitations

v1 supports closed and rate commands.
zstd on raw transport is not supported; use --transport client --compression zstd.
Server-side attribution beyond basic CPU/memory/IO requires more counters and will improve over time.

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
src		src
tests/RavenBench.Tests		tests/RavenBench.Tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
RavenBench.sln		RavenBench.sln

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Raven.Bench — RavenDB Knee-Finding Benchmark

About

Uh oh!

Releases

Packages

Languages

redknightlois/Raven.Bench

Folders and files

Latest commit

History

Repository files navigation

Raven.Bench — RavenDB Knee-Finding Benchmark

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages