19 changes: 19 additions & 0 deletions .env.example
@@ -0,0 +1,19 @@
# Open WebUI Benchmark Configuration
# Copy this file to .env and configure for your environment

# Base URL of the Open WebUI instance to benchmark
OPEN_WEBUI_URL=http://localhost:8080

# Admin user credentials (REQUIRED - must exist in Open WebUI)
# The admin account is used to:
# - Create test channels
# - Create temporary benchmark users dynamically
# - Clean up benchmark users after tests
ADMIN_USER_EMAIL=admin@example.com
ADMIN_USER_PASSWORD=adminpassword123

# Benchmark settings (optional - defaults shown)
MAX_CONCURRENT_USERS=50
USER_STEP_SIZE=10
SUSTAIN_TIME_SECONDS=30
MESSAGE_FREQUENCY=0.5
50 changes: 50 additions & 0 deletions .gitignore
@@ -0,0 +1,50 @@
# Benchmark results directory
results/

# Python cache
__pycache__/
*.py[cod]
*$py.class
*.so

# Distribution / packaging
.Python
build/
develop-eggs/
dist/
downloads/
eggs/
.eggs/
lib/
lib64/
parts/
sdist/
var/
wheels/
*.egg-info/
.installed.cfg
*.egg

# Virtual environments
venv/
ENV/
env/
.venv/
**/.env

# IDE
.idea/
.vscode/
*.swp
*.swo

# Logs
logs/
*.log

# Docker
.docker/

# OS
.DS_Store
Thumbs.db
300 changes: 300 additions & 0 deletions README.md
@@ -0,0 +1,300 @@
# Open WebUI Benchmark Suite

A comprehensive benchmarking framework for testing Open WebUI performance under various load conditions.

## Overview

This benchmark suite is designed to:

1. **Measure concurrent user capacity** - Test how many users can simultaneously use features like Channels
2. **Identify performance limits** - Find the point where response times degrade
3. **Compare compute profiles** - Test performance across different resource configurations
4. **Generate actionable reports** - Provide detailed metrics and recommendations

## Quick Start

### Prerequisites

- Python 3.11+
- Docker and Docker Compose
- A running Open WebUI instance (or use the provided Docker setup)

### Installation

```bash
cd benchmark
python -m venv .venv
source .venv/bin/activate # On Windows: .venv\Scripts\activate
pip install -e .
```

### Configuration

Copy the example environment file and configure your admin credentials:

```bash
cp .env.example .env
```

Edit `.env` with your Open WebUI admin credentials:

```dotenv
OPEN_WEBUI_URL=http://localhost:8080
ADMIN_USER_EMAIL=your-admin@example.com
ADMIN_USER_PASSWORD=your-password
```

### Running Benchmarks

1. **Start Open WebUI with benchmark configuration:**

```bash
cd docker
./run.sh default # Use the default compute profile (2 CPU, 8GB RAM)
```

2. **Run the benchmark:**

```bash
# Run all benchmarks
owb run all

# Run only channel concurrency benchmark
owb run channels -m 50 # Test up to 50 concurrent users

# Run with a specific target URL
owb run channels -u http://localhost:3000

# Run with a specific compute profile
owb run channels -p cloud_medium
```

3. **View results:**

Results are saved to the `results/` directory in JSON and CSV formats.

## Compute Profiles

Compute profiles define the resource constraints for the Open WebUI container:

| Profile | CPUs | Memory | Use Case |
|---------|------|--------|----------|
| `default` | 2 | 8GB | Local MacBook testing |
| `minimal` | 1 | 4GB | Testing lower bounds |
| `cloud_small` | 2 | 4GB | Small cloud VM |
| `cloud_medium` | 4 | 8GB | Medium cloud VM |
| `cloud_large` | 8 | 16GB | Large cloud VM |
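
For reference, a single profile entry in `config/compute_profiles.yaml` might look like the sketch below (the field names are illustrative assumptions; check the shipped file for the exact schema):

```yaml
# Hypothetical shape of one entry in config/compute_profiles.yaml
cloud_medium:
  cpus: 4.0                    # container CPU limit
  memory: 8g                   # container memory limit
  description: Medium cloud VM
```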

List available profiles:

```bash
owb profiles
```

## Available Benchmarks

### Channel Concurrency (`channels`)

Tests concurrent user capacity in Open WebUI Channels:

- Creates a test channel
- Progressively adds users (10, 20, 30, ... up to max)
- Each user sends messages at a configured rate
- Measures response times and error rates
- Identifies the maximum sustainable user count

**Configuration options:**

```yaml
channels:
  max_concurrent_users: 100   # Maximum users to test
  user_step_size: 10          # Increment users by this amount
  sustain_time: 30            # Seconds to run at each level
  message_frequency: 0.5      # Messages per second per user
```
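
As a mental model, the ramp-up logic can be sketched as below (a simplified sketch only; the real implementation lives in `benchmark/scenarios/channels.py`, and `run_level` is a hypothetical helper):

```python
# Simplified sketch of the ramp-up loop. The pass/fail thresholds follow
# the "Key Metrics" table below; `run_level` is a hypothetical helper that
# spawns `level` users for the configured sustain time and returns stats.
async def find_max_sustainable(max_users: int, step: int) -> int:
    max_sustainable = 0
    for level in range(step, max_users + 1, step):
        stats = await run_level(level)
        if stats["p95_response_time_ms"] >= 3000 or stats["error_rate_percent"] >= 1.0:
            break  # thresholds exceeded; the previous level is the maximum
        max_sustainable = level
    return max_sustainable
```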

### Channel WebSocket (`channels-ws`)

Tests WebSocket scalability for real-time message delivery.
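
A minimal latency probe built on `python-socketio` (the WebSocket client listed under Dependencies) might look like the sketch below; the event name and auth payload are assumptions, not Open WebUI's confirmed socket API:

```python
import asyncio
import time

import socketio  # python-socketio, listed under Dependencies


async def probe(url: str, token: str) -> None:
    sio = socketio.AsyncClient()
    received = asyncio.Event()

    @sio.on("channel-events")  # hypothetical event name
    async def on_message(data):
        received.set()

    await sio.connect(url, auth={"token": token})  # auth payload is an assumption
    start = time.perf_counter()
    # ...trigger a channel message via the HTTP API here...
    await received.wait()  # wait for the real-time echo over the socket
    print(f"Delivery latency: {(time.perf_counter() - start) * 1000:.1f} ms")
    await sio.disconnect()
```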

## Configuration Reference

Configuration files are located in `config/`:

- `benchmark_config.yaml` - Main benchmark settings
- `compute_profiles.yaml` - Resource profiles for Docker containers

### Environment Variables

| Variable | Description | Default |
|----------|-------------|---------|
| `OPEN_WEBUI_URL` | Open WebUI URL for benchmarking | `http://localhost:8080` |
| `ADMIN_USER_EMAIL` | Admin user email (required) | - |
| `ADMIN_USER_PASSWORD` | Admin user password (required) | - |
| `OPEN_WEBUI_PORT` | Port for Docker container | `8080` |
| `CPU_LIMIT` | CPU limit for container | `2.0` |
| `MEMORY_LIMIT` | Memory limit for container | `8g` |
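
These limits presumably reach the container through Docker Compose variable substitution; a hypothetical fragment of `docker/docker-compose.yaml` wiring them up:

```yaml
# Hypothetical compose fragment; check docker/ for the actual file
services:
  open-webui:
    ports:
      - "${OPEN_WEBUI_PORT:-8080}:8080"
    cpus: "${CPU_LIMIT:-2.0}"
    mem_limit: "${MEMORY_LIMIT:-8g}"
```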

## Extending the Benchmark Suite

### Adding a New Benchmark

1. Create a new file in `benchmark/scenarios/`:

```python
from benchmark.core.base import BaseBenchmark
from benchmark.core.metrics import BenchmarkResult

class MyNewBenchmark(BaseBenchmark):
    name = "My New Benchmark"
    description = "Tests something new"
    version = "1.0.0"

    async def setup(self) -> None:
        # Set up the test environment
        pass

    async def run(self) -> BenchmarkResult:
        # Execute the benchmark
        # Use self.metrics to record timings
        return self.metrics.get_result(self.name)

    async def teardown(self) -> None:
        # Clean up
        pass
```

2. Register the benchmark in `benchmark/cli.py` (see the sketch after this list)

3. Add configuration options if needed in `config/benchmark_config.yaml`
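
A registration sketch, assuming `cli.py` keeps a simple name-to-class mapping (a hypothetical structure; it may use `click`/`typer` subcommands instead, so adapt accordingly):

```python
# In benchmark/cli.py -- hypothetical registry; adjust to the file's
# actual structure.
from benchmark.scenarios.my_new_benchmark import MyNewBenchmark  # your new module

BENCHMARKS = {
    # ...existing entries such as "channels"...
    "my-new": MyNewBenchmark,  # enables `owb run my-new`
}
```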

### Custom Metrics Collection

```python
from benchmark.core.metrics import MetricsCollector

metrics = MetricsCollector()
metrics.start()

# Time individual operations
with metrics.time_operation("my_operation"):
    await do_something()

# Or record manually
metrics.record_timing(
    operation="api_call",
    duration_ms=150.5,
    success=True,
)

metrics.stop()
result = metrics.get_result("My Benchmark")
```

## Understanding Results

### Key Metrics

| Metric | Description | Good Threshold |
|--------|-------------|----------------|
| `avg_response_time_ms` | Average response time | < 2000ms |
| `p95_response_time_ms` | 95th percentile response time | < 3000ms |
| `error_rate_percent` | Percentage of failed requests | < 1% |
| `requests_per_second` | Throughput | > 10 |
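
For intuition, the headline numbers can be derived from raw per-request timings roughly as in this sketch (`MetricsCollector` computes them internally):

```python
import statistics


def summarize(durations_ms: list[float], failures: int, elapsed_s: float) -> dict:
    """Sketch of how the headline metrics fall out of raw timings."""
    total = len(durations_ms) + failures
    cuts = statistics.quantiles(durations_ms, n=100)  # 99 percentile cut points
    return {
        "avg_response_time_ms": statistics.fmean(durations_ms),
        "p95_response_time_ms": cuts[94],  # 95th percentile
        "error_rate_percent": 100.0 * failures / total,
        "requests_per_second": total / elapsed_s,
    }
```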

### Result Files

- `*.json` - Detailed results for each benchmark run
- `benchmark_results_*.csv` - Combined results in CSV format
- `summary_*.txt` - Human-readable summary
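
Since `pandas` is already a dependency, the combined CSV lends itself to quick ad-hoc analysis (a sketch; the column names are assumed to match the metrics table above):

```python
import glob

import pandas as pd

# Load the most recent combined results file (pattern from the list above)
latest = sorted(glob.glob("results/benchmark_results_*.csv"))[-1]
df = pd.read_csv(latest)
print(df[["avg_response_time_ms", "p95_response_time_ms", "error_rate_percent"]].describe())
```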

### Interpreting Channel Benchmark Results

The channel benchmark reports:

- **max_sustainable_users**: Maximum users where performance thresholds are met
- **results_by_level**: Performance at each user count level
- **tested_levels**: All user counts that were tested

Example result analysis:

```
Users: 10 | P95: 150ms | Errors: 0% | ✓ PASS
Users: 20 | P95: 280ms | Errors: 0.1% | ✓ PASS
Users: 30 | P95: 520ms | Errors: 0.3% | ✓ PASS
Users: 40 | P95: 1200ms | Errors: 0.8% | ✓ PASS
Users: 50 | P95: 3500ms | Errors: 2.1% | ✗ FAIL

Maximum sustainable users: 40
```

## Architecture

```
benchmark/
├── benchmark/
│   ├── core/                   # Core framework
│   │   ├── base.py             # Base benchmark class
│   │   ├── config.py           # Configuration management
│   │   ├── metrics.py          # Metrics collection
│   │   └── runner.py           # Benchmark orchestration
│   ├── clients/                # API clients
│   │   ├── http_client.py      # HTTP/REST client
│   │   └── websocket_client.py # WebSocket client
│   ├── scenarios/              # Benchmark implementations
│   │   └── channels.py         # Channel benchmarks
│   ├── utils/                  # Utilities
│   │   └── docker.py           # Docker management
│   └── cli.py                  # Command-line interface
├── config/                     # Configuration files
├── docker/                     # Docker Compose for benchmarking
└── results/                    # Benchmark output (gitignored)
```

## Dependencies

The benchmark suite reuses Open WebUI dependencies where possible:

**From Open WebUI:**
- `httpx` - HTTP client
- `aiohttp` - Async HTTP
- `python-socketio` - WebSocket client
- `pydantic` - Data validation
- `pandas` - Data analysis

**Benchmark-specific:**
- `locust` - Load testing (optional, for advanced scenarios)
- `rich` - Terminal output
- `docker` - Docker SDK
- `matplotlib` - Plotting results

## Troubleshooting

### Common Issues

1. **Connection refused**: Ensure Open WebUI is running and accessible
2. **Authentication errors**: Check admin credentials in config
3. **Docker resource errors**: Ensure Docker has enough resources allocated
4. **WebSocket timeout**: Increase `websocket_timeout` in config

### Debug Mode

Set logging level to DEBUG:

```bash
export BENCHMARK_LOG_LEVEL=DEBUG
owb run channels
```

## Contributing

When adding new benchmarks:

1. Follow the `BaseBenchmark` interface
2. Add tests for the new benchmark
3. Update configuration schema if needed
4. Add documentation to this README

## License

MIT License - See LICENSE file
19 changes: 19 additions & 0 deletions benchmark/__init__.py
@@ -0,0 +1,19 @@
"""
Open WebUI Benchmark Suite

A comprehensive benchmarking framework for testing Open WebUI performance
under various load conditions.
"""

__version__ = "0.1.0"
__author__ = "Open WebUI Benchmark Team"

from benchmark.core.runner import BenchmarkRunner
from benchmark.core.config import BenchmarkConfig
from benchmark.core.metrics import MetricsCollector

__all__ = [
    "BenchmarkRunner",
    "BenchmarkConfig",
    "MetricsCollector",
]