OpenCode Harness - Autonomous Coding Agent

⚠️ PROJECT DISCONTINUED ⚠️

This project has been discontinued due to critical issues with the OpenCode SDK's model selection behavior. Despite explicit configuration for free models, the OpenCode server consistently ignores the model parameter in .opencode.json and automatically selects expensive models (Gemini 3 Pro, Claude Haiku via Amazon Bedrock), resulting in unexpected costs.

Root Cause: The OpenCode SDK/Server has its own model selection logic that overrides client-side configuration, making cost control impossible for autonomous long-running agents.

See Issue: [Link to GitHub issue will be added]

Original Description

An autonomous coding agent that uses OpenCode (Claude-powered IDE) to automatically build applications from specifications. The agent iteratively implements features, runs tests, and refines code until the application is complete.

Features

Autonomous Development: AI agent implements features independently
Test-Driven Development: Works from a detailed feature list with test specifications
Multi-Model Support: Works with Anthropic Claude, OpenRouter, and free models
Flexible Architecture: Supports both provider/model and provider/vendor/model formats
Docker Integration: OpenCode server runs in Docker with volume mounts
Progress Tracking: Monitors feature completion across sessions
Security: Bash command allowlist and filesystem restrictions

Architecture

┌─────────────────┐     ┌──────────────────┐     ┌─────────────────┐
│  Python Client  │────▶│  OpenCode Server │────▶│  LLM Provider   │
│  (agent.py)     │     │  (Docker)        │     │  (OpenRouter)   │
└─────────────────┘     └──────────────────┘     └─────────────────┘
         │                       │
         │                       ▼
         │              ┌──────────────────┐
         └─────────────▶│  Project Dir     │
                        │  (Volume Mount)  │
                        └──────────────────┘

Quick Start

1. Prerequisites

Docker Desktop (for OpenCode server)
Python 3.11+
OpenRouter API Key (or Anthropic API Key)

2. Install Dependencies

pip install -r requirements.txt

3. Start OpenCode Server

# With OpenRouter (recommended for free models)
docker run -d -p 4096:4096 \
  -e OPENROUTER_API_KEY='your-key-here' \
  -v "$(pwd)/your-project:/workspace" \
  --workdir /workspace \
  --name opencode-server \
  ghcr.io/sst/opencode serve --port 4096 --hostname 0.0.0.0

# Or with Anthropic (paid, higher quality)
docker run -d -p 4096:4096 \
  -e ANTHROPIC_API_KEY='your-key-here' \
  -v "$(pwd)/your-project:/workspace" \
  --workdir /workspace \
  --name opencode-server \
  ghcr.io/sst/opencode serve --port 4096 --hostname 0.0.0.0

4. Create Your App Specification

Create prompts/app_spec.txt with your application requirements:

Build a CLI Todo Application

Requirements:
- Add tasks with descriptions
- List all tasks with status
- Mark tasks as completed
- Store tasks in JSON file
- Python implementation

5. Run the Agent

# With free model (Google Gemini)
python autonomous_agent_demo.py --project-dir ./my-app --model openrouter/google/gemini-flash-1.5-8b:free

# With paid model (Claude)
python autonomous_agent_demo.py \
  --project-dir ./my-app \
  --model anthropic/claude-sonnet-4-5-20250929

# With OpenRouter + Claude
python autonomous_agent_demo.py --project-dir ./my-app --model openrouter/anthropic/claude-3.5-sonnet

# With OpenRouter + Mistral 7B (free)
python autonomous_agent_demo.py --project-dir ./my-app --model openrouter/mistralai/mistral-7b-instruct:free

How It Works

Initialization Phase: Agent reads app_spec.txt and creates:
- feature_list.json - Test cases for all features
- init.sh - Setup script
- Basic project structure
Implementation Phase: Agent iteratively:
- Picks next unimplemented feature
- Writes code and tests
- Runs tests
- Marks feature as passing when tests succeed
Completion: Continues until all features pass

Configuration

Model Selection

The agent supports flexible model formats:

# Format: provider/model
--model anthropic/claude-sonnet-4-5-20250929

# Format: provider/vendor/model  
--model openrouter/anthropic/claude-3.5-sonnet

# Free models
--model openrouter/google/gemini-flash-1.5-8b:free
--model openrouter/mistralai/mistral-7b-instruct:free

Max Tokens

Configured automatically based on model:

Free models: 1000 tokens
Paid models: 4096 tokens

new

docker run -d -p 4096:4096 --name opencode-server -e OPENROUTER_API_KEY="yourkey" -e DEFAULT_MODEL="openrouter/mistralai/mistral-7b-instruct:free" ghcr.io/sst/opencode opencode serve --port 4096 --hostname 0.0.0.0

Security

Edit security.py to customize:

Bash command allowlist
Filesystem restrictions
Permission settings

Project Structure

opencode_harness_autonomous_coding/
├── agent.py                    # Core agent logic
├── client.py                   # OpenCode client wrapper
├── autonomous_agent_demo.py    # Main entry point
├── prompts.py                  # Prompt management
├── security.py                 # Security configuration
├── prompts/
│   ├── app_spec.txt           # Your app specification
│   ├── initializer_prompt.md  # First-run prompt
│   └── coding_prompt.md       # Feature implementation prompt
├── tests/                      # Test suite
└── README.md

API Keys

OpenRouter (Recommended)

Sign up at https://openrouter.ai
Add credits: https://openrouter.ai/settings/credits
Create API key: https://openrouter.ai/settings/keys
Use in Docker: -e OPENROUTER_API_KEY='sk-or-v1-...'

Note: Free models require ~$5 minimum credits due to server-side max_tokens settings.

Anthropic

Sign up at https://console.anthropic.com
Create API key: https://console.anthropic.com/settings/keys
Use in Docker: -e ANTHROPIC_API_KEY='sk-ant-api03-...'

Troubleshooting

"No API keys found"

This warning is safe to ignore if your server has the key
Keys are set in Docker container, not Python client

"Connection error"

Ensure OpenCode server is running: docker ps
Check logs: docker logs opencode-server

"Request requires more credits"

Add credits to your OpenRouter account
Or use a paid Anthropic API key

Files created in wrong location

Ensure Docker volume mount is correct: -v "$(pwd)/project:/workspace"
Check --workdir /workspace is set

Advanced Usage

Custom Iteration Limit

python autonomous_agent_demo.py \
  --project-dir ./my-app \
  --model openrouter/google/gemini-flash-1.5-8b:free \
  --max-iterations 10

Resume Existing Project

The agent automatically detects existing feature_list.json and continues from where it left off.

Adjust Feature Count

Edit prompts/initializer_prompt.md to change the number of features generated (default: 3 for testing, increase for production).

Development

Running Tests

# All tests
pytest

# Specific test
pytest tests/test_agent.py::test_agent_session

# With coverage
pytest --cov=. --cov-report=html

Code Style

Follow PEP 8 conventions
Type hints required
Use async/await for OpenCode SDK calls
See AGENTS.md for detailed guidelines

Contributing

Fork the repository
Create a feature branch
Make your changes
Run tests: pytest
Submit a pull request

License

MIT License - see LICENSE file for details.

Why This Project Was Discontinued

The Problem

Despite configuring the OpenCode client with explicit free model settings:

{
  "model": "openrouter/meta-llama/llama-3.1-8b-instruct:free",
  "max_tokens": 200,
  ...
}

And passing model parameters via the SDK:

result = await client.session.chat(
    session_id,
    model_id="mistralai/mistral-7b-instruct:free",
    provider_id="openrouter",
    parts=[{"type": "text", "text": message}],
    extra_body={"max_tokens": 200}
)

The OpenCode server consistently ignored these settings and selected expensive models:

google/gemini-3-pro-preview ($0.15/request)
claude-haiku-4.5 via Amazon Bedrock ($0.01-0.02/request)

Evidence

From OpenRouter usage logs during testing:

Dec 10, 08:56 PM - Gemini 3 Pro Preview - 19,849 tokens - $0.152
Dec 10, 08:56 PM - Claude Haiku 4.5 - 12,290 tokens - $0.0125
Dec 10, 08:54 PM - Gemini 3 Pro Preview - 12,617 tokens - $0.0136
Dec 10, 08:54 PM - Claude Haiku 4.5 - 1,862 tokens - $0.00192
...

Result: A single 10-minute test session cost ~$0.30-0.50 instead of $0.00 (free tier).

Attempted Solutions

✅ Configured .opencode.json with free model
✅ Passed explicit model parameters via SDK
✅ Set environment variables (DEFAULT_MODEL, OPENROUTER_API_KEY)
✅ Used Docker with environment configuration
❌ None of these approaches prevented the server from selecting paid models

Conclusion

The OpenCode SDK is unsuitable for cost-sensitive autonomous agents because:

Model selection is controlled server-side, not client-side
Configuration files and SDK parameters are ignored
No way to enforce free-tier models
Unexpected costs accumulate rapidly in long-running sessions

Alternative Approaches

For autonomous coding agents with cost control, consider:

Direct API Integration - Use OpenRouter/Anthropic APIs directly without OpenCode
Custom Harness - Implement your own tool execution layer (~200 lines)
LangChain/LangGraph - Use established frameworks with better cost controls

See the Anthropic Guide on Long-Running Agents for implementation patterns that don't require expensive SDKs.

Credits

Originally built on OpenCode by SST
Inspired by Anthropic's Long-Running Agents Guide
Uses OpenRouter for multi-model access
Python SDK: opencode-ai

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
.claude		.claude
.github/workflows		.github/workflows
legacy_claude_code		legacy_claude_code
migration-docs		migration-docs
prompts		prompts
test4		test4
.gitignore		.gitignore
AGENTS.md		AGENTS.md
OPENCODE_ISSUE.md		OPENCODE_ISSUE.md
README.md		README.md
TEST_PLAN.md		TEST_PLAN.md
agent.py		agent.py
autonomous_agent_demo.py		autonomous_agent_demo.py
client.py		client.py
create_features.py		create_features.py
progress.py		progress.py
prompts.py		prompts.py
requirements.txt		requirements.txt
security.py		security.py

enving/opencode_harness_autonomous_coding

Folders and files

Latest commit

History

Repository files navigation