Contributing

Adding New Tasks
- Creating Command-Based Probes
Testing
- Trialing against Agent Sandboxes
- Known Working Tooling Versions
- Known Limitations
  - macOS UDP Port Scanning
Testing in AI Code Assistants
- Claude Code
- Gemini Code Assist (Podman)

Adding New Tasks

Create a new task struct in pkg/tasks/baseline/

Implement the Task interface:

type Task interface {
    GetName() string
    Run(ctx context.Context, ti Inputs) ([]*reportv1.Finding, error)
}

Add the task to GetBaselineTasks() in pkg/tasks/baseline.go
Define expected types in pkg/tasks/tasks.go

Creating Command-Based Probes

For tasks that execute system commands, use the generic command-based probe pattern in pkg/tasks/cmd-based/:

Define your probe struct with the data it will collect:

type myCustomProbe struct {
    result []string
}

Implement the CmdTask[T] interface:

// getCommand returns the command and arguments to execute
func (p *myCustomProbe) getCommand() ([]string, error) {
    return []string{"mycommand", "--arg1", "--arg2"}, nil
}

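// parseCommandOuput parses the command output into your struct
func (p *myCustomProbe) parseCommandOuput(out []byte) (*myCustomProbe, error) {
    // Split the raw output into lines and keep the non-empty ones.
    // Replace this with parsing logic specific to your command.
    var parsed []string
    for _, line := range strings.Split(string(out), "\n") {
        if line = strings.TrimSpace(line); line != "" {
            parsed = append(parsed, line)
        }
    }
    return &myCustomProbe{result: parsed}, nil
}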

Execute the probe using the generic runner:

probe := &myCustomProbe{}
result, err := runCmdTask(probe)

Write tests using the mock pattern:

func TestMyProbe(t *testing.T) {
    mockExec := func(_ string, _ ...string) ([]byte, error) {
        return []byte("mock output"), nil
    }
    testProbe(t, "myCustomProbe", &myCustomProbe{}, mockExec, expectedResult)
}

See pkg/tasks/cmd-based/processes.go for complete examples.

Testing

Trialing against Agent Sandboxes

The easiest way to run the probe against agent sandboxes is to use the scripts in ./tests.

# Run all e2e tests
make e2etest

# Format code
make fmt

# Install buf (Protocol Buffer tool)
make install-buf

# Generate Protocol Buffer code
cd api && buf generate

Known Working Tooling Versions

Several of these tools receive frequent updates, and their CLI interfaces (or even system prompts) aren't necessarily stable. These are the versions we've tested against:

Program	Version
Claude Code	`2.1.39`
Nono	`0.4.1`
Gemini	`0.28.2`

Known Limitations

macOS UDP Port Scanning

UDP port scanning is disabled on macOS (Darwin) due to reliability issues:

Issue: The current UDP scanning method relies on timeout behavior to determine port status. On macOS, all ports time out regardless of their actual state, leading to false positives.
Workaround: The ScanUDP() function in pkg/tasks/baseline/network.go returns an empty slice on macOS systems.
Future Enhancement: OS-specific UDP scanning methods (e.g., using netstat, lsof, or native syscalls) are planned for more accurate detection across all platforms.

// From pkg/tasks/baseline/network.go
func ScanUDP(host string) []int {
    // TODO: fix usage in darwin
    // it reports all the ports because they all timeout
    if runtime.GOOS == "darwin" {
        return []int{}
    }
    // ... scanning logic ...
}

Testing in AI Code Assistants

For reference, see the scripts in the tests folder.

Claude Code

./scripts/run-claude.sh "Execute !bin/sandbox-probe"

Gemini Code Assist (Podman)

./scripts/run-gemini-podman.sh "bin/sandbox-probe"
