Implement tool conflict resolution for Virtual MCP Server #2365

JAORMX · 2025-10-29T14:54:00Z

Summary

Implements tool conflict resolution strategies for the Virtual MCP Server aggregation system. This enables vMCP to merge capabilities from multiple backend MCP servers while resolving duplicate tool names.

Three conflict resolution strategies:

Prefix Strategy: Automatically prefixes all tools with workload identifier (e.g., {workload}_, {workload}., custom formats). Deterministic and requires no configuration beyond format.
Priority Strategy: Uses explicit priority ordering where first backend wins conflicts. Lower-priority conflicting tools are dropped with warnings. Non-conflicting tools from all backends are included.
Manual Strategy: Requires explicit overrides for ALL conflicts. Fails at startup with detailed error messages if any conflicts lack overrides. Safest option for production deployments.

Code reuse from existing middleware:

Extracted shared filtering/override logic from pkg/mcp/tool_filter.go into applyFilteringAndOverrides() function. Both the HTTP middleware (battle-tested) and the new aggregator use this exact same code, ensuring consistency and reducing duplication.

Key changes:

pkg/vmcp/aggregator/conflict_resolver.go - Three strategy implementations
pkg/vmcp/aggregator/tool_adapter.go - Minimal adapter to reuse middleware logic
pkg/vmcp/aggregator/default_aggregator.go - Integrated conflict resolver
pkg/mcp/tool_filter.go - Extracted shared logic, added ApplyToolFiltering() API
Comprehensive table-driven unit tests for all strategies

Per-backend tool filtering and overrides:

Applied during capability discovery BEFORE conflict resolution using the existing mcp.WithToolsFilter() and mcp.WithToolsOverride() patterns. This reduces the tool set entering conflict resolution.

Test Plan

All unit tests pass (go test ./pkg/vmcp/aggregator ./pkg/mcp)
Linting clean (task lint-fix)
Prefix strategy with different formats ({workload}_, {workload}.)
Priority strategy with 2-way and 3-way conflicts
Manual strategy validation catches unresolved conflicts
Per-backend filtering and overrides preserve InputSchema and BackendID
Existing middleware behavior unchanged (all tests pass)
Edge cases: empty priority order, description-only overrides, no conflicts
Integration test with real backend MCP servers (follow-up)

{
    name: "override references non-existent tool",
    backendID: "github",
    tools: []vmcp.Tool{
        {Name: "real_tool", Description: "Real"},
    },
    workloadConfig: &config.WorkloadToolConfig{
        Workload: "github",
        Overrides: map[string]*config.ToolOverride{
            "fake_tool": {Name: "renamed_fake"},
        },
    },
    wantCount: 1,
    wantNames: []string{"real_tool"},
},

🎯 Alignment with Best Practices

✅ Follows ToolHive Patterns

Public methods at top, private at bottom (pkg/vmcp/aggregator/conflict_resolver.go)
Clean interface segregation (ConflictResolver interface)
Proper use of contexts throughout
Factory pattern for resolver creation

✅ Go Best Practices

Proper error wrapping with fmt.Errorf and %w
No naked returns
Proper use of sync.Mutex for concurrent access
Clean separation of concerns
Good use of table-driven tests

⚠️ Minor Deviation: Gocyclo Complexity

Location: pkg/mcp/tool_filter.go:405

//nolint:gocyclo
func processEventStream(...)

Note: This is acceptable since it's inheriting complexity from existing middleware. No action needed.

📚 Alignment with Proposal

Reviewed against docs/proposals/THV-2106-virtual-mcp-server.md:

✅ Fully Aligned

Section "Tool Conflict Resolution" (lines 756-833): All three strategies implemented exactly as specified
Prefix formats: {workload}, {workload}_, {workload}. all supported
Priority behavior: First backend wins, lower priority tools dropped with warnings
Manual validation: Fails at startup with detailed error listing unresolved conflicts
Per-backend filtering/overrides: Applied before conflict resolution as specified

✅ Additional Implementation Detail

The proposal doesn't explicitly specify the order of operations, but the implementation correctly:

Applies per-backend filtering/overrides FIRST (processBackendTools)
Then applies conflict resolution (ResolveToolConflicts)

This is the right order and aligns with the proposal's intent.

🔍 Detailed Code Review Notes

`conflict_resolver.go`

Line 53-58: Good collision detection even with prefixing (rare but handled)
Line 105-108: Excellent validation of priority order (no empty strings)
Line 111: O(1) priority lookup map - smart optimization
Line 267-269: Clean validation separation in manual resolver
Line 374-384: Very clear error messages for unresolved conflicts

`tool_adapter.go`

Line 42-56: Smart reverse map for preserving InputSchema after overrides
Line 79-84: Excellent handling of name lookups (checks reverse map)
Line 93: Correctly uses backendID parameter as source of truth (not tool.BackendID)

`tool_filter.go`

Line 82-103: New ApplyToolFiltering() function is clean and reusable
Line 539-566: Extracted logic is identical to original (verified)
Line 568-577: Helper function for reverse name lookup

`default_aggregator.go`

Line 68: Per-backend filtering applied at right time (before conflict resolution)
Line 96: Concurrency limit prevents DoS (good)
Line 108-110: Graceful failure handling (logs but continues)
Line 159-178: Sensible fallback when no resolver configured

🎯 Test Quality Assessment

Excellent Coverage

Prefix strategy: Multiple formats, conflicts, no conflicts
Priority strategy: 2-way, 3-way conflicts, empty priority order
Manual strategy: All conflicts resolved, unresolved, description-only overrides
Tool adapter: Filtering, overrides, schema preservation

Test Quality Metrics

✅ Table-driven (idiomatic Go)
✅ Parallel execution (t.Parallel())
✅ Clear test names
✅ Good coverage of happy/sad paths
✅ Edge cases included

📋 Recommendations Summary

Must Fix (None)

No blocking issues found.

Should Fix

Add test case for override referencing non-existent tool (Medium priority)

Nice to Have

Consider making maxConcurrentQueries configurable (Low priority, future)
Standardize error wrapping pattern (Low priority)
Consider optimizing map allocation in processBackendTools (Very low priority)

✅ Final Verdict

Approve with minor suggestions

This is high-quality code that:

✅ Implements the proposal specifications correctly
✅ Follows ToolHive and Go best practices
✅ Has no security vulnerabilities
✅ Includes comprehensive test coverage
✅ Reuses existing battle-tested middleware logic
✅ Has excellent error handling and logging

The suggested improvements are all optional and don't block merging. Great work! 🎉

Todo List

Read THV-2106 proposal document to understand requirements
Review conflict_resolver.go for implementation correctness
Review tool_adapter.go for proper middleware integration
Review tool_filter.go refactoring
Review default_aggregator.go integration
Review test coverage and quality
Check for security issues
Verify alignment with Go and ToolHive best practices
Provide comprehensive feedback

codecov · 2025-10-29T14:59:14Z

Codecov Report

❌ Patch coverage is 87.81870% with 43 lines in your changes missing coverage. Please review.
✅ Project coverage is 54.22%. Comparing base (216a280) to head (1bd6867).
⚠️ Report is 4 commits behind head on main.

Files with missing lines	Patch %	Lines
pkg/mcp/tool_filter.go	78.84%	10 Missing and 1 partial ⚠️
pkg/vmcp/aggregator/tool_adapter.go	80.95%	5 Missing and 3 partials ⚠️
pkg/vmcp/aggregator/default_aggregator.go	86.27%	5 Missing and 2 partials ⚠️
pkg/vmcp/aggregator/manual_resolver.go	90.14%	4 Missing and 3 partials ⚠️
pkg/vmcp/aggregator/prefix_resolver.go	78.57%	4 Missing and 2 partials ⚠️
pkg/vmcp/aggregator/conflict_resolver.go	93.93%	2 Missing ⚠️
pkg/vmcp/aggregator/priority_resolver.go	97.36%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2365      +/-   ##
==========================================
+ Coverage   53.79%   54.22%   +0.42%     
==========================================
  Files         254      259       +5     
  Lines       24509    24824     +315     
==========================================
+ Hits        13185    13461     +276     
- Misses      10039    10068      +29     
- Partials     1285     1295      +10

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

pkg/mcp/tool_filter.go

pkg/vmcp/aggregator/conflict_resolver.go

pkg/vmcp/aggregator/tool_adapter.go

pkg/vmcp/aggregator/conflict_resolver.go

yrobla

added some comments

JAORMX · 2025-10-29T15:46:39Z

@claude be cool and address the comments

claude · 2025-10-29T15:46:57Z

Claude finished @JAORMX's task —— View job

Addressing Review Comments ✅

All review comments from @yrobla have been addressed!

Read and understand all review comments
Address comment 1: O(n²) complexity in tool_filter.go:515
Address comment 2: Conflict resolution logic in conflict_resolver.go:63 (skipped - comment is clear enough)
Address comment 3: Potential nil panic in tool_adapter.go:86
Address comment 4: Dropping all backends in conflict_resolver.go:164
Run tests (task test)
Commit and push changes

Changes Made

Fixed potential nil panic (pkg/vmcp/aggregator/tool_adapter.go:86):
- Added existence check before accessing originalToolsByName[originalName]
- If tool not found, logs warning and skips instead of panicking
- Changed from fixed-size slice to append pattern to handle skipped tools
Added complexity clarification (pkg/mcp/tool_filter.go:511):
- Added comment explaining O(n²) loop is acceptable for small tool lists in non-hot path
- Typical tool lists are < 100 tools per backend
- Inner loop breaks early on match
Improved warning message (pkg/vmcp/aggregator/conflict_resolver.go:165):
- Enhanced logging when priority strategy drops all conflicting tools from backends not in priority list
- Now shows which backends were involved: Tool foo exists in multiple backends ["backend1", "backend2"] but none are in priority order, dropping all instances
- Added clarifying comment that this behavior is intentional
Comment 2 - No change needed:
- The counter at conflict_resolver.go:63 already has a clear comment
- It tracks when prefix strategy changed a tool name (e.g., list_repos → github_list_repos)

Test Results

All tests passing:

ok  	github.com/stacklok/toolhive/pkg/vmcp/aggregator	0.066s
ok  	github.com/stacklok/toolhive/pkg/mcp	0.053s

Commit

Pushed commit f58142b to branch vmcp-conflict-resolution:

pkg/vmcp/aggregator/default_aggregator.go

pkg/vmcp/aggregator/conflict_resolver.go

Add three conflict resolution strategies (prefix, priority, manual) to handle duplicate tool names across multiple backend MCP servers in vMCP aggregation. This implements the aggregation conflict resolution portion of the Virtual MCP Server proposal (THV-2106), enabling vMCP to merge capabilities from multiple backends while resolving naming conflicts. Key features: - Prefix strategy: automatically prefixes tools with workload identifier (supports {workload}_, {workload}., custom formats) - Priority strategy: explicit ordering with first-wins semantics (drops lower-priority conflicting tools with warnings) - Manual strategy: requires explicit overrides with startup validation (fails if any conflicts lack overrides, safest for production) - Reuses existing mcp.WithToolsFilter/Override middleware logic - Per-backend tool filtering and overrides applied before conflict resolution - Tracks conflict metadata (count resolved, strategy used) Implementation: - Extracted shared filtering/override logic from pkg/mcp/tool_filter.go - Created applyFilteringAndOverrides() as single source of truth - Both HTTP middleware and aggregator use the same battle-tested code - Updated defaultAggregator to integrate conflict resolver - Comprehensive table-driven unit tests for all strategies This follows DDD principles with clear bounded contexts and maintains backward compatibility with existing middleware behavior. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

- Add nil check in tool_adapter.go to prevent panic when tool not found in originalToolsByName map - Add comment explaining O(n²) complexity is acceptable for small tool lists - Improve warning message when priority strategy drops tools from backends not in priority list - Show which backends were involved when dropping conflicting tools Co-authored-by: Juan Antonio Osorio <JAORMX@users.noreply.github.com>

Address review feedback by adding detailed comments explaining: 1. What "resolved conflict" means in the context of conflict resolution: - Prefix strategy: Renaming tools proactively prevents collisions - Manual strategy: Explicit overrides resolve existing conflicts - Priority strategy: Drops duplicates rather than renaming 2. Manual resolver validation: Documents that collision detection after override prevents new conflicts from being introduced by bad config These clarifications help reviewers understand that renaming IS conflict resolution, not just a side effect. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Remove the ConflictsResolved counter as it was not defined in the proposal and caused confusion about what constitutes a "resolved conflict" vs a preventive renaming. The ConflictStrategy field remains to indicate which resolution approach was used (prefix, priority, manual), which is sufficient for observability. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Split conflict_resolver.go into focused files for better code organization: - prefix_resolver.go: PrefixConflictResolver implementation - priority_resolver.go: PriorityConflictResolver implementation - manual_resolver.go: ManualConflictResolver implementation - conflict_resolver.go: Factory function and shared helpers Each strategy now has its own file, making the code easier to navigate and maintain. Shared helpers (groupToolsByName, toolWithBackend) remain in conflict_resolver.go for reuse. No functional changes - pure refactoring. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>

Change priority resolver to use prefix strategy for backends not in the priority list instead of dropping their conflicting tools. This prevents data loss while maintaining explicit control for prioritized backends. Behavior: - Backends in priority list: priority strategy (first wins) - Backends NOT in priority list with conflicts: prefix strategy fallback - Example: priority_order=["github"], but slack+teams both have "send_message" Result: "slack_send_message" and "teams_send_message" (both included) This addresses review feedback about dropping tools unnecessarily and provides a more practical default behavior. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>