Extension Proposal: Agent Trust Scoring via MCP-T #301
realpercivallabs
started this conversation in
Ideas
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Problem
UCP enables agents to discover merchants, negotiate capabilities, and complete purchases. The protocol secures identity through namespace binding and capability negotiation. What it doesn't currently address is agent reputation: should a merchant trust a specific agent to complete a transaction autonomously?
As the agent ecosystem grows, merchants will face the same problem app stores faced a decade ago: how do you tell the difference between a reliable agent and a malicious one? Current options are binary (allow/deny based on namespace) rather than graduated (trust-tiered based on behavioral history).
Proposed Extension: MCP-T Trust Scoring
MCP-T (Model Context Protocol, Trust Extension) is an open spec (CC-BY-4.0) that adds trust scoring to the MCP ecosystem. It's designed to be composable with any protocol that needs trust decisions, including UCP.
How it works with UCP
Merchants add trust requirements to their
/.well-known/ucpprofile using UCP's reverse-domain extension mechanism:{ "extensions": { "ai.percival-labs.mcp-t": { "required": true, "trust_endpoint": "https://trust-provider.example.com/mcp-t/v1", "thresholds": { "browse": { "composite_min": 0 }, "checkout": { "composite_min": 600, "dimension_mins": { "behavioral_fidelity": 700 } } }, "escalation_policy": "agents_below_threshold_require_human_approval" } } }During UCP capability negotiation, the agent (or merchant server) queries the trust provider:
{ "jsonrpc": "2.0", "method": "trust/verify", "params": { "subject_id": "did:key:z6Mk...", "domain": "financial", "threshold": { "composite_min": 600 }, "nonce": "checkout-session-id" } }Agents that meet the threshold proceed autonomously. Agents below the threshold trigger UCP's existing
requires_escalationstate.Trust Dimensions (10 default)
verificationtenureperformancecommitmentcommunityconsistencytransparencycompliancesecuritybehavioral_fidelityKey Design Choices
trust/verifyreturns binary pass/fail without revealing the actual score. ZK proofs (Level 3) extend this further.Relationship to Other UCP Discussions
Resources
The spec is CC-BY-4.0 and open for collaboration. Looking for feedback on the UCP integration pattern, the threshold model for
/.well-known/ucp, and which trust dimensions are most relevant for commerce use cases.Beta Was this translation helpful? Give feedback.
All reactions