Pelican MCP for public namespaces #2808

h2zh · 2025-11-18T03:47:52Z

This PR comes with Pelican Docs updates. Check out how to use this feature there (spoiler: only one command required!)

Benefits

This Model Context Protocol (MCP) integration lets researchers access scientific datasets using natural language through AI assistants such as Claude Code and VS Code Copilot. This positions Pelican at the forefront of AI-powered scientific computing and helps it reach a wider user base.

MCP is the emerging standard for connecting AI assistants to external tools and data. Think of it as "USB for AI assistants" - a standardized protocol that allows any MCP-compatible AI assistant (Claude, VS Code Copilot, Cline, etc.) to interact with external tools and data sources. As a data federation CLI tool, the Pelican Client is a natural fit for an MCP server that plugs into the agentic AI world.

For users, there is no need to memorize complicated commands - the AI can understand most download requests. It can also suggest the next step, or even carry out an entire workflow from data download to data analysis and eventually produce a report on its own.

System design in a nutshell

The AI assistant (e.g. VS Code Copilot) spawns an MCP server process, which is just a thin JSON-RPC wrapper around the existing client API. The MCP server does NOT execute pelican object get as a subprocess. Instead, it imports and calls the client library functions (e.g. client.DoGet) directly.

Limitation

This MCP server only supports public namespaces, because it cannot open an external browser to complete the OAuth flow. But since a large portion of Pelican usage is public data, this documented feature is ready for production.

jhiemstrawisc

I was mostly poking around the PR out of curiosity -- these comments were small things I noticed while nosing around.

docs/app/getting-data-with-pelican/mcp/page.mdx

cmd/mcp.go

1. mcp/types.go
- All MCP protocol message structures (JSONRPCRequest, JSONRPCResponse, RPCError)
- MCP-specific structures (InitializeParams, Tool, CallToolResult, etc.)
2. mcp/server.go
- Server struct and core server logic
- Request handling (initialize, list tools, call tools)
- Response/error sending functions
3. mcp/tools.go
- getToolsList() - Returns tool definitions
- handleDownload() - Pelican download implementation
- handleStat() - File metadata retrieval
- handleList() - Directory listing

- Move config.InitClient() from server startup to lazy initialization
- Initialize Pelican client only when first tool is called
- Prevents corrupting JSON-RPC stream with startup errors
- Fixes 'Invalid input' error in Claude Desktop
- All logs go to stderr, stdout is clean JSON-RPC only

(cherry picked from commit e8df70f)

- Handle 'initialized' notification (sent after initialize)
- Don't respond to notifications (JSON-RPC requests without ID)
- Remove omitempty from response ID field for spec compliance

This fixes the 'Invalid input' error in Claude Desktop caused by responding to the 'initialized' notification when we shouldn't.

(cherry picked from commit 4babe40)
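The two rules in this commit message (stay silent on notifications, always emit the id member in responses) can be sketched as follows. The request, response, and handle names are hypothetical and the empty result object is a placeholder. The sketch relies on json.RawMessage staying empty when the "id" member is absent.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// request captures only the fields needed to tell requests from notifications.
type request struct {
	JSONRPC string          `json:"jsonrpc"`
	ID      json.RawMessage `json:"id"`
	Method  string          `json:"method"`
}

// isNotification reports whether the message omitted the "id" member.
func (r request) isNotification() bool { return len(r.ID) == 0 }

// response has no omitempty on ID, so the member is always serialized.
type response struct {
	JSONRPC string          `json:"jsonrpc"`
	ID      json.RawMessage `json:"id"`
	Result  any             `json:"result,omitempty"`
}

// handle returns the encoded reply, or nil when no reply must be sent.
func handle(line []byte) ([]byte, error) {
	var req request
	if err := json.Unmarshal(line, &req); err != nil {
		return nil, err
	}
	if req.isNotification() {
		return nil, nil // e.g. the notification sent after initialize
	}
	// Placeholder result: a real server would dispatch on req.Method here.
	return json.Marshal(response{JSONRPC: "2.0", ID: req.ID, Result: map[string]any{}})
}

func main() {
	for _, msg := range []string{
		`{"jsonrpc":"2.0","method":"notifications/initialized"}`,
		`{"jsonrpc":"2.0","id":1,"method":"ping"}`,
	} {
		out, err := handle([]byte(msg))
		switch {
		case err != nil:
			fmt.Println("error:", err)
		case out == nil:
			fmt.Println("(no response)")
		default:
			fmt.Println(string(out))
		}
	}
}
```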

h2zh added the client label (Issue affecting the OSDF client) Nov 18, 2025

h2zh force-pushed the client-mcp branch 2 times, most recently from 26f2b16 to 1279c1a Compare November 18, 2025 04:38

h2zh requested a review from bbockelm November 19, 2025 22:50

jhiemstrawisc reviewed Nov 21, 2025

docs/app/getting-data-with-pelican/mcp/page.mdx (outdated)

docs/app/getting-data-with-pelican/mcp/page.mdx (outdated)

cmd/mcp.go (outdated)

h2zh and others added 7 commits January 5, 2026 22:02

Add comprehensive MCP documentation

19ac1f4

Fix linter and date problems

6722c6f

Remove the local log‑level override in cmd/mcp.go.

038b260

- The global PersistentPreRunE in cmd/root.go already sets the logging level via config when --debug is present.

Semantic changes

080d103

- Update the MCP docs to the requested one-sentence-per-line style
- Wording improvement

h2zh force-pushed the client-mcp branch from 1279c1a to 080d103 Compare January 5, 2026 22:02

h2zh linked an issue Jan 5, 2026 that may be closed by this pull request

Pelican as a MCP #2947

Open

7 tasks

Copilot AI mentioned this pull request Feb 3, 2026

Add MCP authentication tools for protected namespaces via OAuth device flow h2zh/pelican#4

Draft
