MCP Queue Example (#99)

nendhruv · web-flow · commit e2eddb9f1e98 · 2025-10-22T14:35:59.000+08:00
* MCP Queue Example

* mock AI mode, KV rename, cleanup, docs

* chore(lockfile): update for queues-agent deps

* fix test

* fixing test errors for queue
diff --git a/examples/queues-agent/.dev.vars.example b/examples/queues-agent/.dev.vars.example
@@ -0,0 +1,3 @@
+USE_MOCK_AI=true
+
+
diff --git a/examples/queues-agent/.editorconfig b/examples/queues-agent/.editorconfig
@@ -0,0 +1,14 @@
+# http://editorconfig.org
+root = true
+
+[*]
+indent_style = tab
+end_of_line = lf
+charset = utf-8
+trim_trailing_whitespace = true
+insert_final_newline = true
+
+[*.yml]
+indent_style = space
+
+
diff --git a/examples/queues-agent/.gitignore b/examples/queues-agent/.gitignore
@@ -0,0 +1,7 @@
+# Local development artifacts
+.wrangler/
+.dev.vars
+data/
+node_modules/
+
+
diff --git a/examples/queues-agent/.prettierrc b/examples/queues-agent/.prettierrc
@@ -0,0 +1,6 @@
+{
+	"printWidth": 140,
+	"singleQuote": true,
+	"semi": true,
+	"useTabs": true
+}
diff --git a/examples/queues-agent/.vscode/settings.json b/examples/queues-agent/.vscode/settings.json
@@ -0,0 +1,5 @@
+{
+	"files.associations": {
+		"wrangler.json": "jsonc"
+	}
+}
diff --git a/examples/queues-agent/README.md b/examples/queues-agent/README.md
@@ -0,0 +1,143 @@
+## Queues Agent Example
+
+Showcase of using Cloudflare Queues with the NullShot Agent Toolkit:
+
+- HTTP producer endpoint enqueues chat jobs
+- Queue consumer triggers inside the same Worker and forwards each job directly to a Durable Object Agent to process with AI
+
+### Architecture
+
+- Producer: `POST /enqueue` → pushes `{ sessionId, messages }` into `REQUEST_QUEUE`
+- Consumer: `queue()` handler → forwards each message to `AGENT` Durable Object at `/agent/chat/:sessionId`
+- Agent: `QueueAgent` extends the toolkit `AiSdkAgent` and streams an AI response (Workers AI by default)
+
+### Files
+
+- `src/index.ts` – Worker with producer route, queue consumer, and DO Agent
+- `wrangler.jsonc` – Bindings for Queue, Durable Object, Workers AI
+
+### Prerequisites
+
+- Node.js 18+
+- Wrangler CLI
+- Cloudflare account (optional for local mock; required for Workers AI and cloud)
+
+### Setup
+
+Run modes
+
+- Local (Free): Uses Miniflare’s local queue simulation. Workers AI still requires login to produce real model output; without login you’ll see “Not logged in” in logs, but the queue flow runs end-to-end.
+  - Tip: Set `USE_MOCK_AI=true` in `.dev.vars` to run locally without a Cloudflare account. The agent returns a deterministic mock response.
+- Cloud (Paid): Uses real Cloudflare Queues and Workers AI on your account. Requires a paid Workers plan for Queues.
+
+1. Install deps
+
+```bash
+pnpm install
+```
+
+2. Create a Queue and (optionally) a DLQ (Cloud only, Paid)
+
+Cloud deployment only. Not required for local dev (--local). Requires a paid Workers plan and `npx wrangler login` first.
+
+```bash
+# Create main queue (cloud)
+npx wrangler queues create request-queue
+
+# Optional: create a dead letter queue (cloud)
+npx wrangler queues create request-queue-dlq
+```
+
+3. Configure `wrangler.jsonc`
+
+Edit the queue names if you used different names in step 2. The default config expects:
+
+- producer/consumer queue: `request-queue`
+- dead letter queue: `request-queue-dlq`
+
+4. Authenticate and run with real services
+
+```bash
+# Login interactively (recommended for dev)
+npx wrangler login
+
+# Or set a token for non-interactive shells
+export CLOUDFLARE_API_TOKEN=...   # least-privilege token
+
+# Use real edge runtime (queues + Workers AI)
+npx wrangler dev --remote
+
+# Deploy to Cloudflare
+npx wrangler deploy
+```
+
+5. Privacy and OSS hygiene
+
+- Never commit secrets or tokens. Use Wrangler secrets or environment variables.
+- This repo example does not include any secrets. Avoid adding `.dev.vars` to git.
+- For local only, you can create `.dev.vars` (excluded by `.gitignore`) to store non-sensitive vars.
+
+### Usage
+
+Enqueue a chat job (producer):
+
+```bash
+curl -X POST "http://127.0.0.1:8787/enqueue" \
+  -H "Content-Type: application/json" \
+  -d '{
+    "sessionId": "demo-session-1",
+    "messages": [
+      { "role": "user", "content": "Say hello in one sentence." }
+    ]
+  }'
+```
+
+The consumer will receive queue messages and route them to the Agent Durable Object. You can tail logs to observe processing:
+
+```bash
+npx wrangler tail
+```
+
+### Configuration Notes
+
+- `compatibility_date`: set to 2025-02-11 per repo rules
+- `compatibility_flags`: `["nodejs_compat"]`
+- Observability enabled with `head_sampling_rate = 1`
+- Uses Workers AI by default; set `USE_MOCK_AI=true` for zero‑auth local output.
+
+### Retrieve results (optional KV persistence)
+
+- This example stores the latest agent response per `sessionId` in KV (binding `RESULTS`).
+- Fetch the persisted output:
+
+```bash
+curl "http://127.0.0.1:8787/result/demo-session-1"
+```
+
+Returns:
+
+```json
+{ "result": "... assistant text ..." }
+```
+
+### Production checklist
+
+- Auth: `npx wrangler login` (or set `CLOUDFLARE_API_TOKEN`).
+- Queues: create `request-queue` and optional `request-queue-dlq` (Paid plan required for cloud queues).
+- Retries/DLQ: keep `retry_delay` configured; monitor DLQ.
+- Persistence: KV (included), or D1/R2 for richer storage.
+- Security: no secrets in code; use Wrangler secrets/env vars.
+- Observability: `wrangler tail`, dashboard logs, metrics.
+- Limits: model usage, queue throughput, DO CPU limits.
+
+Note: Workers AI requires Cloudflare auth even in local dev. Without login you’ll see “Not logged in” in logs.
+If you prefer zero-auth local output, set `USE_MOCK_AI=true` in `.dev.vars`.
+
+### Example Payload
+
+```json
+{
+	"sessionId": "demo-session-1",
+	"messages": [{ "role": "user", "content": "Summarize Cloudflare Queues in one line." }]
+}
+```
diff --git a/examples/queues-agent/package.json b/examples/queues-agent/package.json
@@ -0,0 +1,24 @@
+{
+	"name": "queues-agent",
+	"version": "0.0.1",
+	"private": true,
+	"scripts": {
+		"dev": "wrangler dev",
+		"deploy": "wrangler deploy",
+		"build": "wrangler build",
+		"test": "vitest run"
+	},
+	"devDependencies": {
+		"typescript": "catalog:",
+		"wrangler": "catalog:",
+		"vitest": "catalog:",
+		"@cloudflare/vitest-pool-workers": "catalog:",
+		"@nullshot/test-utils": "workspace:*"
+	},
+	"dependencies": {
+		"@nullshot/agent": "workspace:*",
+		"ai": "catalog:",
+		"hono": "^4.7.7",
+		"workers-ai-provider": "catalog:"
+	}
+}
diff --git a/examples/queues-agent/src/index.ts b/examples/queues-agent/src/index.ts
@@ -0,0 +1,103 @@
+import { Hono } from 'hono';
+import { cors } from 'hono/cors';
+import { applyPermissionlessAgentSessionRouter } from '@nullshot/agent';
+import { AiSdkAgent, AIUISDKMessage } from '@nullshot/agent/aisdk';
+import { Service } from '@nullshot/agent';
+import { ToolboxService } from '@nullshot/agent/services';
+import { createWorkersAI } from 'workers-ai-provider';
+
+// Minimal agent that echoes a short response via Workers AI
+export class QueueAgent extends AiSdkAgent<Env> {
+	constructor(state: DurableObjectState, env: Env) {
+		// If USE_MOCK_AI is enabled, we don't require the Workers AI binding
+		let model: any;
+		if (env.USE_MOCK_AI === 'true') {
+			// Provide a dummy model; processMessage will short-circuit in mock mode
+			model = {} as any;
+		} else {
+			if (!env.AI) throw new Error('AI binding missing. Configure Workers AI in wrangler.jsonc');
+			const workersai = createWorkersAI({ binding: env.AI });
+			model = workersai('@cf/meta/llama-3.1-8b-instruct' as any);
+		}
+		const services: Service[] = [new ToolboxService(env)];
+		super(state, env, model, services);
+	}
+
+	async processMessage(sessionId: string, messages: AIUISDKMessage): Promise<Response> {
+		// Mock mode: return deterministic response without calling Workers AI
+		if (this.env.USE_MOCK_AI === 'true') {
+			const last = messages.messages[messages.messages.length - 1];
+			const userText = typeof last?.content === 'string' ? last.content : 'Hello';
+			const reply = `Mock response: ${userText}`;
+			return new Response(reply, { headers: { 'Content-Type': 'text/plain' } });
+		}
+
+		const result = await this.streamTextWithMessages(sessionId, messages.messages, {
+			system: 'You are a helpful assistant. Keep responses concise.',
+			maxSteps: 5,
+		});
+		return result.toTextStreamResponse();
+	}
+}
+
+// Hono app for producer and agent gateway
+const app = new Hono<{ Bindings: Env }>();
+app.use('*', cors());
+
+// Simple enqueue endpoint: { sessionId, messages }
+app.post('/enqueue', async (c) => {
+	const body = await c.req.json<any>();
+	const sessionId: string = body.sessionId || crypto.randomUUID();
+	const messages = body.messages || [{ role: 'user', content: 'Hello!' }];
+
+	await c.env.REQUEST_QUEUE.send({ sessionId, messages });
+
+	return c.json({ enqueued: true, sessionId });
+});
+
+// Fetch latest result for session
+app.get('/result/:sessionId', async (c) => {
+	const sessionId = c.req.param('sessionId');
+	const value = await c.env.RESULTS_KV.get(`result:${sessionId}`);
+	if (!value) return c.json({ result: null }, 200);
+	return c.json({ result: value }, 200);
+});
+
+// Route /agent/chat/:sessionId to the DO agent
+applyPermissionlessAgentSessionRouter(app);
+
+export default {
+	async fetch(request: Request, env: Env, ctx: ExecutionContext): Promise<Response> {
+		return app.fetch(request, env, ctx);
+	},
+
+	// Queue consumer: run messages through the Agent DO
+	async queue(batch: MessageBatch<any>, env: Env, ctx: ExecutionContext) {
+		for (const msg of batch.messages) {
+			try {
+				const { sessionId, messages } = msg.body || {};
+				if (!sessionId || !messages) {
+					console.warn('Invalid queue message, skipping');
+					continue;
+				}
+				const id = env.AGENT.idFromName(sessionId);
+				const req = new Request('https://internal/agent/chat/' + sessionId, {
+					method: 'POST',
+					headers: { 'Content-Type': 'application/json' },
+					body: JSON.stringify({ id: crypto.randomUUID(), messages }),
+				});
+				// Synchronously fetch the agent and persist full text to KV for retrieval
+				const resp = await env.AGENT.get(id).fetch(req);
+				const text = await resp.text();
+				ctx.waitUntil(
+					env.RESULTS_KV.put(`result:${sessionId}`, text, {
+						expirationTtl: 60 * 60,
+					}),
+				);
+			} catch (e) {
+				console.error('Queue processing error:', e);
+				throw e;
+			}
+		}
+	},
+};
diff --git a/examples/queues-agent/test/env.d.ts b/examples/queues-agent/test/env.d.ts
@@ -0,0 +1,3 @@
+declare module 'cloudflare:test' {
+	interface ProvidedEnv extends Env {}
+}
diff --git a/examples/queues-agent/test/queues-agent.test.ts b/examples/queues-agent/test/queues-agent.test.ts
@@ -0,0 +1,40 @@
+import { expect, it } from 'vitest';
+import { SELF } from 'cloudflare:test';
+// Use the in-memory worker provided by the Vitest Workers pool
+const BASE = 'https://example.com';
+
+it(
+	'enqueues and returns a result (may be null without Workers AI auth)',
+	async () => {
+		const sessionId = `test-${crypto.randomUUID()}`;
+
+		const enqueue = await SELF.fetch(`${BASE}/enqueue`, {
+			method: 'POST',
+			headers: { 'Content-Type': 'application/json' },
+			body: JSON.stringify({
+				sessionId,
+				messages: [{ role: 'user', content: 'One sentence about queues.' }],
+			}),
+		});
+		expect(enqueue.ok).toBe(true);
+		const ej = await enqueue.json();
+		expect(ej.enqueued).toBe(true);
+
+		// Poll result up to ~8s
+		let result: any = null;
+		for (let i = 0; i < 8; i++) {
+			const r = await SELF.fetch(`${BASE}/result/${sessionId}`);
+			expect(r.ok).toBe(true);
+			const j = await r.json();
+			if (j.result) {
+				result = j.result;
+				break;
+			}
+			await new Promise((res) => setTimeout(res, 1000));
+		}
+
+		// We allow null when not authenticated to Workers AI locally
+		expect(result === null || typeof result === 'string').toBe(true);
+	},
+	{ timeout: 20000 },
+);
diff --git a/examples/queues-agent/test/tsconfig.json b/examples/queues-agent/test/tsconfig.json
@@ -0,0 +1,8 @@
+{
+	"extends": "../tsconfig.json",
+	"compilerOptions": {
+		"types": ["@cloudflare/workers-types/experimental", "@cloudflare/vitest-pool-workers"]
+	},
+	"include": ["./**/*.ts", "../worker-configuration.d.ts"],
+	"exclude": []
+}
diff --git a/examples/queues-agent/tsconfig.json b/examples/queues-agent/tsconfig.json
@@ -0,0 +1,21 @@
+{
+  "compilerOptions": {
+    "target": "es2021",
+    "lib": ["es2021"],
+    "jsx": "react-jsx",
+    "module": "es2022",
+    "moduleResolution": "Bundler",
+    "types": [],
+    "resolveJsonModule": true,
+    "allowJs": true,
+    "checkJs": false,
+    "noEmit": true,
+    "isolatedModules": true,
+    "allowSyntheticDefaultImports": true,
+    "forceConsistentCasingInFileNames": true,
+    "strict": true,
+    "skipLibCheck": true
+  },
+  "exclude": ["test"],
+  "include": ["worker-configuration.d.ts", "src/**/*.ts"]
+}
diff --git a/examples/queues-agent/vitest.config.mts b/examples/queues-agent/vitest.config.mts
@@ -0,0 +1,6 @@
+import { createMcpWorkersConfig } from '@nullshot/test-utils/vitest/mcpWorkersConfig';
+
+export default createMcpWorkersConfig({
+	wranglerConfigPath: './wrangler.jsonc',
+	includeAjvMock: true,
+});
diff --git a/examples/queues-agent/worker-configuration.d.ts b/examples/queues-agent/worker-configuration.d.ts
diff --git a/examples/queues-agent/wrangler.jsonc b/examples/queues-agent/wrangler.jsonc
diff --git a/pnpm-lock.yaml b/pnpm-lock.yaml

-Original file line number
+Diff line change
@@ @@ -0,0 +1,5 @@ @@
 +{
 +	"files.associations": {
 +		"wrangler.json": "jsonc"
 +	}
 +}
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,3 @@`
	`1`	`+declare module 'cloudflare:test' {`
	`2`	`+ interface ProvidedEnv extends Env {}`
	`3`	`+}`