Skip to content

Conversation

@TimPietruskyRunPod
Copy link
Collaborator

Summary

  • Fix test-endpoint-agent.mjs to use /generate instead of /chat (404 fix)
  • Update README.md with correct model name (Qwen3-32B instead of GPT-OSS-120B)
  • Update .runpod/README.md with correct endpoint path and model reference

Test plan

  • Tested agent script locally against deployed endpoint
  • Verified /api/agents/weatherAgent/generate returns correct response

… notice

- fix test-endpoint-agent.mjs to use /generate instead of /chat
- update README.md with correct model name (qwen3-32b-awq)
- update .runpod/README.md with experimental notice explaining:
  - cold start time (20-40s) and flash boot coming for cpu
  - load balancer is experimental for mastra api exposure
  - recommend active workers = 1 to avoid cold start
  - in-memory storage is file-based and lost when worker stops
@TimPietruskyRunPod TimPietruskyRunPod deleted the fix/test-scripts-and-docs branch December 16, 2025 08:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants