Summary
I'm building a web application that needs to send AI inference requests to the Cocoon network. I'd like to understand the recommended way for client applications to connect.
My Setup
- Application: Next.js web app with chat interface
- Integration: using the OpenAI-compatible API format (/v1/chat/completions); a sketch of the call shape follows this list
- Environment: macOS (development), will deploy to cloud
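For context, this is roughly the call my app makes today. It's a minimal sketch assuming a standard OpenAI-compatible endpoint; the base URL, model name, and auth scheme are placeholders, since those are exactly what I'm asking about below:

```ts
// Minimal sketch of the request shape, assuming a standard
// OpenAI-compatible endpoint. COCOON_BASE_URL, the model name, and the
// Bearer auth header are placeholders -- they are what this issue asks about.
const COCOON_BASE_URL = process.env.COCOON_BASE_URL ?? "http://localhost:8080";

async function chat(prompt: string): Promise<string> {
  const res = await fetch(`${COCOON_BASE_URL}/v1/chat/completions`, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${process.env.COCOON_API_KEY}`, // assumed scheme
    },
    body: JSON.stringify({
      model: "llama-3.1-70b", // placeholder model name
      messages: [{ role: "user", content: prompt }],
    }),
  });
  if (!res.ok) throw new Error(`Inference request failed: ${res.status}`);
  const data = await res.json();
  return data.choices[0].message.content;
}
```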
What I've Tried
- Downloaded the worker release from the repo
- Attempted to build locally with ./scripts/cocoon-launch --local-all
- The build fails on macOS due to assembly incompatibilities (BLST library)
Questions
- Is there a public proxy endpoint that client applications can connect to for inference requests? (See the configuration sketch after this list.)
  - If yes, what is the endpoint URL?
  - What authentication/payment setup is required?
- For developers without TDX/H100 hardware, what is the recommended way to:
  - Test real inference (not mock)?
  - Build production applications?
- Is there a client-only package that doesn't require building the full worker infrastructure?
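To make the first question concrete: if a public proxy exists, I'd hope to point the official openai npm client at it by overriding baseURL, along these lines (the proxy URL, API key, and model name are placeholders pending your answer):

```ts
import OpenAI from "openai";

// Hypothetical configuration: COCOON_PROXY_URL and COCOON_API_KEY are
// placeholders for whatever endpoint/credential scheme the network uses.
const client = new OpenAI({
  baseURL: process.env.COCOON_PROXY_URL,
  apiKey: process.env.COCOON_API_KEY,
});

const completion = await client.chat.completions.create({
  model: "llama-3.1-70b", // placeholder model name
  messages: [{ role: "user", content: "Hello from my Next.js app" }],
});
console.log(completion.choices[0].message.content);
```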
Environment Details
- macOS 14.x (Sonoma)
- Node.js 18+
- No access to Intel TDX or NVIDIA H100 hardware
Thank you for building this! Looking forward to integrating Cocoon into my application.