replicatedhq · boxingknight · Dec 9, 2025 · Dec 9, 2025 · Dec 10, 2025 · Dec 11, 2025
diff --git a/.gitignore b/.gitignore
@@ -4,3 +4,46 @@ bin
 test-results/
 .envrc
 .specstory/
+
+# Test artifacts and outputs
+*.test
+*.test.out
+*.prof
+
+# Test coverage files
+coverage/
+.nyc_output/
+.coverage/
+*.lcov
+coverage.json
+coverage-final.json
+.nyc_output/
+lib-cov/
+
+# Environment files
+.env
+.env.local
+.env.*.local
+
+# Build artifacts
+.next/
+dist/
+build/
+*.o
+*.a
+*.so
+
+# IDE and editor files
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+
+# OS files
+Thumbs.db
+.DS_Store
+
+# Local setup documentation and scripts (not committed to git)
+docs/
+scripts/
diff --git a/ARCHITECTURE.md b/ARCHITECTURE.md
@@ -24,4 +24,26 @@ It's made for both the developer working on it and for AI models to read and app
 ## Workers
 - The go code is where we put all workers. 
 - Jobs for workers are enqueued and scheduled using postgres notify and a work_queue table.
-- Status from the workers is communicated via Centrifugo messages to the client.
+- Status from the workers is communicated via Centrifugo messages to the client.
+
+## Chat & LLM Integration
+
+Chartsmith uses the Vercel AI SDK for all conversational chat functionality.
+The Go worker outputs AI SDK Data Stream Protocol format, which the frontend
+consumes via the useChat hook.
+
+### Architecture
+- Frontend: useChat hook manages chat state
+- API Route: /api/chat proxies to Go worker
+- Backend: Go worker outputs AI SDK protocol (HTTP SSE)
+- Streaming: Server-Sent Events instead of WebSocket
+
+### Key Components
+- pkg/llm/aisdk.go: Adapter for AI SDK protocol
+- pkg/api/chat.go: HTTP endpoint for chat streaming
+- chartsmith-app/hooks/useAIChat.ts: Frontend hook wrapper
+- chartsmith-app/app/api/chat/route.ts: Next.js API route
+
+### Note on Centrifugo
+Centrifugo is still used for non-chat events (plans, renders, artifacts).
+Chat messages flow exclusively through the AI SDK HTTP SSE protocol.
diff --git a/CLAUDE.md b/CLAUDE.md
diff --git a/chartsmith-app/ARCHITECTURE.md b/chartsmith-app/ARCHITECTURE.md
@@ -21,3 +21,28 @@ This is a next.js project that is the front end for chartsmith.
 - We aren't using Next.JS API routes, except when absolutely necessary.
 - Front end should call server actions, which call lib/* functions.
 - Database queries are not allowed in the server action. Server actions are just wrappers for which lib functions we expose.
+
+## Chat & LLM Integration
+
+Chartsmith uses the Vercel AI SDK for all chat functionality:
+
+- **Frontend**: `useChat` hook from `@ai-sdk/react` manages chat state
+- **API Route**: `/api/chat` Next.js route proxies to Go worker
+- **Backend**: Go worker outputs AI SDK Data Stream Protocol (HTTP SSE)
+- **Streaming**: Server-Sent Events (SSE) instead of WebSocket
+- **State**: Managed by AI SDK hook, integrated with Jotai for workspace state
+
+### Flow
+```
+User Input → ChatContainer → useAIChat → /api/chat → Go Worker → AI SDK Protocol → useChat → UI
+```
+
+### Key Components
+- `useAIChat`: Wraps `useChat` with Chartsmith-specific logic
+- `/api/chat`: Next.js API route that proxies to Go worker
+- `pkg/llm/aisdk.go`: Go adapter for AI SDK protocol
+- `pkg/api/chat.go`: HTTP endpoint for chat streaming
+
+### Note on Centrifugo
+Centrifugo is still used for non-chat events (plans, renders, artifacts).
+Chat messages flow exclusively through the AI SDK HTTP SSE protocol.
diff --git a/chartsmith-app/TEST_COVERAGE.md b/chartsmith-app/TEST_COVERAGE.md
@@ -0,0 +1,250 @@
+# Test Coverage Documentation
+
+## Overview
+
+This document describes the comprehensive test suite created for the Vercel AI SDK migration. The tests cover all critical functionality including message format conversion, API routing, authentication, persistence, and integration flows.
+
+## Test Statistics
+
+- **Total Test Suites**: 9
+- **Total Tests**: 80
+- **Status**: ✅ All passing
+
+## Test Files Created
+
+### 1. `hooks/__tests__/useAIChat.test.tsx` (18 tests)
+
+**Purpose**: Tests the core `useAIChat` hook that wraps `@ai-sdk/react`'s `useChat` hook with Chartsmith-specific functionality.
+
+**Why This Matters**: This hook is the central integration point between the AI SDK and Chartsmith's existing architecture. It handles:
+- Message format conversion (AI SDK ↔ Chartsmith Message type)
+- Jotai atom synchronization for backward compatibility
+- Historical message loading
+- Role selection state management
+- Message persistence callbacks
+
+**Test Coverage**:
+- ✅ Initialization with provided messages
+- ✅ Loading messages from database when not provided
+- ✅ Error handling when loading messages fails
+- ✅ Message format conversion (AI SDK to Chartsmith)
+- ✅ Real-time message streaming updates
+- ✅ Role selection (auto/developer/operator)
+- ✅ Input state management
+- ✅ Message submission handling
+- ✅ Error exposure from useChat
+- ✅ Stop and reload functionality
+- ✅ Tool invocation preservation
+- ✅ Metadata preservation during conversion
+
+**Key Test Scenarios**:
+1. **Message Conversion**: Verifies that AI SDK messages (separate user/assistant) are correctly converted to Chartsmith format (paired messages)
+2. **Atom Synchronization**: Ensures messages sync to Jotai atoms in real-time for backward compatibility
+3. **Role Selection**: Tests that selected role (auto/developer/operator) is properly managed and included in API requests
+4. **Persistence Callbacks**: Verifies `onMessageComplete` callback is triggered when messages finish streaming
+
+### 2. `app/api/chat/__tests__/route.test.ts` (18 tests)
+
+**Purpose**: Tests the Next.js API route that proxies chat requests to the Go backend.
+
+**Why This Matters**: This route is the bridge between the frontend and backend. It must:
+- Authenticate requests securely (cookie-based and bearer token)
+- Validate request payloads
+- Proxy requests to Go backend correctly
+- Stream responses back in AI SDK format
+- Handle errors gracefully
+
+**Test Coverage**:
+- ✅ Cookie-based authentication
+- ✅ Bearer token authentication (fallback)
+- ✅ 401 when no authentication provided
+- ✅ Graceful error handling for auth failures
+- ✅ Request validation (messages array, workspaceId)
+- ✅ Invalid JSON body handling
+- ✅ Proxying to Go backend with correct format
+- ✅ Response streaming (SSE format)
+- ✅ Go backend error handling
+- ✅ Missing response body handling
+- ✅ Network error handling
+- ✅ Go worker URL resolution (env var, database param, default)
+
+**Key Test Scenarios**:
+1. **Dual Authentication**: Tests both cookie-based (web) and bearer token (extension) authentication paths
+2. **Request Validation**: Ensures malformed requests are rejected with appropriate error messages
+3. **Proxying**: Verifies requests are correctly forwarded to Go backend with proper format
+4. **Streaming**: Confirms responses are streamed back in AI SDK Data Stream Protocol format (text/event-stream)
+5. **URL Resolution**: Tests priority order: env var → database param → localhost default
+
+### 3. `hooks/__tests__/useChatPersistence.test.tsx` (4 tests - existing, enhanced)
+
+**Purpose**: Tests the `useChatPersistence` hook that manages chat message persistence.
+
+**Why This Matters**: This hook handles loading chat history and saving completed messages to the database, ensuring chat state persists across sessions.
+
+**Test Coverage**:
+- ✅ Loads history on mount
+- ✅ Provides saveMessage function
+- ✅ Skips loading when disabled
+- ✅ Handles errors gracefully
+
+### 4. `lib/services/__tests__/chat-persistence.test.ts` (6 tests - existing)
+
+**Purpose**: Tests the `ChatPersistenceService` class that handles API calls for persistence.
+
+**Test Coverage**:
+- ✅ Loads and converts messages to AI SDK format
+- ✅ Returns empty array for 404
+- ✅ Handles messages array wrapped in object
+- ✅ Saves user and assistant message together
+- ✅ Handles array content format
+- ✅ Updates existing messages
+
+### 5. `lib/types/__tests__/chat.test.ts` (9 tests - existing)
+
+**Purpose**: Tests message format conversion utilities.
+
+**Test Coverage**:
+- ✅ Converts user messages correctly
+- ✅ Converts assistant messages correctly
+- ✅ Handles array content format
+- ✅ Preserves metadata
+- ✅ Throws error for unsupported roles
+- ✅ Converts Messages to AI SDK format
+- ✅ Handles empty messages
+- ✅ Converts multiple messages
+
+### 6. `__tests__/integration/chat-flow.test.tsx` (Integration tests)
+
+**Purpose**: Tests the end-to-end integration between components.
+
+**Why This Matters**: These tests verify that all pieces work together correctly:
+- useAIChat hook
+- useChatPersistence hook
+- ChatContainer component
+- /api/chat route
+- Message format conversion
+- Jotai atom synchronization
+
+**Test Coverage**:
+- ✅ Message history loading and display
+- ✅ Message sending flow with role selection
+- ✅ Message persistence callbacks
+- ✅ Error handling across the stack
+- ✅ Message format conversion
+- ✅ Real-time updates during streaming
+- ✅ Role selection persistence
+
+## Test Architecture
+
+### Environment Configuration
+
+- **Node Environment**: Used for API route tests (no DOM needed)
+- **jsdom Environment**: Used for React hook tests (DOM APIs needed)
+
+### Mocking Strategy
+
+1. **@ai-sdk/react**: Mocked to control `useChat` behavior
+2. **jotai**: Mocked to control atom behavior
+3. **fetch**: Mocked to simulate API calls
+4. **next/headers**: Mocked to simulate cookie access
+5. **Session/auth**: Mocked to simulate authentication
+
+### Key Testing Patterns
+
+1. **Hook Testing**: Uses `@testing-library/react`'s `renderHook` for React hooks
+2. **API Route Testing**: Directly imports and calls route handlers
+3. **Integration Testing**: Tests component interactions without full rendering
+4. **Error Scenarios**: Tests error handling at each layer
+
+## Why Each Test Category Matters
+
+### Unit Tests (useAIChat, chat-persistence, chat types)
+
+**Purpose**: Test individual components in isolation.
+
+**Benefits**:
+- Fast execution
+- Easy to debug failures
+- Clear responsibility boundaries
+- Can test edge cases thoroughly
+
+**Example**: Testing that `aiMessageToMessage` correctly converts AI SDK format to Chartsmith format preserves all metadata fields.
+
+### Integration Tests (API route, chat flow)
+
+**Purpose**: Test how components work together.
+
+**Benefits**:
+- Catches integration bugs
+- Verifies data flow between layers
+- Tests real-world scenarios
+- Ensures API contracts are met
+
+**Example**: Testing that a message sent through `useAIChat` → `/api/chat` → Go backend → response → persistence callback works end-to-end.
+
+### Error Handling Tests
+
+**Purpose**: Ensure system gracefully handles failures.
+
+**Benefits**:
+- Prevents crashes
+- Provides good error messages
+- Maintains user experience during failures
+- Helps with debugging production issues
+
+**Example**: Testing that when the Go backend returns 500, the API route returns a proper error response instead of crashing.
+
+## Test Maintenance
+
+### When to Add Tests
+
+1. **New Features**: Add tests when adding new functionality
+2. **Bug Fixes**: Add regression tests when fixing bugs
+3. **Refactoring**: Update tests when changing implementation
+4. **Edge Cases**: Add tests when discovering edge cases
+
+### Running Tests
+
+```bash
+# Run all unit tests
+npm run test:unit
+
+# Run tests in watch mode
+npm run test:watch
+
+# Run specific test file
+npm run test:unit -- hooks/__tests__/useAIChat.test.tsx
+```
+
+### Test Coverage Goals
+
+- **Critical Paths**: 100% coverage (authentication, message conversion, API routing)
+- **Error Handling**: 100% coverage (all error paths tested)
+- **Integration Points**: High coverage (all major integration points tested)
+- **Edge Cases**: High coverage (unusual but valid inputs tested)
+
+## Known Limitations
+
+1. **Full Component Rendering**: Some tests use simplified mocks instead of full component rendering for performance
+2. **Real Backend**: Tests don't hit the actual Go backend (mocked)
+3. **Real Database**: Tests don't use the actual database (mocked)
+4. **E2E Tests**: Full end-to-end tests would require Playwright (separate test suite)
+
+## Future Improvements
+
+1. **E2E Tests**: Add Playwright tests for full user flows
+2. **Performance Tests**: Add tests for streaming performance
+3. **Load Tests**: Add tests for concurrent message handling
+4. **Visual Regression**: Add tests for UI components
+5. **Accessibility Tests**: Add tests for accessibility compliance
+
+## Conclusion
+
+This comprehensive test suite ensures that:
+- ✅ All critical functionality is tested
+- ✅ Error handling works correctly
+- ✅ Integration points are verified
+- ✅ Backward compatibility is maintained
+- ✅ New features can be added confidently
+
+The tests provide confidence that the Vercel AI SDK migration is working correctly and will continue to work as the codebase evolves.