
Commit bbdfd08: Polish repo presentation
Parent: 563c2cb

File tree: 9 files changed (+467, -89 lines)

README.md (75 additions, 61 deletions):
````diff
@@ -1,8 +1,47 @@
 # nextjs-python-computer-vision-kit
 
-A full-stack starter monorepo for detection-first computer vision products built with Next.js and FastAPI.
+A product-minded monorepo starter for detection-first computer vision apps built with Next.js and FastAPI.
 
-It combines a polished frontend, a Python API designed for image-processing workloads, shared root scripts, a documented OpenAPI contract, and a sample detection pipeline that runs on CPU with OpenCV so teams can start shipping product workflows before committing to a heavier model stack.
+It gives you a polished upload-to-inference UI, a typed OpenAPI contract, CPU-friendly starter pipelines, and a clean path into webcam capture, segmentation, and heavier model backends later.
+
+<p>
+  <a href="#quick-start">Quick start</a> ·
+  <a href="#screenshots">Screenshots</a> ·
+  <a href="#what-you-get">What you get</a> ·
+  <a href="./soon.md">Roadmap</a>
+</p>
+
+## Screenshots
+
+![Vision console screenshot](docs/assets/vision-console.png)
+
+![Webcam extension screenshot](docs/assets/webcam-extension.png)
+
+## Why This Repo Exists
+
+Most computer-vision starters fall into one of two buckets:
+
+- model notebooks with no product layer
+- web templates with no real inference contract
+
+This kit sits in the middle. It starts with a real product flow:
+
+- upload an image
+- run a detection-oriented pipeline
+- inspect typed boxes, metrics, and image metadata
+- keep the same contract when you add segmentation or webcam capture later
+
+## What You Get
+
+- detection-first starter UX with annotated preview overlays
+- inference-first architecture with a separate Next.js frontend and FastAPI backend
+- shared OpenAPI contract in `docs/openapi.yaml`
+- generated frontend API types from `openapi-typescript`
+- optional webcam extension that reuses the same API surface
+- first live segmentation extension with polygons, masks, and derived boxes
+- CPU-first OpenCV sample pipelines that are easy to replace later
+- root dev and verification scripts for a monorepo-style workflow
+- GitHub Actions template CI
 
 ## Stack
 
@@ -15,32 +54,22 @@ It combines a polished frontend, a Python API designed for image-processing work
 - OpenCV
 - Docker Compose
 
-## Monorepo Structure
-
-- `frontend/`: Next.js app with a vision-console UI, API client helpers, and generated OpenAPI types
-- `backend/`: FastAPI service with health, pipeline catalog, and image-analysis routes
-- `docs/`: shared API contract
-- `scripts/`: root development and verification scripts
-- `.github/`: CI workflow for the template
-
-## Recommended Shape
-
-- architecture: inference-first
-- default demo: detection-first
-- optional frontend extension: webcam capture
-- later backend extension: segmentation
-- later workspace/package: training pipeline
+## Included Pipelines
 
-This keeps the template easy to understand while still leaving a clean path into more advanced CV workflows.
+- `starter-detection`: default object-style detection flow for the main UI
+- `foreground-segmentation`: first extension pipeline with polygons plus derived boxes
+- `document-layout`: document-style region extraction for capture and scanning products
+- `dominant-color`: metrics-only example for QA and analytics workflows
 
-## Why This Template Exists
+These pipelines are intentionally lightweight. They prove the repo shape and developer workflow without forcing you into toy logic forever. Swap them for YOLO, ONNX Runtime, PyTorch, TensorRT, or a hosted inference service when you are ready.
 
-Most computer-vision starters are either model notebooks with no product layer or web templates with no real inference shape. This template sits in the middle:
+## Repo Shape
 
-- product-minded frontend by default
-- backend structure ready for image upload, preprocessing, and model-serving extensions
-- typed API contract between the web app and the inference service
-- one-command local development from the repo root
+- `frontend/`: Next.js app shell, upload flow, webcam flow, and generated API types
+- `backend/`: FastAPI service, pipeline registry, validation, and starter image logic
+- `docs/`: OpenAPI contract and screenshot assets
+- `scripts/`: root development and verification commands
+- `.github/`: template CI workflow
 
 ## Quick Start
 
@@ -51,9 +80,11 @@ Most computer-vision starters are either model notebooks with no product layer o
 5. Run `npm run api:types`.
 6. Run `npm run dev`.
 
-Frontend: `http://localhost:3000`
+Frontend: `http://localhost:3000`
 Backend: `http://127.0.0.1:8000`
 
+If you create `backend/.venv`, the root scripts will prefer that interpreter automatically.
+
 ## Commands
 
 ```bash
@@ -63,45 +94,28 @@ npm run api:types
 npm run check
 ```
 
-## API Contract
+## Verification
 
-- `docs/openapi.yaml` is the source of truth for the shared HTTP contract.
-- `frontend/src/generated/openapi.ts` is generated from that spec with `openapi-typescript`.
-- Run `npm run api:types` whenever backend payloads change.
+The root check runs:
 
-## Sample Pipelines Included
+- frontend lint
+- frontend typecheck
+- frontend production build
+- backend `pytest`
+- backend `compileall`
 
-- `starter-detection`: the default object-style detection sample used by the main frontend flow
-- `foreground-segmentation`: the first live extension pipeline, returning region polygons and derived boxes
-- `document-layout`: document-oriented box extraction for scanning and capture products
-- `dominant-color`: metrics-only extension pipeline for QA and analytics
+## Contract Notes
 
-These are intentionally lightweight starter pipelines. They are there to prove the architecture and developer workflow, not to lock you into toy logic. Swap them for YOLO, ONNX Runtime, PyTorch, TensorRT, or a custom service when you are ready.
+- `docs/openapi.yaml` is the source of truth for the HTTP contract.
+- `frontend/src/generated/openapi.ts` is generated from that spec.
+- Run `npm run api:types` whenever backend payloads change.
 
-## What You Get
+## Recommended Growth Path
+
+1. Keep the main story detection-first.
+2. Add webcam polish once upload mode feels strong.
+3. Add segmentation depth without changing the response boundary.
+4. Introduce a real model adapter layer.
+5. Split training and experimentation into a separate workspace later.
 
-- reusable Next.js + Python computer-vision monorepo layout
-- upload-and-detect frontend starter UI
-- optional webcam capture mode that reuses the same inference contract
-- first segmentation extension pipeline using the same response boundary
-- FastAPI inference endpoint with typed response models
-- OpenCV-based sample processing that runs without a GPU
-- root scripts for local dev and checks
-- GitHub Actions workflow for frontend and backend verification
-- Docker Compose dev option
-
-## Notes
-
-- The backend in this starter is CPU-first on purpose so it is easier to clone, run, and extend.
-- The main story is intentionally detection-first so the template stays easy to explain and demo.
-- The current environment used to build this template did not have Python installed, so the frontend was verified locally but backend execution was prepared rather than run here.
-- If you move to heavier vision workloads, add a worker or model-service layer and keep the current API as the contract boundary.
-
-## Next Expansions
-
-- async job queue for long-running inference
-- persistent artifact storage
-- model registry and experiment tracking
-- richer segmentation overlays and mask visualizations
-- video ingestion pipelines
-- training or experiment workspace in a separate `ml/` or `training/` package
+The short public roadmap lives in [soon.md](./soon.md).
````
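The `foreground-segmentation` entry above advertises polygons plus derived boxes. Deriving an axis-aligned box from a polygon can be sketched as follows; the `Point` and `Box` shapes here are hypothetical stand-ins, not the kit's generated types (those live in `frontend/src/generated/openapi.ts`):

```typescript
// Hypothetical shapes for illustration only; the authoritative contract
// is docs/openapi.yaml and the types generated from it.
type Point = { x: number; y: number };
type Box = { x: number; y: number; width: number; height: number };

// Derive the smallest axis-aligned bounding box enclosing a polygon.
function polygonToBox(points: Point[]): Box {
  const xs = points.map((p) => p.x);
  const ys = points.map((p) => p.y);
  const x = Math.min(...xs);
  const y = Math.min(...ys);
  return { x, y, width: Math.max(...xs) - x, height: Math.max(...ys) - y };
}

// Example: a triangle spanning (10,20) to (30,40) yields a 20x20 box.
const derived = polygonToBox([
  { x: 10, y: 20 },
  { x: 30, y: 25 },
  { x: 15, y: 40 },
]);
```

A helper like this lets the segmentation extension keep feeding the same box-oriented overlay components the detection flow already uses.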

docs/assets/sample-scene.png (38.5 KB)

docs/assets/vision-console.png (523 KB)

docs/assets/webcam-extension.png (442 KB)
New file (95 additions, 0 deletions):

```tsx
import { AnalysisPreview } from "@/components/analysis-preview";
import { AnalysisResults } from "@/components/analysis-results";
import { docsDemoImagePath, docsPreviewResult } from "@/lib/docs-demo";

export default function DocsPreviewPage() {
  return (
    <main className="min-h-screen overflow-hidden px-6 py-8 lg:px-10 lg:py-10">
      <div className="mx-auto flex max-w-7xl flex-col gap-6">
        <section className="rounded-[36px] border border-black/10 bg-white/76 px-7 py-8 shadow-[0_32px_90px_rgba(10,20,25,0.12)] backdrop-blur-xl lg:px-10">
          <div className="flex flex-wrap items-center gap-3">
            <span className="rounded-full bg-[var(--accent-soft)] px-3 py-1 text-xs font-semibold uppercase tracking-[0.3em] text-[var(--foreground)]">
              Docs Preview
            </span>
            <span className="rounded-full border border-black/10 px-3 py-1 font-mono text-xs text-black/60">
              Detection + segmentation showcase
            </span>
          </div>

          <div className="mt-6 grid gap-4 lg:grid-cols-[1.2fr_0.8fr] lg:items-end">
            <div>
              <h1 className="max-w-4xl text-5xl font-semibold tracking-[-0.05em] text-[var(--foreground)]">
                Screenshot-ready preview of the kit&apos;s main product path.
              </h1>
              <p className="mt-4 max-w-3xl text-base leading-8 text-black/70">
                One static scene, one typed response shape, and the same polished
                overlay UI the public starter ships with.
              </p>
            </div>

            <div className="grid gap-3 rounded-[28px] border border-black/10 bg-[#13262e] p-5 text-white">
              <div>
                <p className="font-mono text-xs uppercase tracking-[0.3em] text-white/45">
                  Included in the template
                </p>
                <p className="mt-3 text-lg font-semibold tracking-tight">
                  Upload workflow, overlay controls, typed results, and the first
                  segmentation extension.
                </p>
              </div>
              <div className="flex flex-wrap gap-2">
                {["starter-detection", "foreground-segmentation", "webcam mode"].map(
                  (item) => (
                    <span
                      key={item}
                      className="rounded-full border border-white/10 bg-white/8 px-3 py-1 font-mono text-[11px] text-white/72"
                    >
                      {item}
                    </span>
                  ),
                )}
              </div>
            </div>
          </div>
        </section>

        <section className="grid gap-6 lg:grid-cols-[0.95fr_1.05fr]">
          <div className="rounded-[32px] border border-black/10 bg-white/78 p-6 shadow-[0_32px_90px_rgba(10,20,25,0.12)] backdrop-blur-xl">
            <div className="flex flex-wrap items-center gap-3">
              <span className="rounded-full bg-[var(--accent-soft)] px-3 py-1 text-xs font-semibold uppercase tracking-[0.3em] text-[var(--foreground)]">
                Vision Console
              </span>
              <span className="rounded-full border border-black/10 px-3 py-1 font-mono text-xs text-black/65">
                seeded demo state
              </span>
            </div>

            <div className="mt-6 space-y-3">
              <h2 className="text-2xl font-semibold tracking-tight text-[var(--foreground)]">
                Upload once, inspect detections, and keep the contract stable.
              </h2>
              <p className="max-w-xl text-sm leading-7 text-black/70">
                The docs preview uses a seeded response so the overlay, legend,
                segmentation controls, and review panel all stay screenshot-friendly.
              </p>
            </div>

            <AnalysisPreview
              fileName={docsPreviewResult.image.filename}
              previewDimensions={null}
              previewUrl={docsDemoImagePath}
              result={docsPreviewResult}
            />
          </div>

          <AnalysisResults
            emptyDescription=""
            emptyEyebrow=""
            emptyTitle=""
            result={docsPreviewResult}
          />
        </section>
      </div>
    </main>
  );
}
```
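The page above renders a seeded `docsPreviewResult` through the same components the live upload flow uses. On the live path, an unknown fetch payload has to be narrowed to the contract shape before rendering; a minimal runtime guard might look like the sketch below, where `Detection` and `AnalysisResult` are simplified stand-ins rather than the repo's actual response models:

```typescript
// Simplified stand-ins; the authoritative payload shape is defined in
// docs/openapi.yaml and the types generated from it.
type Detection = { label: string; confidence: number };
type AnalysisResult = { pipeline: string; detections: Detection[] };

// Narrow an unknown JSON payload to the expected result shape at runtime.
function isAnalysisResult(value: unknown): value is AnalysisResult {
  if (typeof value !== "object" || value === null) return false;
  const v = value as Partial<AnalysisResult>;
  return (
    typeof v.pipeline === "string" &&
    Array.isArray(v.detections) &&
    v.detections.every(
      (d) =>
        typeof d === "object" &&
        d !== null &&
        typeof d.label === "string" &&
        typeof d.confidence === "number",
    )
  );
}
```

Guards like this are cheap insurance that the webcam and segmentation extensions keep returning the same boundary shape the UI expects.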
New file (128 additions, 0 deletions):

```tsx
import Image from "next/image";

import { AnalysisResults } from "@/components/analysis-results";
import { docsDemoImagePath, docsWebcamResult } from "@/lib/docs-demo";

export default function DocsPreviewWebcamPage() {
  return (
    <main className="min-h-screen overflow-hidden px-6 py-8 lg:px-10 lg:py-10">
      <div className="mx-auto flex max-w-7xl flex-col gap-6">
        <section className="rounded-[36px] border border-black/10 bg-white/76 px-7 py-8 shadow-[0_32px_90px_rgba(10,20,25,0.12)] backdrop-blur-xl lg:px-10">
          <div className="flex flex-wrap items-center gap-3">
            <span className="rounded-full bg-[var(--accent-soft)] px-3 py-1 text-xs font-semibold uppercase tracking-[0.3em] text-[var(--foreground)]">
              Webcam Preview
            </span>
            <span className="rounded-full border border-black/10 px-3 py-1 font-mono text-xs text-black/60">
              Same API, different input source
            </span>
          </div>

          <h1 className="mt-6 max-w-4xl text-5xl font-semibold tracking-[-0.05em] text-[var(--foreground)]">
            The webcam extension still looks and feels like the same product.
          </h1>
          <p className="mt-4 max-w-3xl text-base leading-8 text-black/70">
            Capture is a frontend concern. The review surface, pipeline selection,
            and result shape all stay aligned with the upload flow.
          </p>
        </section>

        <section className="grid gap-6 lg:grid-cols-[0.95fr_1.05fr]">
          <div className="rounded-[32px] border border-black/10 bg-white/78 p-6 shadow-[0_32px_90px_rgba(10,20,25,0.12)] backdrop-blur-xl">
            <div className="flex flex-wrap items-center gap-3">
              <span className="rounded-full bg-[var(--accent-soft)] px-3 py-1 text-xs font-semibold uppercase tracking-[0.3em] text-[var(--foreground)]">
                Optional Mode
              </span>
              <span className="rounded-full border border-black/10 px-3 py-1 font-mono text-xs text-black/65">
                seeded capture preview
              </span>
            </div>

            <div className="mt-6 space-y-3">
              <h2 className="text-2xl font-semibold tracking-tight text-[var(--foreground)]">
                Reuse the detection contract from a live camera frame.
              </h2>
              <p className="max-w-xl text-sm leading-7 text-black/70">
                This mock state mirrors the public webcam page, but with a seeded
                frame so the docs can show the extension path clearly.
              </p>
            </div>

            <div className="mt-8 space-y-5">
              <div className="rounded-[24px] border border-black/10 bg-white px-4 py-4 text-sm text-black/70">
                <p className="font-semibold text-[var(--foreground)]">Starter Detection</p>
                <p className="mt-2 leading-7">
                  Detection-first sample pipeline that returns object-style boxes and
                  confidence scores.
                </p>
                <div className="mt-3 flex flex-wrap gap-2">
                  {["object boxes", "confidence scores", "coverage metrics"].map((item) => (
                    <span
                      key={item}
                      className="rounded-full bg-[var(--accent-soft)] px-3 py-1 font-mono text-xs text-black/70"
                    >
                      {item}
                    </span>
                  ))}
                </div>
              </div>

              <div className="overflow-hidden rounded-[24px] border border-black/10 bg-[#12242c]">
                <div className="relative aspect-video w-full">
                  <Image
                    alt="Seeded webcam capture preview"
                    className="object-cover"
                    fill
                    sizes="(max-width: 1024px) 100vw, 560px"
                    src={docsDemoImagePath}
                    unoptimized
                  />
                  <div className="absolute inset-0 bg-[linear-gradient(180deg,rgba(18,36,44,0.08),rgba(18,36,44,0.18))]" />
                </div>
                <div className="flex flex-wrap items-center justify-between gap-3 px-4 py-4 text-sm text-white/75">
                  <p className="font-mono text-xs uppercase tracking-[0.3em] text-white/45">
                    Camera state: live
                  </p>
                  <div className="flex flex-wrap gap-2">
                    <button
                      className="rounded-full bg-[var(--accent)] px-4 py-2 font-medium text-[#1d1007]"
                      type="button"
                    >
                      Restart camera
                    </button>
                    <button
                      className="rounded-full border border-white/15 px-4 py-2 font-medium text-white"
                      type="button"
                    >
                      Stop
                    </button>
                    <button
                      className="rounded-full border border-white/15 bg-white/8 px-4 py-2 font-medium text-white"
                      type="button"
                    >
                      Capture and analyze
                    </button>
                  </div>
                </div>
              </div>

              <div className="rounded-[24px] border border-black/10 bg-[#fff4ea] px-4 py-4 text-sm text-black/70">
                <p className="font-semibold text-[var(--foreground)]">Keep upload as the main path</p>
                <p className="mt-2 leading-7">
                  The starter still teaches image upload first. Webcam stays here as a
                  believable extension once the base contract already feels solid.
                </p>
              </div>
            </div>
          </div>

          <AnalysisResults
            emptyDescription=""
            emptyEyebrow=""
            emptyTitle=""
            result={docsWebcamResult}
          />
        </section>
      </div>
    </main>
  );
}
```
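Both the upload and webcam flows select from the same pipeline catalog served by the backend. The catalog pattern can be as small as a keyed map; the ids below mirror the README's pipeline names, while the structure itself is a hypothetical sketch, not the FastAPI backend's actual registry:

```typescript
// Hypothetical catalog entry for illustration; in the kit the registry
// lives in the backend and is exposed through the shared OpenAPI contract.
type PipelineInfo = {
  id: string;
  summary: string;
  outputs: string[];
};

const pipelineCatalog = new Map<string, PipelineInfo>([
  ["starter-detection", {
    id: "starter-detection",
    summary: "object-style boxes and confidence scores",
    outputs: ["boxes", "metrics"],
  }],
  ["foreground-segmentation", {
    id: "foreground-segmentation",
    summary: "region polygons plus derived boxes",
    outputs: ["polygons", "boxes"],
  }],
]);

// List ids for a pipeline picker; look an entry up before sending a request.
function listPipelineIds(): string[] {
  return [...pipelineCatalog.keys()];
}
```

Keeping capture sources ignorant of pipeline internals is what lets the webcam page reuse the upload flow's picker and review surface unchanged.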
