feat(helm): add persistent volume for local model cache by isac322 · Pull Request #861 · vectorize-io/hindsight

isac322 · 2026-04-03T09:45:10Z

Problem

When using local reranker models (e.g., BAAI/bge-reranker-v2-m3) or local embedding models, the models are downloaded to /home/hindsight/.cache on every pod restart. This causes slow startup (~1GB+ download) and unnecessary bandwidth usage.

Closes #860

Changes

values.yaml: Add api.persistence.modelCache and worker.persistence.modelCache config (disabled by default)
api-deployment.yaml: Mount PVC at /home/hindsight/.cache when enabled
api-model-cache-pvc.yaml: New PVC template for API model cache
worker-statefulset.yaml: Add volumeClaimTemplates for model cache when enabled
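
As a rough illustration of the API-side change, the rendered output with api.persistence.modelCache.enabled set to true might look something like the sketch below. Only the mount path, size, and storage class come from this PR's description; the resource names, labels, image, and access mode are assumptions made for the example and are not taken from the chart.

```yaml
# Hypothetical rendered manifests; names, labels, and image are placeholders.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: hindsight-api-model-cache      # assumed name
spec:
  accessModes:
    - ReadWriteOnce                    # assumed; the PR does not state the access mode
  storageClassName: standard           # from api.persistence.modelCache.storageClass
  resources:
    requests:
      storage: 5Gi                     # from api.persistence.modelCache.size
---
apiVersion: apps/v1
kind: Deployment
metadata:
  name: hindsight-api                  # assumed name
spec:
  replicas: 1
  selector:
    matchLabels:
      app: hindsight-api
  template:
    metadata:
      labels:
        app: hindsight-api
    spec:
      containers:
        - name: api
          image: example/hindsight-api:latest   # placeholder image
          volumeMounts:
            - name: model-cache
              mountPath: /home/hindsight/.cache # path named in the PR
      volumes:
        - name: model-cache
          persistentVolumeClaim:
            claimName: hindsight-api-model-cache
```

If the claim does use ReadWriteOnce, it can only be mounted from a single node at a time, which is worth keeping in mind when running more than one API replica.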

Usage

api:
  persistence:
    modelCache:
      enabled: true
      size: 5Gi
      storageClass: "standard"

worker:
  persistence:
    modelCache:
      enabled: true
      size: 5Gi

Notes

Disabled by default — no breaking changes
API uses a standalone PVC (Deployment)
Worker uses volumeClaimTemplates (StatefulSet) for per-replica storage
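
To make the per-replica behavior concrete, here is a minimal sketch of a StatefulSet that uses volumeClaimTemplates. The names, labels, image, replica count, and access mode are assumptions for illustration; only the mount path and size come from this PR's description.

```yaml
# Hypothetical StatefulSet fragment; not the chart's actual template output.
apiVersion: apps/v1
kind: StatefulSet
metadata:
  name: hindsight-worker               # assumed name
spec:
  serviceName: hindsight-worker        # assumed headless service name
  replicas: 2
  selector:
    matchLabels:
      app: hindsight-worker
  template:
    metadata:
      labels:
        app: hindsight-worker
    spec:
      containers:
        - name: worker
          image: example/hindsight-worker:latest  # placeholder image
          volumeMounts:
            - name: model-cache
              mountPath: /home/hindsight/.cache   # path named in the PR
  volumeClaimTemplates:
    - metadata:
        name: model-cache
      spec:
        accessModes:
          - ReadWriteOnce              # assumed; the PR does not state the access mode
        resources:
          requests:
            storage: 5Gi               # from worker.persistence.modelCache.size
```

Kubernetes creates one claim per pod from each template, named after the template and the pod (here model-cache-hindsight-worker-0 and model-cache-hindsight-worker-1), so each worker replica keeps its own copy of the downloaded models across restarts.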

When using local reranker (e.g., BAAI/bge-reranker-v2-m3) or local embedding models, the models are downloaded to /home/hindsight/.cache on every pod restart, causing slow startup and unnecessary bandwidth.

Add optional persistent volume support:
- api: PVC mounted at /home/hindsight/.cache
- worker: volumeClaimTemplate (StatefulSet) at same path

Disabled by default. Enable via:
  api.persistence.modelCache.enabled: true
  worker.persistence.modelCache.enabled: true

Closes vectorize-io#860

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Allow users to mount arbitrary volumes (configMaps, secrets, emptyDir, etc.) into api and worker pods via values, following common helm chart library conventions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
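
The second commit does not name the values keys it adds. The sketch below assumes the extraVolumes / extraVolumeMounts naming that many community charts use; these key names, the ConfigMap name, and the mount paths are hypothetical and should be checked against the chart's values.yaml before use.

```yaml
# Hypothetical values; the key names extraVolumes and extraVolumeMounts are assumed.
api:
  extraVolumes:
    - name: app-config
      configMap:
        name: hindsight-extra-config   # assumed ConfigMap, created separately
  extraVolumeMounts:
    - name: app-config
      mountPath: /etc/hindsight/extra  # assumed path
      readOnly: true

worker:
  extraVolumes:
    - name: scratch
      emptyDir: {}                     # ephemeral scratch space, lost when the pod is removed
  extraVolumeMounts:
    - name: scratch
      mountPath: /tmp/scratch          # assumed path
```

Each entry under the volumes key follows the standard Kubernetes pod volume schema, and each mount refers to a volume by its name.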

nicoloboschi

awesome, thank you!
I wanted to add this some time ago but couldn't find the time, thanks!!!

isac322 force-pushed the feat/helm-model-cache-volume branch from ab2e152 to 4de08f4 on April 3, 2026 09:49

nicoloboschi approved these changes Apr 7, 2026


nicoloboschi merged commit cefa755 into vectorize-io:main Apr 7, 2026
4 checks passed

isac322 deleted the feat/helm-model-cache-volume branch April 7, 2026 07:26

nicoloboschi merged 2 commits into vectorize-io:main from isac322:feat/helm-model-cache-volume
