Skip to content

fix: correct order-service image name typo causing ImagePullBackOff (fixes #33)#34

Closed
github-actions[bot] wants to merge 1 commit intomainfrom
fix/cluster-doctor/order-service-typo-issue-33-20260326
Closed

fix: correct order-service image name typo causing ImagePullBackOff (fixes #33)#34
github-actions[bot] wants to merge 1 commit intomainfrom
fix/cluster-doctor/order-service-typo-issue-33-20260326

Conversation

@github-actions
Copy link
Copy Markdown

🩺 Cluster Doctor — Automated Fix for Issue DevExpGbb#33

Summary

Root Cause: The order-service Deployment in Act-3/argocd/apps/broken-aks-store-all-in-one.yaml references a non-existent container image due to a one-character typo.

Value
Broken image ghcr.io/azure-samples/aks-store-demo/order-servic:2.1.0
Fixed image ghcr.io/azure-samples/aks-store-demo/order-service:2.1.0

Diagnosis Evidence

Cluster: msftgbb · Namespace: default · ArgoCD App: agentic-platform-engineering-demo

The ArgoCD Application is configured to source from Act-3/argocd/apps/ (confirmed via live cluster inspection). The broken-aks-store-all-in-one.yaml in that path had the typo, causing:

Failed to pull image "ghcr.io/azure-samples/aks-store-demo/order-servic:2.1.0":
  failed to resolve reference: 403 Forbidden

Pod state at time of diagnosis:

order-service-74887bf86-rhnrh   0/1   ImagePullBackOff   0   96s

ArgoCD sync result (from cluster):

images:
  - ghcr.io/azure-samples/aks-store-demo/order-servic:2.1.0  # ← typo deployed
message: deployment.apps/order-service configured

Change

File: Act-3/argocd/apps/broken-aks-store-all-in-one.yaml

-          image: ghcr.io/azure-samples/aks-store-demo/order-servic:2.1.0
+          image: ghcr.io/azure-samples/aks-store-demo/order-service:2.1.0

Test Plan

  1. Merge this PR → ArgoCD auto-sync will pick up the corrected image reference within ~3 minutes
  2. Validate: kubectl get pods -n defaultorder-service pod should transition to Running
  3. ArgoCD health should change from DegradedHealthy

Rollback

Revert this PR and ArgoCD selfHeal: true will re-apply the broken image. No data loss risk.


🤖 Automatically diagnosed and remediated by Cluster Doctor

…ice)

Fixes DevExpGbb#33 - ArgoCD deployment degraded due to ImagePullBackOff on
order-service pod. The image name was missing the trailing 'e':
  BROKEN:  ghcr.io/azure-samples/aks-store-demo/order-servic:2.1.0
  FIXED:   ghcr.io/azure-samples/aks-store-demo/order-service:2.1.0

The ArgoCD application 'agentic-platform-engineering-demo' is sourcing
from Act-3/argocd/apps/ where this typo caused the order-service
Deployment to fail with:
  "failed to resolve reference ... 403 Forbidden"

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@raykao
Copy link
Copy Markdown

raykao commented Mar 28, 2026

Closing - this is a standard demo scenario PR. The image typo is intentional in the demo environment to demonstrate cluster failure detection.

@raykao raykao closed this Mar 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant