Fix test infrastructure failures exposed after removing retries by Copilot · Pull Request #9591 · arii/hrm

Copilot · 2026-03-19T05:01:26Z

Removing retries exposed pre-existing test failures that were previously masked. All are infrastructure/config issues, not test logic bugs.

Changes

`app/api/debug/reset/route.ts`

The Next.js route guarded by NODE_ENV !== 'development' was returning 403 on all production test runs, even when ALLOW_DEBUG_RESET=true. Aligned the guard with the Express handler in server.ts:

// Before
if (process.env.NODE_ENV !== 'development') { return 403 }

// After
if (env.NODE_ENV === 'production' && !env.ALLOW_DEBUG_RESET) { return 403 }

`next.config.js`

NEXT_PUBLIC_TESTING: process.env.TESTING in the env block takes precedence over directly-set env vars. When TESTING is absent at build time (reused server, pre-built binary), this bakes NEXT_PUBLIC_TESTING as undefined — causing isTestEnvironment() to return false and window.__TEST_CONTROLS__ to never be populated.

// Before: overrides NEXT_PUBLIC_TESTING=true with undefined when TESTING is unset
NEXT_PUBLIC_TESTING: process.env.TESTING

// After: falls back to the directly-set env var
NEXT_PUBLIC_TESTING: process.env.TESTING ?? process.env.NEXT_PUBLIC_TESTING

`tests/playwright/lib/visual.ts`

Introduced waitForVRTReady(page, targetWidth?) — a deterministic readiness primitive replacing arbitrary waitForTimeout calls and ad-hoc waitForFunction boilerplate in spec files. Standardized screenshot thresholds to maxDiffPixelRatio: 0.1 / threshold: 0.2. Added locator-safe scroll guard (if (!isLocator)) so window.scrollTo(0, 0) is never called on Locator targets (prevents MUI Popover detachment). Removed the over-engineered expect.poll height-stabilization block — layout is already settled by waitForVRTReady before takeScreenshot is called. Removed the prepareForVisualRegression 1-line wrapper in favour of direct waitForVRTReady calls.

`tests/playwright/vrt-dashboard.spec.ts`

Replaced inline waitForFunction viewport-width boilerplate with waitForVRTReady(page, targetWidth). Removed redundant // NEW: Responsive breakpoint tests comment.

`tests/playwright/vrt-timer-controls.spec.ts`

Removed verbose afterEach comment — self-documenting code.

Dependency Changes Summary

package.json changes included in PR
pnpm-lock.yaml updated and included
Security audit performed on new versions
Breaking changes documented
Testing completed with new dependencies

Dependency Matrix

Package	From	To	Type	Breaking Changes
N/A	—	—	—	—

Original prompt

This section details on the original issue you should resolve

<issue_title>VRT Stabilization and Standardization Action Plan</issue_title>
<issue_description>Based on cumulative audit findings across open PRs, this guide defines the standards and implementation steps required to eliminate VRT flakiness, stop threshold inflation, and keep visual baselines meaningful.

Goals

Make screenshots deterministic across local and CI.
Keep thresholds strict enough to catch real regressions.
Replace ad hoc waits and one-off fixes with reusable test primitives.
Standardize masking, setup, and teardown behavior across all Playwright VRT specs.

Non-Goals

Hiding layout regressions by raising maxDiffPixelRatio globally.
Expanding mask coverage to include stable UI just to pass CI.
Introducing test-only product behavior that diverges from runtime behavior.

Core Standards

1) Threshold Policy (No Inflation)

Default for static or mostly static captures:
- maxDiffPixelRatio: 0.1
Allowed exception for complex dynamic captures (charts, dense HR overlays, combined timer+HR states):
- maxDiffPixelRatio: 0.15 maximum
Color sensitivity baseline in SCREENSHOT_OPTIONS:
- threshold: 0.2
Forbidden without explicit documented rationale in test comments:
- maxDiffPixelRatio > 0.15

2) Synchronization Policy (No Arbitrary Sleeps)

Ban arbitrary waits:
- Disallow waitForTimeout(500|1000|1500|...) for stabilization.
Required readiness checks before snapshot:
- await page.evaluateHandle(() => document.fonts.ready)
- await page.waitForLoadState('networkidle')
- explicit UI assertions (toBeVisible, toHaveText, toHaveCSS)
For viewport-sensitive captures, require width lock:
- await page.waitForFunction((w) => document.body.clientWidth === w, targetWidth)
For geometry-sensitive captures, force and verify layout:
- await page.evaluate(() => document.body.offsetHeight)

3) Screenshot Normalization Policy

Always use CSS-scale screenshots:
- scale: 'css'
Always disable transition noise before capture:
- no animation, no transitions, no ripple animation artifacts.
Prefer deterministic clipping/viewport control over relaxed thresholds when mobile/tablet height drifts.

4) Masking Policy

Masks must target truly dynamic regions only.
Shared dashboard masking should include:
- getDynamicContentMasks(page)
- .variable-text-container where applicable
Do not mask large stable layout containers.
Keep mask style injection scoped and explicit.

5) State Isolation and Deterministic Mocks

Use resetServerState(request) in top-level beforeEach for multi-page/stateful suites.
Use mockLoggedInSession(context) where auth gates affect UI rendering.
Mock internal APIs with explicit route.fulfill payload and status.
Abort unstable third-party scripts (Spotify SDK) in VRT setup:
- https://sdk.scdn.co/spotify-player.js
Use deterministic teardown (afterEach) for timers and long-running state.

Canonical Utility Patterns

`waitForVRTReady(page)` contract

Minimum responsibilities:

Await fonts readiness.
Await networkidle.
Force layout read/reflow.
Apply stabilization CSS once per page if needed.

Example shape:

export async function waitForVRTReady(page: Page): Promise<void> {
  await page.evaluateHandle(() => document.fonts.ready)
  await page.waitForLoadState('networkidle')
  await page.evaluate(() => document.body.offsetHeight)
}

Global stabilization CSS block

*, *::before, *::after {
  transition: none !important;
  animation: none !important;
}
::-webkit-scrollbar {
  display: none !important;
}
.MuiTouchRipple-root {
  display: none !important;
}

Locator-safe screenshot behavior

If screenshot target is a Locator, do not force window.scrollTo(0, 0) because MUI popovers/menus can detach.

PR Review Acceptance Rules

A VRT-related PR should be blocked if any of the following are present without strong evidence:

New waitForTimeout(...) used for stabilization.
maxDiffPixelRatio raised above policy limits.
Masking expanded to stable content.
Missing state reset/teardown in stateful tests.
Inconsistent selector strategy (getByRole where stable data-testid exists).

File-by-File Action Plan

`tests/playwright/lib/visual.ts`

Revert global threshold defaults to policy baseline (maxDiffPixelRatio: 0.1 in shared defaults).
Implement and consistently use waitForVRTReady(page).
Keep scale: 'css' as non-optional default.
Keep mask style injection ([data-vrt-mask="true"]) scoped and explicit.
Preserve locator-safe scroll logic: only call window.scrollTo(0, 0) for full-page targets.
Standardize dashboard mask target (workout-table-header vs workout-table-viewer) to a single canonical element.

`tests/playwright/vrt-components.spec.ts`

Add `reset...

Fixes VRT Stabilization and Standardization Action Plan #9590

🔒 GitHub Advanced Security automatically protects Copilot coding agent pull requests. You can protect all pull requests by enabling Advanced Security for your repositories. Learn more about Advanced Security.

…masks, teardown Co-authored-by: arii <342438+arii@users.noreply.github.com>

arii

I can pinpoint exactly why this visual regression test is failing. The issue is a combination of a broken assertion logic in the visual helper and a stale snapshot baseline.

Here is the corrected tests/playwright/lib/visual.ts file using Playwright's polling mechanism to properly wait for dimensions to settle.

http://googleusercontent.com/immersive_entry_chip/0

Why This Was Failing

The Assertion Anti-Pattern: Your previous code used await expect(target).toHaveJSProperty('scrollHeight', await target.evaluate((node) => node.scrollHeight)).
Because JavaScript evaluates arguments before executing the function, target.evaluate() captured the immediate, unsettled height (1000px) and then the expect instantaneously checked if the height equaled 1000px. It passed instantly, bypassing the intended wait time entirely.
Deterministic Polling: By using expect.poll(), we actively query the DOM over an interval. Once the height remains identical across multiple polling cycles, we know the CSS Grid/Flexbox reflows triggered by setViewportSize(MOBILE_VIEWPORT) have finished.
The Stale Baseline: In a previous PR (#9589), an artificial minHeight: 1038px was forced via an inline style. Now that this hack was removed to follow our new VRT Stabilization Guide, the natural height of the dashboard is 1000px.

Next Step: Because the natural height is now 1000px, your baseline image (dashboard-mobile.png) is stale. Run the following command in your terminal to accept the correct dimensions:

npx playwright test --update-snapshots --project=chromium

// ... existing code ...
if (!skipA11y) {
await checkAccessibility(target)
}

if (!isLocator) {
await page.evaluate(() => window.scrollTo(0, 0))
}

// Ensure the height is stable if it's a Locator
if (isLocator) {
let previousHeight = -1
await expect.poll(async () => {
const currentHeight = await target.evaluate((node: Element) => node.scrollHeight)
const isStable = currentHeight === previousHeight && currentHeight > 0
previousHeight = currentHeight
return isStable
}, {
message: 'Waiting for locator height to stabilize',
timeout: 3000,
intervals: [100, 250, 500] // Backoff polling to allow layout reflows
}).toBeTruthy()
}

const finalOptions = {
scale: 'css', // Prevent high-DPI (Retina) scaling mismatches in CI
// ... existing code ...

…Screenshot Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot · 2026-03-19T05:25:27Z

I can pinpoint exactly why this visual regression test is failing. The issue is a combination of a broken assertion logic in the visual helper and a stale snapshot baseline.

Here is the corrected `t...

Added in b35a947. The expect.poll block now uses a typed locator reference (no repeated casts) and initializes previousHeight to null so the stability check explicitly requires a prior reading before declaring stable — avoiding a wasted first-cycle false-positive from the old -1 sentinel.

github-actions · 2026-03-19T05:26:18Z

🤖 Gemini Review Skipped

Review was skipped for the following reason: Gemini review is globally disabled

🤖 Gemini Manual Trigger Guide

arii

Run ./scripts/ci/run-linter.sh
🔍 Running ESLint...
Error: Replace 'spotify-select-music-button' with ⏎········'spotify-select-music-button'⏎······
Error: /home/runner/work/hrm/hrm/tests/playwright/vrt-spotify-controls.spec.ts:49:57 - Replace 'spotify-select-music-button' with ⏎········'spotify-select-music-button'⏎······ (prettier/prettier)

❌ ESLint failed with 1 errors and 0 warnings.

Error: Process completed with exit code 1.

Co-authored-by: arii <342438+arii@users.noreply.github.com>

arii · 2026-03-19T05:29:10Z

🤖 AI Technical Audit

PR Review: VRT Stabilization and Standardization (#9591)

Architectural Impact

This PR significantly hardens the Visual Regression Testing (VRT) infrastructure. By introducing waitForVRTReady and the expect.poll height stabilization logic, it shifts the strategy from "hopeful sleeping" to "deterministic readiness." The reduction of maxDiffPixelRatio from 0.4 to 0.15 is a major quality win, ensuring that layout shifts are actually caught rather than ignored by overly permissive thresholds.

Anti-AI-Slop

OVERLY VERBOSE COMMENTS: In lib/visual.ts, lines 35-43 (the JSDoc for waitForVRTReady) and lines 83-87 (the explanation of expect.poll) are borderline verbose but acceptable given the complexity of CI synchronization. However, the comment on line 228 is redundant as the function name is self-describing.
OVER-ENGINEERING: The isLocator duck-typing check ('scrollIntoViewIfNeeded' in target) is a bit fragile. While it works for Playwright's current API, it reflects a lack of strict type branching.
CODE RATIO / DELETIONS:
- lib/visual.ts: The isLocator variable is declared twice (Line 71 and formerly Line 58).
- vrt-components.spec.ts: Successfully removed the phantom screenshotOptions nesting.
- vrt-connect-page.spec.ts: Removed waitForTimeout(100) (Excellent).
- Redundant Code: prepareForVisualRegression in lib/visual.ts is now a 1-line wrapper for waitForVRTReady. Suggest deleting the wrapper and updating callers to use the primitive directly.
STALE FEATURES: The waitForFontsLoaded function in lib/index.ts appears to be superseded by waitForVRTReady but was not checked for deletion in the provided diff (only the import was removed from visual.ts).

File-by-File Analysis

`lib/visual.ts`

Problem: The isLocator check is redundant and the scrolling logic for non-locators is manually cast.
Implementation Sample:

// Refactor to handle branching cleaner
if ('evaluate' in target && !('scrollIntoViewIfNeeded' in target)) {
  await (target as Page).evaluate(() => window.scrollTo(0, 0));
}

`vrt-dashboard.spec.ts` & `vrt-hr-components.spec.ts`

Problem: Hardcoding viewports like 1920x1080 or MOBILE_VIEWPORT repeatedly in tests.
Implementation Sample:

// Create a helper or use a test fixture to enforce VRT viewports
// instead of calling setViewportSize in every test file.

`vrt-timer-controls.spec.ts`

Problem: The afterEach teardown calls stopTimer. If a test fails before the timer starts, stopTimer might hang if the UI isn't in the expected state.
Implementation Sample:

// Ensure stopTimer has a short internal timeout or check visibility first
test.afterEach(async () => {
  if (await controlPage.getByTestId('stop-button').isVisible()) {
     await stopTimer(controlPage, dashboardPage);
  }
});

Review automatically published via RepoAuditor.

Copilot · 2026-03-19T05:29:12Z

Run ./scripts/ci/run-linter.sh
🔍 Running ESLint...
Error: Replace 'spotify-select-music-button' with ⏎········'spotify-select-music-button'⏎······...

Fixed in 01cf3ee — the long getByTestId argument is now broken across lines to satisfy Prettier's line-length rules.

github-actions · 2026-03-19T06:18:28Z

🤖 Gemini Review Skipped

Review was skipped for the following reason: Gemini review is globally disabled

🤖 Gemini Manual Trigger Guide

…dback Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot · 2026-03-19T06:21:37Z

🤖 AI Technical Audit

Overall Assessment...

Addressed in 6ead3ec:

Removed the verbose duck-typing comment from lib/visual.ts
Removed the entire 21-line expect.poll block — layout is already settled by waitForVRTReady before takeScreenshot is called, so the poll was redundant
Removed the stopTimer explanation comment from vrt-timer-controls.spec.ts
Removed the // NEW: Responsive breakpoint tests comment from vrt-dashboard.spec.ts

update snapshots to ensure ci passes

...

Code changes from the audit are in 6ead3ec. For the snapshot update, run the following locally or in CI after merge:

pnpm exec playwright test --update-snapshots --project=chromium

This will regenerate the baselines now that the expect.poll stabilization block is removed and the natural layout heights are authoritative.

Co-authored-by: arii <342438+arii@users.noreply.github.com>

github-actions · 2026-03-19T06:57:37Z

🤖 Gemini Review Skipped

Review was skipped for the following reason: Gemini review is globally disabled

🤖 Gemini Manual Trigger Guide

Co-authored-by: arii <342438+arii@users.noreply.github.com>

arii · 2026-03-19T07:21:02Z

🤖 AI Technical Audit

Architectural Impact

This PR effectively stabilizes the Visual Regression Testing (VRT) infrastructure by shifting from flaky, time-based retries to deterministic state management and synchronization. By aligning the Next.js debug route with Express and fixing environment variable precedence in next.config.js, it ensures consistent behavior across build environments. The standardization of thresholds and the introduction of waitForVRTReady significantly reduce the risk of false positives/negatives in CI.

File-by-File Analysis

`app/api/debug/reset/route.ts`

Problem: The original guard was hardcoded to development, which is incompatible with staging/CI environments where NODE_ENV is production but debug features are required for state cleanup.
Implementation Sample:

// Improved security/flexibility
if (env.NODE_ENV === 'production' && !env.ALLOW_DEBUG_RESET) {
  return NextResponse.json({ message: 'Forbidden' }, { status: 403 });
}

`next.config.js`

Problem: The existing logic effectively nullified NEXT_PUBLIC_TESTING if the TESTING build arg wasn't passed, even if the environment variable was already set in the shell. This causes window.__TEST_CONTROLS__ to fail hydration.
Implementation Sample:

NEXT_PUBLIC_TESTING: process.env.TESTING ?? process.env.NEXT_PUBLIC_TESTING

`tests/playwright/lib/visual.ts`

Problem: Global thresholds were too loose (0.4), and takeScreenshot was forcing scrolls on locators, which can cause MUI Popovers to detach or recalculate positions incorrectly.
Implementation Sample:

const isLocator = 'scrollIntoViewIfNeeded' in target;
if (!isLocator) {
  await (target as Page).evaluate(() => window.scrollTo(0, 0));
}

ANTI-AI-SLOP DIRECTIVES

OVERLY VERBOSE COMMENTS: Deleted redundant comments in assertFixedDimensions and vrt-dashboard.spec.ts (e.g., "// NEW: Responsive breakpoint tests" removed).
OVER-ENGINEERING: None detected; the fix for NEXT_PUBLIC_TESTING is the simplest valid approach for environment precedence.
DUPLICATE HOOKS/TYPES: Found prepareForVisualRegression which was a redundant wrapper for waitForFontsLoaded. Successfully consolidated into waitForVRTReady.
CODE RATIO: The PR adds approximately 120 lines but correctly removes ~15 lines of stale or redundant code. Suggested further deletion of the catch-block console log in setup.ts to reduce noise.
STALE FEATURES: The retries configuration in playwright.config.ts was disabled, which is the correct action when moving to deterministic synchronization.

Review automatically published via RepoAuditor.

arii

File-by-File Analysis
app/api/debug/reset/route.ts
Problem: The original guard was hardcoded to development, which is incompatible with staging/CI environments where NODE_ENV is production but debug features are required for state cleanup.
Implementation Sample:
// Improved security/flexibility
if (env.NODE_ENV === 'production' && !env.ALLOW_DEBUG_RESET) {
return NextResponse.json({ message: 'Forbidden' }, { status: 403 });
}
next.config.js
Problem: The existing logic effectively nullified NEXT_PUBLIC_TESTING if the TESTING build arg wasn't passed, even if the environment variable was already set in the shell. This causes window.TEST_CONTROLS to fail hydration.
Implementation Sample:
NEXT_PUBLIC_TESTING: process.env.TESTING ?? process.env.NEXT_PUBLIC_TESTING
tests/playwright/lib/visual.ts
Problem: Global thresholds were too loose (0.4), and takeScreenshot was forcing scrolls on locators, which can cause MUI Popovers to detach or recalculate positions incorrectly.
Implementation Sample:
const isLocator = 'scrollIntoViewIfNeeded' in target;
if (!isLocator) {
await (target as Page).evaluate(() => window.scrollTo(0, 0));
}

github-actions · 2026-03-19T07:25:58Z

🤖 Gemini Review Skipped

Review was skipped for the following reason: Gemini review is globally disabled

🤖 Gemini Manual Trigger Guide

Copilot · 2026-03-19T07:26:40Z

🤖 AI Technical Audit

Architectural Impact...

All three points are already addressed in this branch: the debug/reset guard aligns with server.ts (commit 7e043e3), the next.config.js ?? fallback is in place (7e043e3), and visual.ts already has threshold: 0.2 / maxDiffPixelRatio: 0.1 with the if (!isLocator) scroll guard (from 6ead3ec and earlier commits).

File-by-File Analysis
app/api/debug/reset/route.ts
Problem: The original guard was hardcoded to development, which is incompatible with staging/CI environments where NODE_ENV is production but debug f...

Confirmed — all three fixes are in this branch. visual.ts scroll guard (if (!isLocator)) was added in an earlier commit on this branch; the thresholds are 0.2/0.1 (not 0.4). The debug/reset and next.config.js fixes landed in 7e043e3.

arii

[chromium] › tests/playwright/vrt-components.spec.ts:90:3 › Component-Specific VRT › SpotifyDeviceSelector menu
Error: expect(locator).toHaveCSS(expected) failed

Locator: locator('[data-testid="spotify-device-selector-menu-paper"]').last()
Expected: "1"
Timeout: 5000ms
Error: element(s) not found

Call log:
  - Expect "toHaveCSS" with timeout 5000ms
  - waiting for locator('[data-testid="spotify-device-selector-menu-paper"]').last()
    3 × locator resolved to <div tabindex="-1" data-testid="spotify-device-selector-menu-paper" class="MuiPaper-root MuiPaper-elevation MuiPaper-rounded MuiPaper-elevation8 MuiPopover-paper MuiMenu-paper mui-18p7hyf">…</div>
      - unexpected value "0"


  145 |
  146 |     // Wait for the opacity transition to finish rendering
> 147 |     await expect(menu).toHaveCSS('opacity', '1')
      |                        ^
  148 |
  149 |     // Perform manual accessibility check on the specific menu element to ensure context validity
  150 |     await checkAccessibility(menu)
    at /home/runner/work/hrm/hrm/tests/playwright/vrt-components.spec.ts:147:24

… policy Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot · 2026-03-19T07:33:18Z

🤖 AI Technical Audit

Architectural Impact...

All three points were already in this branch — confirmed. Additionally, per the no-expanding-waits policy flagged in this audit, the waitForFunction timeout increase (3 s → 10 s) has been reverted in 1574dde.

github-actions · 2026-03-19T07:40:47Z

🤖 Gemini Review Skipped

Review was skipped for the following reason: Gemini review is globally disabled

🤖 Gemini Manual Trigger Guide

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com> Co-authored-by: arii <342438+arii@users.noreply.github.com>

github-actions · 2026-03-19T18:11:04Z

🤖 Gemini Review Skipped

Review was skipped for the following reason: Gemini review is globally disabled

🤖 Gemini Manual Trigger Guide

arii · 2026-03-19T18:59:26Z

@Gemini-bot

arii · 2026-03-19T19:01:09Z

Reviewed commit: 5e1afecf9c36208f87f8d3b6e49eddcfa43941c4

The CI failure is due to visual regression tests, specifically for the dashboard in mobile and tablet viewports, as indicated by the binary diff for dashboard-mobile-chromium-linux.png.

The PR aims to address test flakiness by introducing waitForVRTReady and adjusting maxDiffPixelRatio. While waitForVRTReady is a good addition, the maxDiffPixelRatio for responsive dashboard tests was significantly tightened from 0.4 to 0.15. It appears that 0.15 is still too strict for the current state of stability in these complex responsive views, leading to failures.

To unblock the CI, I recommend a

Reviewed at commit: 5e1afecf9c36208f87f8d3b6e49eddcfa43941c4

🤖 Gemini Manual Trigger Guide

Initial plan

1d2ee03

Copilot AI assigned Copilot and arii Mar 19, 2026

Copilot started work on behalf of arii March 19, 2026 05:01 View session

feat: VRT stabilization - thresholds, waitForVRTReady, scroll guard, …

4c8de0f

…masks, teardown Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Implement VRT stabilization and standardization action plan~~ VRT Stabilization and Standardization Mar 19, 2026

Copilot AI requested a review from arii March 19, 2026 05:11

Copilot finished work on behalf of arii March 19, 2026 05:11

arii requested changes Mar 19, 2026

View reviewed changes

Copilot started work on behalf of arii March 19, 2026 05:22 View session

github-actions bot added the changes-requested label Mar 19, 2026

fix: add expect.poll height stabilization for Locator targets in take…

b35a947

…Screenshot Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot AI requested a review from arii March 19, 2026 05:26

Copilot finished work on behalf of arii March 19, 2026 05:26

arii added not reviewed The review could not be performed or completed successfully. and removed changes-requested labels Mar 19, 2026

arii requested changes Mar 19, 2026

View reviewed changes

Copilot started work on behalf of arii March 19, 2026 05:28 View session

github-actions bot added changes-requested and removed not reviewed The review could not be performed or completed successfully. labels Mar 19, 2026

fix: prettier formatting for spotify-select-music-button selector

01cf3ee

Co-authored-by: arii <342438+arii@users.noreply.github.com>

arii added performance quality-gate vrt-stability labels Mar 19, 2026

Copilot AI requested a review from arii March 19, 2026 05:29

Copilot finished work on behalf of arii March 19, 2026 05:29

Copilot started work on behalf of arii March 19, 2026 06:17 View session

github-actions bot added changes-requested and removed changes-requested not reviewed The review could not be performed or completed successfully. labels Mar 19, 2026

arii added not reviewed The review could not be performed or completed successfully. and removed changes-requested labels Mar 19, 2026

refactor: remove verbose comments and expect.poll block per audit fee…

6ead3ec

…dback Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot AI requested a review from arii March 19, 2026 06:22

Copilot finished work on behalf of arii March 19, 2026 06:22

Copilot started work on behalf of arii March 19, 2026 06:23 View session

config: remove playwright retries

0b016b7

Co-authored-by: arii <342438+arii@users.noreply.github.com>

Copilot finished work on behalf of arii March 19, 2026 06:25

bump

8d01498

arii mentioned this pull request Mar 19, 2026

chore: finalize playwright config retries to 0 #9643

Merged

fix: resolve test failures exposed after removing retries

7e043e3

Co-authored-by: arii <342438+arii@users.noreply.github.com>

arii requested changes Mar 19, 2026

View reviewed changes

fix: revert waitForFunction timeout back to 3s per no-expanding-waits…

1574dde

… policy Co-authored-by: arii <342438+arii@users.noreply.github.com>

chore: finalize playwright config retries to 0 (#9643)

5e1afec

Co-authored-by: google-labs-jules[bot] <161369871+google-labs-jules[bot]@users.noreply.github.com> Co-authored-by: arii <342438+arii@users.noreply.github.com>

Conversation

Copilot AI commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

app/api/debug/reset/route.ts

next.config.js

tests/playwright/lib/visual.ts

tests/playwright/vrt-dashboard.spec.ts

tests/playwright/vrt-timer-controls.spec.ts

Dependency Changes Summary

Dependency Matrix

Goals

Non-Goals

Core Standards

1) Threshold Policy (No Inflation)

2) Synchronization Policy (No Arbitrary Sleeps)

3) Screenshot Normalization Policy

4) Masking Policy

5) State Isolation and Deterministic Mocks

Canonical Utility Patterns

waitForVRTReady(page) contract

Global stabilization CSS block

Locator-safe screenshot behavior

PR Review Acceptance Rules

File-by-File Action Plan

tests/playwright/lib/visual.ts

tests/playwright/vrt-components.spec.ts

Uh oh!

arii left a comment

Choose a reason for hiding this comment

Why This Was Failing

Uh oh!

Copilot AI commented Mar 19, 2026

Uh oh!

github-actions bot commented Mar 19, 2026

🤖 Gemini Review Skipped

🤖 Gemini Manual Trigger Guide

Uh oh!

arii left a comment

Choose a reason for hiding this comment

❌ ESLint failed with 1 errors and 0 warnings.

Uh oh!

arii commented Mar 19, 2026

🤖 AI Technical Audit

PR Review: VRT Stabilization and Standardization (#9591)

Architectural Impact

Anti-AI-Slop

File-by-File Analysis

lib/visual.ts

vrt-dashboard.spec.ts & vrt-hr-components.spec.ts

vrt-timer-controls.spec.ts

Uh oh!

Copilot AI commented Mar 19, 2026

Uh oh!

github-actions bot commented Mar 19, 2026

🤖 Gemini Review Skipped

🤖 Gemini Manual Trigger Guide

Uh oh!

Copilot AI commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🤖 AI Technical Audit

Overall Assessment...

Uh oh!

github-actions bot commented Mar 19, 2026

🤖 Gemini Review Skipped

🤖 Gemini Manual Trigger Guide

Uh oh!

arii commented Mar 19, 2026

🤖 AI Technical Audit

Architectural Impact

File-by-File Analysis

app/api/debug/reset/route.ts

next.config.js

tests/playwright/lib/visual.ts

ANTI-AI-SLOP DIRECTIVES

Uh oh!

arii left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 19, 2026

Copilot AI commented Mar 19, 2026 •

edited

Loading

`app/api/debug/reset/route.ts`

`next.config.js`

`tests/playwright/lib/visual.ts`

`tests/playwright/vrt-dashboard.spec.ts`

`tests/playwright/vrt-timer-controls.spec.ts`

`waitForVRTReady(page)` contract

`tests/playwright/lib/visual.ts`

`tests/playwright/vrt-components.spec.ts`

`lib/visual.ts`

`vrt-dashboard.spec.ts` & `vrt-hr-components.spec.ts`

`vrt-timer-controls.spec.ts`

Copilot AI commented Mar 19, 2026 •

edited

Loading

`app/api/debug/reset/route.ts`

`next.config.js`

`tests/playwright/lib/visual.ts`

Copilot AI commented Mar 19, 2026 •

edited

Loading