Skip to content

fix(jianyu): block inaccessible detail links and verification pages#918

Open
leozejia wants to merge 1 commit intojackwener:mainfrom
leozejia:feat/jianyu-blocked-detail-clean
Open

fix(jianyu): block inaccessible detail links and verification pages#918
leozejia wants to merge 1 commit intojackwener:mainfrom
leozejia:feat/jianyu-blocked-detail-clean

Conversation

@leozejia
Copy link
Copy Markdown
Contributor

@leozejia leozejia commented Apr 9, 2026

Summary

  • classify jianyu detail targets as ok | blocked | entry_only
  • block inaccessible links (/nologin/content/, /article/bdprivate/) from search outputs
  • keep only detail_status=ok rows, and expose published_at, source_id, notice_id
  • add --since_days recency gate (default 30 days)
  • treat verification/captcha detail pages as [taxonomy=selector_drift] instead of successful detail extraction

Why

Some runs still surfaced inaccessible Jianyu links as valid candidates, and detail extraction could return verification pages as if they were usable content.

Validation

  • npm run test:adapter -- clis/jianyu/search.test.ts clis/jianyu/shared/procurement-detail.test.ts
  • Manual check: jianyu detail <nologin url> now fails fast with selector_drift

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant