feat: Replace Playwright-based browser automation with browser-use #194

@0xallam

Description

Our current Playwright-based browser layer effectively requires vision/screenshot reasoning. Non-vision LLMs can’t interact with the browser today.

Motivations:

Non-vision model support: Browser Use exposes browser control through higher-level APIs designed for LLM agents, so interactions do not require a vision model, whereas our current Playwright usage relies on screenshots. Non-vision models can therefore work reliably with the browser.

Stealth/anti-bot advantages: The Browser Use ecosystem includes “stealth browser” features, proxy rotation, CAPTCHA bypass, and session persistence, which can help get past Cloudflare and other anti-bot defenses more reliably than plain Playwright.

Browser profiles: Built-in support for persistent browser profiles allows long-lived sessions, cookies, stored tokens, etc., improving stability across agent runs.

Headful login flows: Supports running a real visible browser for initial onboarding flows (SSO, 2FA, magic links, CAPTCHA), then re-using the authenticated profile headlessly afterward.
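
To make the non-vision point above concrete, here is a minimal sketch of the general idea: the page is turned into a numbered, text-only list of interactive elements that a model can refer to by index instead of reading a screenshot. This is not Browser Use's implementation or API. The tag list, `InteractiveElementIndexer`, and `describe_page` are hypothetical names chosen for illustration, and the sketch uses only the Python standard library.

```python
from html.parser import HTMLParser

# Tags treated as interactive in this sketch (an assumption, not Browser Use's list).
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}
VOID_TAGS = {"input"}  # elements with no closing tag or inner text


class InteractiveElementIndexer(HTMLParser):
    """Collect interactive elements in document order."""

    def __init__(self):
        super().__init__()
        self.elements = []  # one dict per element: tag, attrs, text
        self._open = None   # element currently collecting inner text

    def handle_starttag(self, tag, attrs):
        if tag not in INTERACTIVE_TAGS:
            return
        element = {"tag": tag, "attrs": dict(attrs), "text": ""}
        self.elements.append(element)
        if tag not in VOID_TAGS:
            self._open = element

    def handle_endtag(self, tag):
        if self._open is not None and self._open["tag"] == tag:
            self._open = None

    def handle_data(self, data):
        if self._open is not None:
            self._open["text"] += data


def describe_page(html: str) -> str:
    """Render the page as indexed lines that a text-only model can read."""
    parser = InteractiveElementIndexer()
    parser.feed(html)
    parser.close()
    lines = []
    for index, el in enumerate(parser.elements):
        # Prefer visible text, then fall back to placeholder or name.
        label = (
            " ".join(el["text"].split())
            or el["attrs"].get("placeholder")
            or el["attrs"].get("name")
            or ""
        )
        lines.append(f"[{index}] <{el['tag']}> {label}".rstrip())
    return "\n".join(lines)
```

For a login form with an email input, a password input, a "Sign in" button, and a "Forgot password?" link, `describe_page` would produce four lines numbered `[0]` to `[3]`, and the model could answer with something like "click 2" rather than pixel coordinates.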


Browser Use docs: https://docs.browser-use.com/introduction
Browser Use repository: https://github.com/browser-use/browser-use

Metadata

Assignees: No one assigned
Labels: enhancement
Status: In progress