fix: resolve dotenv loading in examples and tests #1155

shamsharoon · 2025-10-22T17:28:47Z

why

Examples were failing with "OpenAI API key is missing" errors despite having OPENAI_API_KEY in the .env file. The issue was that dotenv.config() in stagehand.config.ts was looking for .env in the current working directory (process.cwd()), which pointed to examples/ when running via pnpm run example, not the project root where .env actually exists.

Related Issue: dotenv peerDependency

what changed

stagehand.config.ts: Updated dotenv.config() to use path.resolve(__dirname, ".env") instead of relying on process.cwd(). This ensures .env is loaded relative to the config file's location, working correctly whether running from source or compiled code.
examples/2048.ts: Updated to import and use StagehandConfig instead of hardcoding configuration, aligning with other examples in the repo.
evals/deterministic/tests/Errors/apiKeyError.test.ts: Fixed tests that verify error handling when API keys are missing:
- Added beforeAll/afterAll hooks to temporarily clear and restore OPENAI_API_KEY
- Added config overrides (llmClient: undefined, modelClientOptions: undefined) to properly test error conditions
- Ensures tests don't interfere with other test files

test plan

✅ Ran pnpm run example 2048 - example executes successfully without API key errors
✅ Ran pnpm run e2e:local - all 43 tests pass
✅ Specifically tested apiKeyError.test.ts - all 3 tests pass and correctly throw expected errors
✅ Verified afterAll hook restores environment for subsequent test files

# why solves browserbase#1060 patch regression of playwright arguments being removed from agent execute response # what changed agent.execute now returns playwright arguments in its response # test plan tested locally

…ms to docs (browserbase#1065) # why reflect project id changes in docs # what changed advanced configuration comments # test plan reviewed via mintlify on localhost

# why Easier to use for Custom LLM Clients and keep users up to date with our aisdk file # what changed added export of aisdk to lib/index.ts # test plan build local stagehand, import local AISdkClient, run Azure Stagehand session

…onfigu… (browserbase#1073) …ration settings # why Updated docs to match the new fingerprint params in the Browserbase docs here: https://docs.browserbase.com/guides/stealth-customization#customization-options # what changed Update browser configuration docs to reflect the docs changes. # test plan

# why Updating docs to reflect aisdk can be imported directly # what changed The model page # test plan Reviewed page with mintlify dev locally

# why # what changed # test plan

# why Currently, we do not support stagehand agent within the api # what changed When api is enabled, stagehand agent now routes through the api # test plan Tested locally

# why Currently, using playwright screenshot command is not available when the execution environment is Stagehand. A customer has indicated they would prefer to use Playwright's native screenshot command instead of CDP when using Browserbase as CDP screenshot causes unexpected behavior for their target site. # what changed - added a StagehandScreenshotOptions type with useCDP argument added - extended page type to accept custom stagehand screeenshot options - update screenshot proxy to default useCDP to true if the env is browserbase and use playwright screenshot if false - added eval for screenshot with and without cdp # test plan - tested and confirmed functionality with eval and external example script (not committed)

…rowserbase#1057) # why We want to build a best in class agent in stagehand. Therefore, we need more eval benchmarks. # what changed - Added Web-bench evals dataset - Added a subset of OS World evals - those that can be run in a chrome browser (desktop-based tasks omitted) - added LICENSE noticed to the copied evals tasks - Added ground truth / expected result to some WebVoyager tasks using reference_answer.json from Browser Use public evals repo. Improvements to `pnpm run evals -man` to better describe how to run evals. # test plan Evals should run locally and bb for these new benchmarks.

# why Initial instructions didn't mention uv or pip prerequisites and also didn't mention venv. Fix reduces friction on first timers. # what changed - added link to install uv - added details for initializing venv - adjusted code example respectively # test plan docs change

# why - webpage structure changed, needed to update the xpath in the expected locator

… with LanguageModelV1 + LiteLLM works for python (browserbase#1086) # why 1. aisdk not yet available through npm package 2. customLLM provider only works with LanguageModelV1 3. LiteLLM compatible providers are supported in python # what changed 1. change docs to install stagehand from git repo 2. pin versions that use LanguageModelV1 # test plan local test

# why currently we pass stagehand page to agent, this results in our page management having issues when facing new tabs # what changed the stagehand object is now passed instead of stagehandPage # test plan tested locally

# why Our existing screenshot service is a dummy time-based triggered service. It also does not trigger based on any actions of the agent. # what changed Added img hash diff algo (quick check with MSE, verify with SSIM algo) to see if there was an actual UI change and only store ss in the buffer if that is so. Added ss interceptor which copies each screenshot the agent is taking to a buffer (if different enough from the previous ss) to be later used for evals. - There's also a small refactor of the agent initialization config to enable the screenshot collector service to be attached # test plan Tests pass locally --------- Co-authored-by: Miguel <36487034+miguelg719@users.noreply.github.com> Co-authored-by: miguel <miguelg71921@gmail.com>

# why To help make sense of eval test cases and results # what changed Added metadata to eval runs, cleaned deprecated code # test plan

# why # what changed # test plan

# why anthropic released a new sota computer use model # what changed added claude-sonnet-4-5-20250929 as a model to the list # test plan ran evals

…ase#1103) Why Custom AI SDK tools and MCP integrations weren't working properly with Anthropic CUA - parameters were empty {} and tools weren't tracked. What Changed - Convert Zod schemas to JSON Schema before sending to Anthropic (using zodToJsonSchema) - Track custom tool calls in the actions array - Silence "Unknown tool name" warnings for custom tools Test Plan Tested with examples file. Parameters passed correctly ({"city":"San Francisco"} instead of {}) Custom tools execute and appear in actions array No warnings

# why To improve context # what changed Added current page and url to the system prompt # test plan

# why To inform the user throughout the agent execution process # what changed Added logs to tool calls, and on the stagehand agent handler # test plan - [x] tested locally

PR to make clearer the dependencies for `extract` (for those who haven't used zod or pydantic before) --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

# why Adding support for Gemini's new Computer Use model # what changed We partnered with Google Deepmind to help integrate and test their new Computer Use models. <img width="1238" height="655" alt="Screenshot 2025-10-07 at 1 14 44 PM" src="https://github.com/user-attachments/assets/af0d854a-8e55-4937-a071-10335497f686" /> The new model tag `gemini-2.5-pro-computer-use-preview-10-2025` is available for Stagehand Agent. You can try it today with the example `cua-example.ts` To learn more, check out the blog post [https://www.browserbase.com/blog/evaluating-browser-agents](https://www.browserbase.com/blog/evaluating-browser-agents) --------- Co-authored-by: tkattkat <tkat@tkat.net> Co-authored-by: Kylejeong2 <kylejeong21@gmail.com> Co-authored-by: Sameel <sameel.m.arif@gmail.com>

# why # what changed # test plan

@tkattkat

This PR was opened by the [Changesets release](https://github.com/changesets/action) GitHub action. When you're ready to do a release, you can merge this and the packages will be published to npm automatically. If you're not ready to do a release yet, that's fine, whenever you add more changesets to main, this PR will be updated. # Releases ## @browserbasehq/stagehand@2.5.1 ### Patch Changes - [browserbase#1082](browserbase#1082) [`8c0fd01`](browserbase@8c0fd01) Thanks [@tkattkat](https://github.com/tkattkat)! - Pass stagehand object to agent instead of stagehand page - [browserbase#1104](browserbase#1104) [`a1ad06c`](browserbase@a1ad06c) Thanks [@miguelg719](https://github.com/miguelg719)! - Fix logging for stagehand agent - [browserbase#1066](browserbase#1066) [`9daa584`](browserbase@9daa584) Thanks [@tkattkat](https://github.com/tkattkat)! - Add playwright arguments to agent execute response - [browserbase#1077](browserbase#1077) [`7f38b3a`](browserbase@7f38b3a) Thanks [@tkattkat](https://github.com/tkattkat)! - adds support for stagehand agent in the api - [browserbase#1032](browserbase#1032) [`bf2d0e7`](browserbase@bf2d0e7) Thanks [@miguelg719](https://github.com/miguelg719)! - Fix for zod peer dependency support - [browserbase#1014](browserbase#1014) [`6966201`](browserbase@6966201) Thanks [@tkattkat](https://github.com/tkattkat)! - Replace operator handler with base of new agent - [browserbase#1089](browserbase#1089) [`536f366`](browserbase@536f366) Thanks [@miguelg719](https://github.com/miguelg719)! - Fixed info logs on api session create - [browserbase#1103](browserbase#1103) [`889cb6c`](browserbase@889cb6c) Thanks [@tkattkat](https://github.com/tkattkat)! - patch custom tool support in anthropic cua client - [browserbase#1056](browserbase#1056) [`6a002b2`](browserbase@6a002b2) Thanks [@chrisreadsf](https://github.com/chrisreadsf)! - remove need for duplicate project id if already passed to Stagehand - [browserbase#1090](browserbase#1090) [`8ff5c5a`](browserbase@8ff5c5a) Thanks [@miguelg719](https://github.com/miguelg719)! - Improve failed act error logs - [browserbase#1014](browserbase#1014) [`6966201`](browserbase@6966201) Thanks [@tkattkat](https://github.com/tkattkat)! - replace operator agent with scaffold for new stagehand agent - [browserbase#1107](browserbase#1107) [`3ccf335`](browserbase@3ccf335) Thanks [@seanmcguire12](https://github.com/seanmcguire12)! - fix: url extraction not working inside an array - [browserbase#1102](browserbase#1102) [`a99aa48`](browserbase@a99aa48) Thanks [@miguelg719](https://github.com/miguelg719)! - Add current page and date context to agent - [browserbase#1110](browserbase#1110) [`dda52f1`](browserbase@dda52f1) Thanks [@miguelg719](https://github.com/miguelg719)! - Add support for new Gemini Computer Use models ## @browserbasehq/stagehand-evals@1.1.0 ### Minor Changes - [browserbase#1057](browserbase#1057) [`b7be89e`](browserbase@b7be89e) Thanks [@filip-michalsky](https://github.com/filip-michalsky)! - added web voyager ground truth (optional), added web bench, and subset of OSWorld evals which run on a browser ### Patch Changes - [browserbase#1072](browserbase#1072) [`dc2d420`](browserbase@dc2d420) Thanks [@filip-michalsky](https://github.com/filip-michalsky)! - improve evals screenshot service - add img hashing diff to add screenshots and change to screenshot intercepts from the agent - Updated dependencies \[[`8c0fd01`](browserbase@8c0fd01), [`a1ad06c`](browserbase@a1ad06c), [`9daa584`](browserbase@9daa584), [`7f38b3a`](browserbase@7f38b3a), [`bf2d0e7`](browserbase@bf2d0e7), [`6966201`](browserbase@6966201), [`536f366`](browserbase@536f366), [`889cb6c`](browserbase@889cb6c), [`6a002b2`](browserbase@6a002b2), [`8ff5c5a`](browserbase@8ff5c5a), [`6966201`](browserbase@6966201), [`3ccf335`](browserbase@3ccf335), [`a99aa48`](browserbase@a99aa48), [`dda52f1`](browserbase@dda52f1)]: - @browserbasehq/stagehand@2.5.1 ## @browserbasehq/stagehand-examples@1.0.10 ### Patch Changes - Updated dependencies \[[`8c0fd01`](browserbase@8c0fd01), [`a1ad06c`](browserbase@a1ad06c), [`9daa584`](browserbase@9daa584), [`7f38b3a`](browserbase@7f38b3a), [`bf2d0e7`](browserbase@bf2d0e7), [`6966201`](browserbase@6966201), [`536f366`](browserbase@536f366), [`889cb6c`](browserbase@889cb6c), [`6a002b2`](browserbase@6a002b2), [`8ff5c5a`](browserbase@8ff5c5a), [`6966201`](browserbase@6966201), [`3ccf335`](browserbase@3ccf335), [`a99aa48`](browserbase@a99aa48), [`dda52f1`](browserbase@dda52f1)]: - @browserbasehq/stagehand@2.5.1 Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

# why The original example used JavaScript destructuring syntax [table] which doesn't work in Python. Fixed to use proper Python array indexing. # what changed fixed example to proper python syntax # test plan Co-authored-by: Steven Bryan <steven@mac.local.meter>

# why - need to set default viewport when running on browserbase. previously, we only defined the default inside the exported `StagehandConfig` # what changed - set default viewport to 1288 * 711 when running on browserbase # test plan - tested locally, - regression evals

@seanmcguire12

This PR was opened by the [Changesets release](https://github.com/changesets/action) GitHub action. When you're ready to do a release, you can merge this and the packages will be published to npm automatically. If you're not ready to do a release yet, that's fine, whenever you add more changesets to main, this PR will be updated. # Releases ## @browserbasehq/stagehand@2.5.2 ### Patch Changes - [browserbase#1114](browserbase#1114) [`c0fbc51`](browserbase@c0fbc51) Thanks [@seanmcguire12](https://github.com/seanmcguire12)! - configure default viewport when running on browserbase ## @browserbasehq/stagehand-evals@1.1.1 ### Patch Changes - Updated dependencies \[[`c0fbc51`](browserbase@c0fbc51)]: - @browserbasehq/stagehand@2.5.2 ## @browserbasehq/stagehand-examples@1.0.11 ### Patch Changes - Updated dependencies \[[`c0fbc51`](browserbase@c0fbc51)]: - @browserbasehq/stagehand@2.5.2 Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Updated link in the Getting Started section to point to the correct Quickstart Guide. # why Quickstart link in README leads to a non-existent page. <img width="1556" height="763" alt="image" src="https://github.com/user-attachments/assets/20a1a5b5-8534-43b4-89d5-e3a062b3965a" /> # what changed Updated quickstart link in README to the correct quickstart address `https://docs.stagehand.dev/first-steps/quickstart` # test plan Access new link to quickstart

# why currently, for openai cua agent, we are handling keypress actions incorrectly currently, there is no way to pass a custom system prompt to the Google cua agent # what changed - All key actions, are now ran through mapKeyToPlaywright function to ensure we are properly mapping the agents actions to valid playwright keys - Custom system prompts now override the default system prompt for Google Cua agent # test plan tested locally with google & openai cua agents Fixes browserbase#1122

# why currently when using stagehand agent through api, it returns early without executing # what changed we now properly handle the options when none are present # test plan tested locally, and tested across other cua agents to ensure no breaking changes

# why New model dropped # what changed Added support for haiku 4.5 # test plan

…ocs (browserbase#1140) # why We recently shipped updates to our MCP server that should be reflected in the documentation. # what changed update tools list for MCP update, removing mentions of multisession, adding experimental flag + get url tool related PR: browserbase/mcp-server-browserbase#123 # test plan n/a

# why Broken links. # what changed Fixed broken links # test plan This PR. --------- Co-authored-by: GG <guergabo@mac.local.meter>

# why currently, we have no sense of what url an action was taken on and what time it was taken when using agent this is useful to have, because in the dashboard it will allow us to filter the agents actions by url, and display timestamps of the individual actions # what changed - added pageUrl to every action - added timestamp to every action the url is grabbed prior to the action being taken. This is because if we do it after the action is taken, there is a chance the action could have caused a navigation, which would result in the incorrect url for the action # test plan tested locally

# why Make it easier to parse/filter/group evals # what changed Evals tagged with more granular metadata and error parsing # test plan --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

# why - there was previously no logger parameter in the external aisdk client # what changed - added logger param # test plan - evals, - install from alpha & smoke test --------- Co-authored-by: Miguel <36487034+miguelg719@users.noreply.github.com>

**Summary** This PR fixes the default viewport mentioned in our docs.

- Update stagehand.config.ts to use __dirname for reliable .env path - Update 2048 example to use StagehandConfig - Fix API key error tests to properly test without env keys - Add beforeAll/afterAll hooks to manage test environment isolation

changeset-bot · 2025-10-22T17:28:51Z

⚠️ No Changeset found

Latest commit: a2ad8d0

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

greptile-apps

Greptile Overview

Summary

Fixed environment variable loading in examples and tests by updating stagehand.config.ts to use path.resolve(__dirname, ".env") instead of relying on process.cwd(), ensuring .env is loaded correctly regardless of execution context.

Key Changes:

stagehand.config.ts: Changed dotenv.config() to use __dirname-based path resolution for reliable .env loading
examples/2048.ts: Updated to import and use shared StagehandConfig for consistency
evals/deterministic/tests/Errors/apiKeyError.test.ts: Added beforeAll/afterAll hooks and explicit config overrides (llmClient: undefined, modelClientOptions: undefined) to properly isolate tests and verify error handling

Impact:
Examples now run successfully from any directory without "OpenAI API key is missing" errors, and API key error tests are properly isolated and won't interfere with other test files.

Confidence Score: 5/5

This PR is safe to merge with minimal risk
The changes are well-tested (43 tests passing), address a clear bug with a straightforward fix, and improve test isolation without introducing new dependencies or complex logic
No files require special attention

Important Files Changed

File Analysis

Filename	Score	Overview
stagehand.config.ts	5/5	Fixed dotenv path resolution to use `__dirname` instead of `process.cwd()` for reliable `.env` loading
examples/2048.ts	5/5	Updated to use shared `StagehandConfig` for consistency with other examples
evals/deterministic/tests/Errors/apiKeyError.test.ts	5/5	Added proper test isolation with `beforeAll`/`afterAll` hooks and explicit config overrides to test error conditions

Sequence Diagram

sequenceDiagram
    participant User
    participant ExampleScript as examples/2048.ts
    participant Config as stagehand.config.ts
    participant Dotenv as dotenv
    participant FS as File System
    
    User->>ExampleScript: pnpm run example 2048
    Note over User,ExampleScript: CWD = examples/
    
    ExampleScript->>Config: import StagehandConfig
    activate Config
    
    Config->>Dotenv: dotenv.config({ path: path.resolve(__dirname, ".env") })
    Note over Config,Dotenv: __dirname points to project root<br/>(where config file is located)
    
    Dotenv->>FS: Read .env from project root
    FS-->>Dotenv: Environment variables loaded
    
    Dotenv-->>Config: Success
    Config-->>ExampleScript: Config with OPENAI_API_KEY
    deactivate Config
    
    ExampleScript->>ExampleScript: new Stagehand({ ...StagehandConfig })
    Note over ExampleScript: API key properly loaded

_{3 files reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

tkattkat and others added 30 commits September 10, 2025 13:40

add playwright arguments to agent (browserbase#1066)

9daa584

# why solves browserbase#1060 patch regression of playwright arguments being removed from agent execute response # what changed agent.execute now returns playwright arguments in its response # test plan tested locally

[docs] add info on not needing project id in browserbase session para…

f6f05b0

…ms to docs (browserbase#1065) # why reflect project id changes in docs # what changed advanced configuration comments # test plan reviewed via mintlify on localhost

Export aisdk (browserbase#1058)

c886544

# why Easier to use for Custom LLM Clients and keep users up to date with our aisdk file # what changed added export of aisdk to lib/index.ts # test plan build local stagehand, import local AISdkClient, run Azure Stagehand session

[docs] export aisdk (browserbase#1074)

3c39a05

# why Updating docs to reflect aisdk can be imported directly # what changed The model page # test plan Reviewed page with mintlify dev locally

Fix zod peer dependency support (browserbase#1032)

bf2d0e7

# why # what changed # test plan

add stagehand agent to api (browserbase#1077)

7f38b3a

# why Currently, we do not support stagehand agent within the api # what changed When api is enabled, stagehand agent now routes through the api # test plan Tested locally

update xpath in observe_vantechjournal (browserbase#1088)

b9c8102

# why - webpage structure changed, needed to update the xpath in the expected locator

Fix session create logs on api (browserbase#1089)

536f366

Improve failed act logs (browserbase#1090)

8ff5c5a

Eval metadata (browserbase#1092)

f89b13e

# why To help make sense of eval test cases and results # what changed Added metadata to eval runs, cleaned deprecated code # test plan

update evals cli docs (browserbase#1096)

108de3c

# why # what changed # test plan

adding support for new claude 4.5 sonnet agent model (browserbase#1099)

e0e6b30

# why anthropic released a new sota computer use model # what changed added claude-sonnet-4-5-20250929 as a model to the list # test plan ran evals

Add current date and page url to agent context (browserbase#1102)

a99aa48

# why To improve context # what changed Added current page and url to the system prompt # test plan

Additional agent logging (browserbase#1104)

a1ad06c

# why To inform the user throughout the agent execution process # what changed Added logs to tool calls, and on the stagehand agent handler # test plan - [x] tested locally

Include import statements in extract code examples (browserbase#1105)

0791404

PR to make clearer the dependencies for `extract` (for those who haven't used zod or pydantic before) --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

google cua docs (browserbase#1111)

9a29937

# why # what changed # test plan

renl and others added 11 commits October 9, 2025 17:18

Add Haiku 4.5 computer use support (browserbase#1137)

2dbac99

# why New model dropped # what changed Added support for haiku 4.5 # test plan

Fixing broken links (browserbase#1142)

438f5af

# why Broken links. # what changed Fixed broken links # test plan This PR. --------- Co-authored-by: GG <guergabo@mac.local.meter>

update evals (browserbase#1139)

9afc0a8

# why Make it easier to parse/filter/group evals # what changed Evals tagged with more granular metadata and error parsing # test plan --------- Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

[Docs] Fix default viewport info in docs (browserbase#1150)

c826da5

**Summary** This PR fixes the default viewport mentioned in our docs.

greptile-apps bot reviewed Oct 22, 2025

View reviewed changes

miguelg719 force-pushed the main branch from 4994eab to bd0a799 Compare October 29, 2025 16:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: resolve dotenv loading in examples and tests #1155

fix: resolve dotenv loading in examples and tests #1155

Uh oh!

shamsharoon commented Oct 22, 2025 •

edited

Loading

Uh oh!

changeset-bot bot commented Oct 22, 2025

Uh oh!

greptile-apps bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants

fix: resolve dotenv loading in examples and tests #1155

Are you sure you want to change the base?

fix: resolve dotenv loading in examples and tests #1155

Uh oh!

Conversation

shamsharoon commented Oct 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

why

what changed

test plan

Uh oh!

changeset-bot bot commented Oct 22, 2025

⚠️ No Changeset found

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Greptile Overview

Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

13 participants

shamsharoon commented Oct 22, 2025 •

edited

Loading