Update_testing #15

acere · 2025-08-26T15:31:34Z

Apply ruff formatting and fix linting issues across codebase
Fix unused variable assignments in tests
Update import re-exports in endpoints/init.py
Use tmp_path fixture for test file generation to prevent repository pollution
Add test_file.json* to .gitignore to exclude generated test files

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

- Improve LiteLLM endpoint implementation with additional features - Add comprehensive test suite for LiteLLM endpoint (499 lines) - Expand experiments and plotting functionality - Enhance test coverage across runner, experiments, and plotting modules - Update utility functions and MLflow callback tests

- Apply ruff formatting and fix linting issues across codebase - Fix unused variable assignments in tests - Update import re-exports in endpoints/__init__.py - Use tmp_path fixture for test file generation to prevent repository pollution - Add test_file.json* to .gitignore to exclude generated test files

athewsey · 2025-09-18T08:59:21Z

(Rebased following merge of the small fix in #13)

athewsey

I'm seeing some test failures on this since I merged #13 (😅) but hopefully these should be pretty isolated & easy to fix with just expecting write_html instead.

A few questions raised which it'd be good to address, but apart from those it's looking good IMO

athewsey · 2025-09-18T09:17:49Z

llmeter/utils.py

+        else:
+            # If it's not a BaseException, wrap it in an ImportError
+            self.exc = ImportError(str(exception))


Although we use it only for ImportErrors so far, this class was originally written to be generic. The need for this if/else is arising from inconsistent usage across LLMeter:

In some cases e.g. plotly in plotting.py, we use it as originally documented - to store a ModuleNotFoundError/ImportError.

In others e.g. kaleido in plotting.py, we pass in a message string instead.

My asks would probably be to:

Standardize our usage one way or the other, unless we have good reasons not to, and

If we still want to support both string and error in here, then either

Create a more generic base class like Exception in the else clause, OR

Change the name and docstring of this DeferredError (e.g. DeferredImportError?) to indicate that it's only intended for import errors.

athewsey · 2025-09-18T09:19:39Z

tests/__init__.py

@@ -1,0 +1,2 @@
+# Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
+# SPDX-License-Identifier: Apache-2.0


For me, ruff format picks up & adds the missing trailing newline in this and tests/endpoint/__init__.py... Does it not for you?

athewsey · 2025-09-18T09:21:48Z

tests/callbacks/cost/providers/test_sagemaker.py

-from unittest.mock import call, Mock, patch
+from unittest.mock import Mock, call, patch

 # External Dependencies:
 import boto3
-from moto import mock_aws
 import pytest
+from moto import mock_aws


Any particular reason to un-alphabetize these imports? The call vs Mock one I could understand depending how the formatter treats different cases, but if I revert these changes Ruff doesn't ask to apply either of them in my environment?

athewsey · 2025-09-18T09:30:40Z

llmeter/endpoints/litellm.py

-            pass
+            response.num_tokens_input = None
+            response.num_tokens_output = None


Looking at the InvocationResponse definition, these fields should default to None anyway. Why do we need to set them here?

athewsey · 2025-09-18T09:36:41Z

llmeter/endpoints/litellm.py

-            return InvocationResponse.error_output(
-                id=uuid4().hex, error=str(e), input_prompt=self._parse_payload(payload)
+            response = InvocationResponse.error_output(
+                input_payload=payload, error=e, id=uuid4().hex


Looks like runner.py, endpoints/bedrock.py, endpoints/openai.py, and endpoints/sagemaker.py all still have some cases using error=str(e) - do we care enough to fix that consistently and add an Exception | str | None type annotation to InvocationResponse.error_output()'s definition?

athewsey · 2025-09-18T09:41:21Z

llmeter/endpoints/litellm.py

+            existing_options = kwargs.get("stream_options", {})
+            payload_copy["stream_options"] = {**existing_options, "include_usage": True}


Should we merge in case there are some stream_options defined in payload as well as the kwargs? Currently we're supporting passing in either way, but not mixing.

Could be e.g.

existing_kwargs_options = kwargs["stream_options"] existing_payload_options = payload_copy.get("stream_options", {}) payload_copy["stream_options"] = { **existing_payload_options, **existing_kwargs_options, "include_usage": True, }

acere requested a review from athewsey August 26, 2025 15:31

acere added 4 commits September 18, 2025 16:58

Update test files and add new test modules

9851d74

Update Bedrock endpoint tests

093da4f

athewsey force-pushed the update_testing branch from 7aff0bc to 093da4f Compare September 18, 2025 08:58

athewsey reviewed Sep 18, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update_testing #15

Update_testing #15

Uh oh!

acere commented Aug 26, 2025

Uh oh!

athewsey commented Sep 18, 2025

Uh oh!

athewsey left a comment

Uh oh!

athewsey Sep 18, 2025

Uh oh!

athewsey Sep 18, 2025

Uh oh!

athewsey Sep 18, 2025

Uh oh!

athewsey Sep 18, 2025

Uh oh!

athewsey Sep 18, 2025

Uh oh!

athewsey Sep 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		@@ -1,0 +1,2 @@
		# Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
		# SPDX-License-Identifier: Apache-2.0

		existing_options = kwargs.get("stream_options", {})
		payload_copy["stream_options"] = {**existing_options, "include_usage": True}

Update_testing #15

Are you sure you want to change the base?

Update_testing #15

Uh oh!

Conversation

acere commented Aug 26, 2025

Uh oh!

athewsey commented Sep 18, 2025

Uh oh!

athewsey left a comment

Choose a reason for hiding this comment

Uh oh!

athewsey Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

athewsey Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

athewsey Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

athewsey Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

athewsey Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

athewsey Sep 18, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants