fix(tracing-limit): group `info!(message = "foo")` and `info!("foo")` in same bucket #24077

WaterWhisperer · 2025-10-26T09:53:14Z

Summary

Previously, these two equivalent log formats were treated as separate rate limit groups due to different callsite identifiers. Now, rate limiting is based on message content and contextual fields (like component_id) rather than callsite.

Vector configuration

N/A - This is an internal library change to the tracing-limit crate. No Vector configuration is required for testing.

How did you test this PR?

Added a new test message_field_explicit_vs_implicit_same_bucket that verifies both info!(message = "Hello") and info!("Hello") are grouped under the same rate limit bucket
All 9 existing tests in the tracing-limit crate pass
Ran cargo test -p tracing-limit to verify no regressions
Ran cargo clippy -p tracing-limit with no warnings

Change Type

Bug fix
New feature
Non-functional (chore, refactoring, docs)
Performance

Is this a breaking change?

Yes
No

Does this PR include user facing changes?

Yes. Please add a changelog fragment based on our guidelines.
No. A maintainer will apply the no-changelog label to this PR.

References

Closes: bug(tracing_limit): info!(message = "foo") and info!("foo") are not grouped under the same bucket #24054

Notes

Please read our Vector contributor resources.
Do not hesitate to use @vectordotdev/vector to reach out to us regarding this PR.
Some CI checks run only after we manually approve them.
- We recommend adding a pre-push hook, please see this template.
- Alternatively, we recommend running the following locally before pushing to the remote branch:
  - make fmt
  - make check-clippy (if there are failures it's possible some of them can be fixed with make clippy-fix)
  - make test
After a review is requested, please avoid force pushes to help us review incrementally.
- Feel free to push as many commits as you want. They will be squashed into one before merging.
- For example, you can run git merge origin master and git push.
If this PR introduces changes Vector dependencies (modifies Cargo.lock), please
run make build-licenses to regenerate the license inventory and commit the changes (if any). More details here.

WaterWhisperer · 2025-11-18T08:13:27Z

Hi @thomasqueirozb ,

I noticed that this PR needs some CI checks to be approved. .

Just wanted to check is there anything needs to be modified? I'm happy to rebase on the current main branch if needed

Thanks for your time and feedback!

pront · 2025-11-18T16:13:31Z

Hi @thomasqueirozb ,

I noticed that this PR needs some CI checks to be approved. .

Just wanted to check is there anything needs to be modified? I'm happy to rebase on the current main branch if needed

Thanks for your time and feedback!

Hi @WaterWhisperer, please resolve any merge conflicts. We recently made changes to this library.

thomasqueirozb

Thanks for this!

I looked at this PR before and thought it needed some modifications but looking closely into it, it seems that everything is right. I'm just holding off on an approval because some further discussion is needed but everything looks good

thomasqueirozb · 2025-11-18T18:47:24Z

lib/tracing-limit/src/lib.rs

+        match field.name() {
+            COMPONENT_ID_FIELD => self.component_id = Some(value),
+            MESSAGE_FIELD => self.message = Some(value),
+            _ => {}


I'm assuming that field.name() matches MESSAGE_FIELD both when info!("a") and info!(message="a"). Is this correct? I looked into tracing's code and it looks like info!("a") ends up being the same as info!(message="a") after a bunch of macro magic happens but I haven't verified this by testing it myself

I also have the same idea, that info!("a") and info!(message="a") are identical meaning that they can be using interchangeably.

thomasqueirozb · 2025-11-18T18:54:56Z

lib/tracing-limit/src/lib.rs

    component_id: Option<TraceValue>,
+    message: Option<TraceValue>,


Should we hash this to decrease memory usage? I think we'd run into issues of memory usage vs efficiency.

I'm worried about memory usage. With this change we'd now store the message in RateLimitedSpanKeys. Currently only component_id is stored and that isn't used in many places. Not sure if this is going to significantly impact us or not.

cc @pront

This is a valid concern. Is there any upper bound on how many of these we have to store at a give point in time? Are ever they removed?

Regarding hashing, that introduces new complexity and it's slower. So I would like us to understand this area better and also do some benchmarking too.

Thanks for the review!

I've just rebased on the latest master to resolve the merge conflicts.

Regarding the memory usage concern: I understand that storing the full message string might increase memory usage. Since I'm relatively new to this codebase, I opted for the most straightforward solution first. If you think hashing the message or another optimization is necessary right now, I'd be happy to try implementing it with some guidance. Otherwise, I'm open to benchmarking if you have a preferred way to do that.

pront · 2025-11-18T22:37:38Z

Also @WaterWhisperer, does this affect your pipelines in production? Trying to understand a bit better the motivation behind this solution.

… in same bucket

WaterWhisperer · 2025-11-19T14:19:51Z

Also @WaterWhisperer, does this affect your pipelines in production? Trying to understand a bit better the motivation behind this solution.

@pront, thanks for asking!

To be honest, I'm not running this in a production environment yet. I'm a Rust enthusiast and a new contributor looking to improve my skills by solving issues in open-source projects. I found this issue interesting because the behavior of info!("msg") and info!(message="msg") being treated differently felt inconsistent.

I hope this fix helps make vector's internal logging more predictable!

WaterWhisperer requested a review from a team as a code owner October 26, 2025 09:53

pront assigned thomasqueirozb Oct 27, 2025

thomasqueirozb added the no-changelog Changes in this PR do not need user-facing explanations in the release changelog label Oct 27, 2025

thomasqueirozb reviewed Nov 18, 2025

View reviewed changes

fix(tracing-limit): group info!(message = "foo") and info!("foo")…

c893a7f

… in same bucket

WaterWhisperer force-pushed the fix-tracing-limit branch from f10d37a to c893a7f Compare November 19, 2025 14:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(tracing-limit): group `info!(message = "foo")` and `info!("foo")` in same bucket #24077

fix(tracing-limit): group `info!(message = "foo")` and `info!("foo")` in same bucket #24077

WaterWhisperer commented Oct 26, 2025

Uh oh!

WaterWhisperer commented Nov 18, 2025

Uh oh!

pront commented Nov 18, 2025

Uh oh!

thomasqueirozb left a comment

Uh oh!

thomasqueirozb Nov 18, 2025

Uh oh!

pront Nov 18, 2025

Uh oh!

thomasqueirozb Nov 18, 2025

Uh oh!

pront Nov 18, 2025

Uh oh!

WaterWhisperer Nov 19, 2025 •

edited

Loading

Uh oh!

pront commented Nov 18, 2025

Uh oh!

WaterWhisperer commented Nov 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		component_id: Option<TraceValue>,
		message: Option<TraceValue>,

fix(tracing-limit): group info!(message = "foo") and info!("foo") in same bucket #24077

Are you sure you want to change the base?

fix(tracing-limit): group info!(message = "foo") and info!("foo") in same bucket #24077

Conversation

WaterWhisperer commented Oct 26, 2025

Summary

Vector configuration

How did you test this PR?

Change Type

Is this a breaking change?

Does this PR include user facing changes?

References

Notes

Uh oh!

WaterWhisperer commented Nov 18, 2025

Uh oh!

pront commented Nov 18, 2025

Uh oh!

thomasqueirozb left a comment

Choose a reason for hiding this comment

Uh oh!

thomasqueirozb Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

pront Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

thomasqueirozb Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

pront Nov 18, 2025

Choose a reason for hiding this comment

Uh oh!

WaterWhisperer Nov 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pront commented Nov 18, 2025

Uh oh!

WaterWhisperer commented Nov 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix(tracing-limit): group `info!(message = "foo")` and `info!("foo")` in same bucket #24077

fix(tracing-limit): group `info!(message = "foo")` and `info!("foo")` in same bucket #24077

WaterWhisperer Nov 19, 2025 •

edited

Loading