proc-macro-srv: Reimplement token trees via immutable trees #21097
base: master
Conversation
This is basically finished, but I realized we have a severe lack of tests here, so I'll add some more thorough testing later.
Shourya742
left a comment
LGTM. Just some questions.
let id = self.token_id_of(punct.span);
self.punct.push(PunctRepr {
    char: punct.ch as char,
    spacing: if punct.joint { tt::Spacing::Joint } else { tt::Spacing::Alone },
Curious, why don't we encode the JointHidden variant?
JointHidden becomes Alone after a proc-macro roundtrip, so this preserves that logic. The "hidden" part effectively means it's hidden from proc-macros in this sense.
span: proc_macro_srv::DelimSpan {
    open: read_span(repr.open),
    close: read_span(repr.close),
// FIXME
Shouldn’t Reader and Writer really be restricted to a SpanTransformer<Table = SpanDataIndexMap, Span = Span>? That’s the only setup that actually gives us the full span information needed to reconstruct things like entire, right?
I am also curious why SpanTransformer is even implemented for SpanId in the first place?
Our macro setup doesn't really support entire at all, and afaik it's not really used upstream (in rustc) anymore either, so it might be removed in the future.
As for SpanId, it's the legacy span type. r-a doesn't use it anymore, but we keep it for backwards compatibility with RustRover (and if we ever need to break things, we can always fall back to that mode).
ChayimFriedman2
left a comment
Mostly skimmed, but LGTM.
For macros, Rust has the concept of TokenTrees and TokenStreams, which are basically a tree-like form of the underlying tokenized input, with (){}[] delimiters forming Groups (internal nodes) and all other tokens forming leaf nodes of the tree. In rust-analyzer today these trees are implemented as a flattened list using indices to reduce their memory impact, as we keep a lot of them memoized (implementation). This comes at the cost of mutating operations being more expensive, though, and we especially feel this in proc-macro expansion.
As it turns out, one of the primitive and very commonly used operations on token streams in proc-macros is concatenation, an operation that with our model is notoriously expensive, as we need to do a deep clone of the first stream, append the second stream, and rewrite the internal indices. Combine this with deeply recursive concatenations, as is common with the quote! macro, and we run into issues where the proc-macro server can take 8 seconds to expand the 8 derive invocations of num_derive::FromPrimitive on decent machines. Fortunately, the proc-macro server has no reason to use the same implementation of TokenStreams and TokenTrees as rust-analyzer, since it doesn't persist them in memory anyway. So instead, to gain good concatenation performance, we can do what rustc does and store them as immutable trees with shared internal nodes.
Before this change, cache priming took 56 seconds on Zed's codebase; with the new proc-macro server it now takes 45 seconds.