[Debugger Visualizers] Optimize lookup behavior #147552

Walnut356 · 2025-10-10T11:11:23Z

Background

Almost all of the commands in lldb_commands used a regex to associate a type with the synthetic_lookup and summary_lookup python functions. When looking up a type, LLDB iterates through the commands in reverse order (so that new commands can overwrite old ones), stopping when it finds a match. These lookups are cached, but it's a shallow cache (e.g. when Vec<T> is matched by lldb, it will always point to synthetic_lookup, NOT the result of synthetic_lookup which would be StdVecSyntheticProvider).

This becomes a problem because within synthetic_lookup and summary_lookup we run classify_rust_type which checks exact same regexes again. This causes 2 issues:

running the regexes via lldb commands is even more of a waste because the final check is a .* regex that associates with synthetic_lookup anyway
Every time lldb wants to display a value, that value must run the entirety of synthetic_lookup and run its type through 19 regexes + some assorted checks every single time. Those checks take between 1 and 100 microseconds depending on the type.

On a 10,000 element Vec<i32> (which bypasses classify_struct and therefore the 19 regexes), ~30 milliseconds are spent on classify_rust_type. For a 10,000 element Vec<UserDefinedStruct> that jumps up to ~350 milliseconds.

The salt on the wound is that some of those 19 regexes are useless (BTreeMap and BTreeSet which don't even have synthetic/summary providers so it doesn't matter if we know what type it is), and then the results of that lookup function use string-comparisons in a giant if...elif...elif chain.

Solution

To fix all of that, the lldb_commands now point directly to their appropriate synthetic/summary when possible. In cases where there was extra logic, streamlined functions have been added that have much fewer types being passed in, thus only need to do one or two simple checks (e.g. classify_hashmap and classify_hashset).

Some of the lldb_commands regexes were also consolidated to reduce the total number of commands we pass to lldb (e.g. NonZero

An extra upshot is that summary_lookup could be completely removed due to being redundant.

rustbot · 2025-10-10T11:11:28Z

r? @Mark-Simulacrum

rustbot has assigned @Mark-Simulacrum.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

Zalathar · 2025-10-10T11:53:57Z

While I'm not comfortable reviewing this myself (and it might be tricky to find someone who is), I do want to at least say thanks for working on it.

Walnut356 · 2025-10-13T05:31:13Z

If it helps the review any, this is more or less how it should have been written from the start. The type synthetic add -l technically expects a python class (-l is short for --python-class), it's sortof a hack that we exploit LLDB not being able to differentiate between an initializer function and a flat function. It's nice because we can check extra bits of the value without callback-based type matching (which was only introduced in lldb 19.0), but I'm not sure why it was written the way it was in cases like Vec<T> that are unambiguous.

The MSVC providers I added a while ago already point directly to the initializer rather than taking a trip through synthetic_lookup and summary_lookup first. This patch just applies that to all of the commands because it's significantly faster.

There shouldn't be any actual changes for end-users (aside from IndirectionSyntheticProvider fixing a minor bug with pointer types).

Mark-Simulacrum · 2025-11-08T18:29:48Z

r=me if this is still ready to go (not sure if other changes have landed in the last month that make a rebase make sense). I think we can land this and revert/revisit if it runs into issues, the description makes sense to me and the changes seem reasonable.

jieyouxu

Changes also look sensible to me. Given that at worst we can revert, let's merge this.

View changes since this review

jieyouxu · 2025-11-12T00:54:59Z

@bors r=Mark-Simulacrum,jieyouxu rollup

bors · 2025-11-12T00:55:02Z

📌 Commit 2e8e618 has been approved by Mark-Simulacrum,jieyouxu

It is now in the queue for this repository.

jieyouxu · 2025-11-12T00:56:30Z

Uh actually @Walnut356, this doesn't need a rebase or anything, right? If not, please r= us
@bors r-
@bors delegate+

bors · 2025-11-12T00:56:35Z

✌️ @Walnut356, you can now approve this pull request!

If @jieyouxu told you to "r=me" after making some further change, please make that change, then do @bors r=@jieyouxu

Walnut356 · 2025-11-15T08:26:40Z

@bors r=@jieyouxu

bors · 2025-11-15T08:26:43Z

📌 Commit 2e8e618 has been approved by jieyouxu

It is now in the queue for this repository.

jieyouxu · 2025-11-15T08:32:15Z

@bors r=Mark-Simulacrum,jieyouxu

bors · 2025-11-15T08:32:18Z

💡 This pull request was already approved, no need to approve it again.

bors · 2025-11-15T08:32:19Z

📌 Commit 2e8e618 has been approved by Mark-Simulacrum,jieyouxu

It is now in the queue for this repository.

bors · 2025-11-15T08:33:35Z

⌛ Testing commit 2e8e618 with merge 3d00472...

[Debugger Visualizers] Optimize lookup behavior # Background Almost all of the commands in `lldb_commands` used a regex to associate a type with the `synthetic_lookup` and `summary_lookup` python functions. When looking up a type, LLDB iterates through the commands in reverse order (so that new commands can overwrite old ones), stopping when it finds a match. These lookups are cached, but it's a shallow cache (e.g. when `Vec<T>` is matched by lldb, it will always point to `synthetic_lookup`, NOT the result of `synthetic_lookup` which would be `StdVecSyntheticProvider`). This becomes a problem because within `synthetic_lookup` and `summary_lookup` we run `classify_rust_type` which checks exact same regexes again. This causes 2 issues: 1. running the regexes via lldb commands is even more of a waste because the final check is a `.*` regex that associates with `synthetic_lookup` anyway 2. Every time lldb wants to display a value, that value must run the entirety of `synthetic_lookup` and run its type through 19 regexes + some assorted checks every single time. Those checks take between 1 and 100 microseconds depending on the type. On a 10,000 element `Vec<i32>` (which bypasses `classify_struct` and therefore the 19 regexes), ~30 milliseconds are spent on `classify_rust_type`. For a 10,000 element `Vec<UserDefinedStruct>` that jumps up to ~350 milliseconds. The salt on the wound is that some of those 19 regexes are useless (`BTreeMap` and `BTreeSet` which don't even have synthetic/summary providers so it doesn't matter if we know what type it is), and then the results of that lookup function use string-comparisons in a giant `if...elif...elif` chain. # Solution To fix all of that, the `lldb_commands` now point directly to their appropriate synthetic/summary when possible. In cases where there was extra logic, streamlined functions have been added that have much fewer types being passed in, thus only need to do one or two simple checks (e.g. `classify_hashmap` and `classify_hashset`). Some of the `lldb_commands` regexes were also consolidated to reduce the total number of commands we pass to lldb (e.g. `NonZero` An extra upshot is that `summary_lookup` could be completely removed due to being redundant.

bors · 2025-11-15T09:38:55Z

💔 Test failed - checks-actions

Walnut356 · 2025-11-15T10:26:11Z

That is_msvc check needed to be changed anyway because SBProcess.GetTriple doesn't work quite how we need it to, but it's pretty crazy that it can return None lol

rustbot · 2025-11-15T10:28:20Z

This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

Zalathar · 2025-11-15T10:39:14Z

@bors try jobs=aarch64-apple

[Debugger Visualizers] Optimize lookup behavior try-job: aarch64-apple

rust-bors · 2025-11-15T12:45:15Z

☀️ Try build successful (CI)
Build commit: c4dc492 (c4dc4926c5197ff47b02db345958f28abcc6afe6, parent: 733108b6d4acaa93fe26ae281ea305aacd6aac4e)

bors · 2025-11-20T09:45:17Z

☔ The latest upstream changes (presumably #89917) made this pull request unmergeable. Please resolve the merge conflicts.

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 10, 2025

rustbot assigned Mark-Simulacrum Oct 10, 2025

jieyouxu self-assigned this Oct 10, 2025

Walnut356 mentioned this pull request Oct 11, 2025

[Debuginfo] improve enum value formatting in LLDB for better readability #145218

Merged

jieyouxu removed their assignment Oct 11, 2025

jieyouxu approved these changes Nov 12, 2025

View reviewed changes

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 12, 2025

bors added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Nov 12, 2025

Walnut356 added 3 commits November 15, 2025 02:25

change RustType from string literals to enum

41b3d48

remove unnecessary regex usage

144046a

use direct providers when possible

989abcd

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Nov 15, 2025

This comment has been minimized.

Sign in to view

bors added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. labels Nov 15, 2025

Walnut356 force-pushed the cleanup branch from 2e8e618 to 989abcd Compare November 15, 2025 10:28

fix tests

0e5475a

rustbot added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Nov 15, 2025

rust-bors bot added a commit that referenced this pull request Nov 15, 2025

Auto merge of #147552 - Walnut356:cleanup, r=<try>

c4dc492

[Debugger Visualizers] Optimize lookup behavior try-job: aarch64-apple

This comment has been minimized.

Sign in to view

Mark-Simulacrum added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Nov 15, 2025

[Debugger Visualizers] Optimize lookup behavior #147552

Are you sure you want to change the base?

[Debugger Visualizers] Optimize lookup behavior #147552

Uh oh!

Conversation

Walnut356 commented Oct 10, 2025

Background

Solution

Uh oh!

rustbot commented Oct 10, 2025

Uh oh!

Zalathar commented Oct 10, 2025

Uh oh!

Walnut356 commented Oct 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Mark-Simulacrum commented Nov 8, 2025

Uh oh!

jieyouxu left a comment • edited by rustbot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jieyouxu commented Nov 12, 2025

Uh oh!

bors commented Nov 12, 2025

Uh oh!

jieyouxu commented Nov 12, 2025

Uh oh!

bors commented Nov 12, 2025

Uh oh!

Walnut356 commented Nov 15, 2025

Uh oh!

bors commented Nov 15, 2025

Uh oh!

jieyouxu commented Nov 15, 2025

Uh oh!

bors commented Nov 15, 2025

Uh oh!

bors commented Nov 15, 2025

Uh oh!

bors commented Nov 15, 2025

Uh oh!

This comment has been minimized.

bors commented Nov 15, 2025

Uh oh!

Walnut356 commented Nov 15, 2025

Uh oh!

rustbot commented Nov 15, 2025

Uh oh!

Zalathar commented Nov 15, 2025

Uh oh!

This comment has been minimized.

rust-bors bot commented Nov 15, 2025

Uh oh!

bors commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Walnut356 commented Oct 13, 2025 •

edited

Loading

jieyouxu left a comment •

edited by rustbot

Loading