Description
I've been looking at ways to reduce RA memory use, because for my use case it's regularly using >80GB! It has to dip into swap and performance tanks.
Right now, rust-analyzer always uses `GreenNodeBuilder::default()`, which creates a new `NodeCache` for every file/text section.
I ran into issues implementing it the "right" way because I couldn't figure out how to get a `NodeCache` into the database methods (such as `base-db:lib.rs:parse`).
So, to run some quick benchmarks just to see if it's worth it, I implemented cache sharing the "wrong" way (using a global static mutex).
This gave a modest performance increase, and a substantial reduction in peak memory.
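For illustration, the "wrong way" can be sketched with std types only. `intern_token` and the tuple token type below are hypothetical stand-ins for rowan's green tokens, not its real API; the actual change would hand a shared `NodeCache` to something like `GreenNodeBuilder::with_cache` instead of `default()`:

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex, OnceLock};

// Stand-in for an interned green token: (kind, text) behind a shared Arc.
type Token = Arc<(u16, String)>;

// Process-wide cache behind a static mutex ("wrong way" from the text).
// In rust-analyzer this would be rowan's NodeCache, shared across parses.
static CACHE: OnceLock<Mutex<HashMap<(u16, String), Token>>> = OnceLock::new();

fn intern_token(kind: u16, text: &str) -> Token {
    let cache = CACHE.get_or_init(|| Mutex::new(HashMap::new()));
    let mut map = cache.lock().unwrap();
    map.entry((kind, text.to_string()))
        .or_insert_with(|| Arc::new((kind, text.to_string())))
        .clone()
}

fn main() {
    let a = intern_token(1, "fn");
    let b = intern_token(1, "fn"); // cache hit: same allocation reused
    assert!(Arc::ptr_eq(&a, &b));
    // One Arc in the cache plus two handles here.
    println!("shared: {}", Arc::strong_count(&a));
}
```

The mutex is the cost of the "wrong way": every parse serializes on one lock, which is why threading a cache through the database methods would be the cleaner fix.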
I also tried:
- Interning green nodes when they are "mutated" with methods such as `splice_children`. This seems promising for continuous running: cache hit rate for mutations was 45-60% depending on the repo, and it didn't cause noticeable perf degradation.
- Disabling or tweaking the `>3 children` NodeCache heuristic. Disabling gave another big win in peak memory but cost a lot of performance; tweaking is better.
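To make both experiments concrete, here is a minimal sketch with a hypothetical `Node`/`Cache` (not rowan's real types) of interning rebuilt nodes through a cache that skips nodes above a child-count cutoff:

```rust
use std::collections::HashMap;
use std::sync::Arc;

// Illustrative green node: a kind plus children (leaf text elided).
#[derive(Hash, PartialEq, Eq, Clone)]
struct Node {
    kind: u16,
    children: Vec<Arc<Node>>,
}

// Heuristic cutoff: only small nodes are worth hashing and interning,
// since large nodes rarely repeat. Raising this to 4 is the "Heuristic 4"
// column in the tables below.
const MAX_CACHED_CHILDREN: usize = 3;

struct Cache(HashMap<Node, Arc<Node>>);

impl Cache {
    // Intern a rebuilt node (e.g. after a splice_children-style mutation):
    // reuse an identical existing allocation if present, else allocate.
    fn intern(&mut self, node: Node) -> Arc<Node> {
        if node.children.len() > MAX_CACHED_CHILDREN {
            return Arc::new(node); // too big: skip the cache entirely
        }
        self.0
            .entry(node.clone())
            .or_insert_with(|| Arc::new(node))
            .clone()
    }
}
```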
Running rust-analyzer analysis-stats on itself in release mode:
| Metric | Baseline | Global Cache | Global Cache Without Heuristic | Global Cache With Heuristic 4 |
|---|---|---|---|---|
| Peak Memory | 4675.54MB | 4083.73MB (-12.7%) | 3739.20MB (-20.0%) | 3930.96MB (-16%) |
| Idle Memory (post LRU clear) | 2908MB | 2869MB (-1.3%) | 2863MB (-1.5%) | 2867MB (-1.4%) |
| `NodeCache::node` hit rate | 44.27% | 72.51% | 83.60% | 76.62% |
| `NodeCache::token` hit rate | 87.40% | 98.38% | 98.38% | 98.38% |
| Runtime | 87.37s | 82.05s (-6.1%) | 91.93s (+5.2%) | 81.60s (-6.6%) |
Because the duplication rate grows with codebase size, the memory reduction should grow superlinearly with repo size.
I chose a larger repo, but not one so large that it uses swap, to avoid skewing the performance metrics.
Stats from a larger repo:
| Metric | Baseline | Global Cache | Global Cache Without Heuristic | Global Cache With Heuristic 4 |
|---|---|---|---|---|
| Peak Memory | 23015.48MB | 20466.32MB (-11.1%) | 17741.96MB (-22.9%) | 19970.84MB (-13.2%) |
| Idle Memory (post LRU clear) | 11944MB | 11936MB (-0.1%) | 11927MB (-0.1%) | 11933MB (-0.1%) |
| `NodeCache::node` hit rate | 44.30% | 71.13% | 87.33% | 76.23% |
| `NodeCache::token` hit rate | 89.38% | 99.31% | 99.31% | 99.31% |
| Runtime | 148.49s | 145.36s (-2.1%) | 167.40s (+12.7%) | 144.28s (-2.8%) |
I didn't run any benchmarks for continuous editing, but it should benefit even more: editing triggers many small re-parses, which is where cache sharing has the most to gain.
Activity
ChayimFriedman2 commented on Apr 20, 2025
We are well aware that rowan has performance problems, both in terms of speed and in terms of memory usage. There are ongoing efforts to improve this; currently they focus on preconditions (removing mutable syntax trees, which is a lot of work). Thank you for suggesting a potential improvement. We will evaluate it, but this will have to wait until we are ready to refactor rowan (that is, until all preconditions are fulfilled).
chlobes commented on Apr 22, 2025
I don't think this requires changes to rowan. This would be a downstream change in how rust-analyzer uses rowan's existing api.
Veykril commented on Apr 22, 2025
Note that sharing the cache will by itself slowly leak the nodes/tokens, though: from what I can tell, we would never free them anymore.
chlobes commented on Apr 23, 2025
That's true.
In my benchmarks I added a function that cleaned unused entries from the cache, and found it to have a negligible performance cost.
(`weak_count + strong_count == 1` implies the only place an entry is used is the cache, so we can safely remove it without duplicating nodes.)
(Alternatively, we can regularly set `cache = NodeCache::default()` and cheaply and simply evict all entries.) The cache used <0.1% of total memory either way, but to entirely avoid any gradual memory leak, we could run this cleanup regularly (maybe alongside salsa LRU eviction?).
Alternatively, we could modify the cache itself to either:
- change `NodeCache::node`/`NodeCache::token` so the cache no longer keeps entries alive, or
- store the `ArcInner`s in the `NodeCache` itself, so that the cache is a single source of truth that gets emptied when all copies are dropped,

but that really would require a small or big (respectively) change to rowan.
Veykril commented on Apr 23, 2025
Either way, I think we'd be happy to accept something like this! It is going to be a while before we switch rowan out, and this sounds like a decent win (even if mainly useful for startup memory).