perf: Relax atomic allocated ops in Table and shrink page size #766


Closed · Veykril wants to merge 2 commits into master from veykril/push-qryuyllyzoll

Conversation

Veykril (Member) commented Mar 20, 2025

These loads are already paired with a lock access on the write path. We also never store more than 10 bits' worth of entries per page, so we can swap the usize for a u16 to shrink the per-page overhead.
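
For illustration, here is a minimal sketch of the scheme (the Page, allocation_lock, and allocated names are illustrative only, not the actual Table code). A page never holds more than 2^10 = 1024 slots, so a u16 counter is plenty, and because every increment happens under the allocation lock, the counter itself can use Relaxed ordering:

```rust
use std::cell::UnsafeCell;
use std::mem::MaybeUninit;
use std::sync::atomic::{AtomicU16, Ordering};
use std::sync::Mutex;

const PAGE_LEN: usize = 1 << 10; // at most 2^10 = 1024 slots per page, so a u16 counter suffices

struct Page<T> {
    /// Writers take this lock before bumping `allocated`, so allocations are
    /// already serialized; the counter does not need to carry extra ordering.
    allocation_lock: Mutex<()>,
    /// Number of initialized slots; only ever increases. Shrunk from usize to u16,
    /// saving one word of per-page overhead.
    allocated: AtomicU16,
    /// Slot storage; never reallocated, so indices stay stable.
    data: [UnsafeCell<MaybeUninit<T>>; PAGE_LEN],
}

impl<T> Page<T> {
    fn allocate(&self, value: T) -> Option<usize> {
        let _guard = self.allocation_lock.lock().unwrap();
        let index = self.allocated.load(Ordering::Relaxed) as usize;
        if index == PAGE_LEN {
            return None; // page full
        }
        // Safety (sketch): writers are serialized by the lock, and `allocated`
        // has not yet been bumped past `index`, so no one else touches this slot.
        unsafe { (*self.data[index].get()).write(value) };
        self.allocated.store(index as u16 + 1, Ordering::Relaxed);
        Some(index)
    }
}
```

The open question, discussed below, is whether readers on other threads can rely on a Relaxed load of allocated before dereferencing a slot.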

Veykril (Member, Author) commented Mar 20, 2025

@davidbarsky here is your reminder for this for your loom work

netlify bot commented Mar 20, 2025

Deploy Preview for salsa-rs canceled.

🔨 Latest commit: f6cdb3d
🔍 Latest deploy log: https://app.netlify.com/projects/salsa-rs/deploys/684bb464675aa80008829a60

codspeed-hq bot commented Mar 20, 2025

CodSpeed Performance Report

Merging #766 will not alter performance

Comparing Veykril:veykril/push-qryuyllyzoll (f6cdb3d) with master (6ced42b)

Summary

✅ 12 untouched benchmarks

@Veykril Veykril force-pushed the veykril/push-qryuyllyzoll branch 2 times, most recently from eeb899a to 0b2a614, on March 26, 2025 06:29
@Veykril Veykril force-pushed the veykril/push-qryuyllyzoll branch from 0b2a614 to 45eaab3 on May 26, 2025 11:09
@Veykril Veykril marked this pull request as ready for review May 26, 2025 11:09
ibraheemdev (Contributor) commented May 27, 2025

> just shuttle passes

I don't think shuttle actually models weak atomics. We could run Miri with --many-seeds to get more coverage of this.

Veykril (Member, Author) commented Jun 12, 2025

Ah I see, that's a bummer. According to the Miri docs the default is already a high (and slow) 64. I'm not sure increasing it is worth it. Is my reasoning for relaxing the loads here correct either way? I think it is, but I'd rather get a second opinion of course.

ibraheemdev (Contributor) commented:

> Ah I see, that's a bummer. According to the Miri docs the default is already a high (and slow) 64.

I think that's the default if you pass the flag without a value. A plain cargo miri run will only use one seed.
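
(For concreteness, something like MIRIFLAGS="-Zmiri-many-seeds=0..64" cargo miri test should exercise 64 seeds per test; treat the exact flag spelling as an approximation based on the Miri docs rather than a verified invocation.)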

ibraheemdev (Contributor) left a comment


I'm not sure this is correct if something like slots_of is used concurrently? I know that's only exposed under salsa_unstable, but the code is currently written such that slots can be accessed across threads (the functions return a slice, not just the specific ID). If we wanted to relax the loads I think we should be clearer about only supporting point-access for previously synchronized slots.

The allocation lock already enforces a happens-before relationship for the initialized length
Veykril (Member, Author) commented Jun 13, 2025

Hmm, is that a problem? The allocated count only ever increments and we never reallocate the data. allocated here is a bit of a misnomer, I think; what it actually means is "slots initialized", so a slice here will never point to uninitialized elements.

I believe you mean the slots_of function here, is that right?

We do not use more than 10 bits for counts per page anyway, so this shrinks the page overhead by 1 word
@Veykril Veykril force-pushed the veykril/push-qryuyllyzoll branch from 45eaab3 to f6cdb3d on June 13, 2025 05:17
@Veykril Veykril changed the title from "Relax atomic loads and stores in Table" to "perf: Relax atomic allocated ops in Table and shrink page size" on Jun 13, 2025
ibraheemdev (Contributor) commented Jun 13, 2025

allocated is the flag for a slot being initialized and establishes happens-before, e.g. if i < allocated.load(Acquire) { *slot.get(i) }. This matters if you are reading a slot that was initialized by a different thread, which I believe can only happen with the slots_of function. Otherwise, most slots are tied to a specific ingredient Id which must be allocated by the current thread. However, even apart from slots_of, this invariant is not clear in the code, so I would be wary of relaxing the load.
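
To make the happens-before pattern concrete, here is a minimal, self-contained sketch (the OneSlotTable type and its names are purely illustrative, not Salsa's actual Table API):

```rust
use std::cell::UnsafeCell;
use std::sync::atomic::{AtomicU16, Ordering};
use std::thread;

// One slot plus an "initialized length" counter, mirroring the pattern above.
struct OneSlotTable {
    allocated: AtomicU16,
    slot: UnsafeCell<Option<String>>,
}

// Safety (sketch only): soundness of sharing `slot` rests entirely on the
// Release/Acquire pair below; with Relaxed ordering this would be a data race.
unsafe impl Sync for OneSlotTable {}

fn main() {
    let table = OneSlotTable {
        allocated: AtomicU16::new(0),
        slot: UnsafeCell::new(None),
    };
    thread::scope(|s| {
        // Writer: initialize the slot, then publish the new length with Release.
        s.spawn(|| {
            unsafe { *table.slot.get() = Some("hello".to_owned()) };
            table.allocated.store(1, Ordering::Release);
        });
        // Reader (a slots_of-style accessor on another thread): the Acquire load
        // synchronizes with the Release store, so the slot contents are visible.
        s.spawn(|| {
            if table.allocated.load(Ordering::Acquire) > 0 {
                let value = unsafe { (*table.slot.get()).as_ref().unwrap() };
                assert_eq!(value.as_str(), "hello");
            }
        });
    });
}
```

With Relaxed ordering on both sides, the length check could succeed while the slot write is not yet guaranteed to be visible to the reader, which is exactly the hazard for a cross-thread accessor like slots_of.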

Veykril (Member, Author) commented Jun 13, 2025

fair enough

@Veykril Veykril closed this Jun 13, 2025