Revert "Use LLVM intrinsics for `madd` intrinsics" #2014

folkertdev · 2026-02-01T00:42:29Z

This reverts commit 3214671.

The commit is from #1985, which itself reverted this original change.

Use intrinsics::simd for madd (again), now that llvm/llvm-project#174149 should make this optimize properly.

folkertdev · 2026-02-01T01:12:54Z

See also https://godbolt.org/z/8z53f6WPE

r? sayantn

This reverts commit 3214671.

sayantn · 2026-02-01T07:15:06Z

crates/core_arch/src/x86/avx2.rs

+/// This is a trick used in the adler32 algorithm to get a widening addition. The
+/// multiplication by 1 is trivial, but must not be optimized out because then the vpmaddwd
+/// instruction is no longer selected. The assert_instr verifies that this is the case.
+#[target_feature(enable = "avx2")]
+#[cfg_attr(test, assert_instr(vpmaddwd))]
+unsafe fn test_mm256_madd_epi16_mul_one(mad: __m256i) -> __m256i {
+    let one_v = _mm256_set1_epi16(1);
+    _mm256_madd_epi16(mad, one_v)
 }


Please just append this to the test_mm256_madd_epi16 function. Or if you really want to keep it as a separate function, just move it to the tests module

The test checks that the instruction is emitted, so it can't just be added to that existing test. Locally it didn't run when I up it with the other tests, but on CI it does, so I've moved it now.

usamoi · 2026-02-02T08:04:05Z

This seems like it would break the usage of base64. https://godbolt.org/z/q6e1ne568 ¹²

I have some concerns about this kind of change. It might introduce reliance on hard-to-predict optimizer behavior and lead to performance issues that are difficult to detect.

folkertdev · 2026-02-02T09:51:16Z

Interesting, I think it uses a shift instead of a multiply and that fools the pattern match. Yeah that needs a fix.

Do you have any other cases that don't optimize?

I'll revert this, and then add separate tests for the adler32 and base64 patterns.

usamoi · 2026-02-02T10:40:40Z

Do you have any other cases that don't optimize?

Can't find more on GitHub. https://github.com/search?q=_mm_madd_epi16+NOT+is%3Afork+language%3ARust&type=code

If not limited to real-world cases, there are also cases like _mm256_madd_epi16(v, _mm256_set1_epi16(-1)) that might make sense.

folkertdev marked this pull request as ready for review February 1, 2026 01:12

rustbot assigned sayantn Feb 1, 2026

Revert "Use LLVM intrinsics for madd intrinsics"

1a3a6b2

This reverts commit 3214671.

folkertdev force-pushed the llvm-22-madd branch from 3ae174a to ed3475f Compare February 1, 2026 13:20

sayantn reviewed Feb 1, 2026

View reviewed changes

add test for multiply by one pattern

8a883ad

folkertdev force-pushed the llvm-22-madd branch from ed3475f to 8a883ad Compare February 1, 2026 15:28

sayantn added this pull request to the merge queue Feb 2, 2026

Merged via the queue into rust-lang:main with commit 030e64c Feb 2, 2026
75 checks passed

folkertdev mentioned this pull request Feb 2, 2026

Revert "Revert "Use LLVM intrinsics for madd intrinsics"" #2018

Open

usamoi mentioned this pull request Feb 2, 2026

wasm: use intrinsics::simd for dot product #2015

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert "Use LLVM intrinsics for `madd` intrinsics" #2014

Revert "Use LLVM intrinsics for `madd` intrinsics" #2014

folkertdev commented Feb 1, 2026 •

edited

Loading

Uh oh!

folkertdev commented Feb 1, 2026

Uh oh!

sayantn Feb 1, 2026

Uh oh!

folkertdev Feb 1, 2026

Uh oh!

Uh oh!

usamoi commented Feb 2, 2026 •

edited

Loading

Uh oh!

folkertdev commented Feb 2, 2026

Uh oh!

usamoi commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Revert "Use LLVM intrinsics for madd intrinsics" #2014

Revert "Use LLVM intrinsics for madd intrinsics" #2014

Conversation

folkertdev commented Feb 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

folkertdev commented Feb 1, 2026

Uh oh!

sayantn Feb 1, 2026

Choose a reason for hiding this comment

Uh oh!

folkertdev Feb 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

usamoi commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Footnotes

Uh oh!

folkertdev commented Feb 2, 2026

Uh oh!

usamoi commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Revert "Use LLVM intrinsics for `madd` intrinsics" #2014

Revert "Use LLVM intrinsics for `madd` intrinsics" #2014

folkertdev commented Feb 1, 2026 •

edited

Loading

usamoi commented Feb 2, 2026 •

edited

Loading