Add next_gen_transmute RFC

scottmcm · scottmcm · commit 94cfc1e776d3 · 2025-08-02T16:50:58.000-07:00
diff --git a/text/3844-next-gen-transmute.md b/text/3844-next-gen-transmute.md
@@ -0,0 +1,261 @@
+- Feature Name: `next_gen_transmute`
+- Start Date: (fill me in with today's date, YYYY-MM-DD)
+- RFC PR: [rust-lang/rfcs#0000](https://github.com/rust-lang/rfcs/pull/0000)
+- Rust Issue: [rust-lang/rust#0000](https://github.com/rust-lang/rust/issues/0000)
+
+# Summary
+[summary]: #summary
+
+Change `mem::transmute` from having a magic size check to having an ordinary
+`const { … }`-enforced check plus some normal lints.
+
+Add a `mem::union_transmute` for an even-less-restricted transmute where size
+mismatches are both allowed and not (necessarily) UB.
+
+
+# Motivation
+[motivation]: #motivation
+
+From [#106281](https://github.com/rust-lang/rust/pull/106281#issuecomment-1496648190):
+
+> Many valid (but not provably so) transmute calls are currently rejected by the compiler's checks, pushing folks to less ergonomic options like transmute_copy or pointer casts.
+
+Today, the one-liner when the compiler doesn't let you `transmute` is to instead do
+
+```rust
+mem::transmute_copy(&mem::ManuallyDrop::new(other))
+```
+
+But that's not great.  It doesn't communicate that the programmer *expected* the size
+to match, and thus there's no opportunity for the compiler to help catch a mistaken
+expectation.  Plus it obfuscates other locations that really do want `transmute_copy`,
+perhaps because they're intentionally reading a prefix out of something.
+
+It would be nice to move `mem::transmute` to being a normal function -- not the one
+intrinsic we let people call directly -- in a way that it can be more flexible for
+users as well as easier to update in the compiler without semver worries.
+
+
+# Guide-level explanation
+[guide-level-explanation]: #guide-level-explanation
+
+## `union_transmute`
+
+The `union_transmute` function is a general way to reinterpret the byte representation
+of one type as a different type.  This is equivalent to writing and reading through
+a `union`, like
+
+```rust
+use std::mem::ManuallyDrop;
+
+const unsafe fn union_transmute<T, U>(t: T) -> U {
+    #[repr(C)]
+    union Transmute<A, B> {
+        a: ManuallyDrop<A>,
+        b: ManuallyDrop<B>,
+    }
+    let u = unsafe {
+        Transmute { a: ManuallyDrop::new(t) }.b
+    };
+    ManuallyDrop::into_inner(u)
+}
+```
+
+or to copying over the common prefix length like
+
+```rust
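+use std::mem::{ManuallyDrop, MaybeUninit};
+
+const unsafe fn union_transmute<T, U>(t: T) -> U {
+    // The bytes of `t` are moved into the result, so its destructor must not run.
+    let t = ManuallyDrop::new(t);
+    let mut u = MaybeUninit::<U>::uninit();
+    // `Ord::min` isn't callable in a `const fn` on stable, so compare by hand.
+    let bytes = if size_of::<T>() < size_of::<U>() { size_of::<T>() } else { size_of::<U>() };
+    unsafe {
+        std::ptr::copy_nonoverlapping::<u8>((&raw const t).cast(), u.as_mut_ptr().cast(), bytes);
+        u.assume_init()
+    }
+}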
+
+You might have heard this referred to as "type punning" or similar as well.
+It's also a very similar operation to what you can get by casting pointers,
+though because it's by *value* you don't need to worry about the alignments of
+`T` and `U` the way you would when doing something like
+`(&raw const x).cast::<U>().read()`.
+
+This is incredibly `unsafe` because nearly all combinations of types are going
+to be immediately UB.  For example, `union_transmute::<u32, u64>(x)` is always
+UB because half of the value being read is uninitialized.
+
+However, it's still worth having this available as the fully-general operation
+for the cases where it is sound.  For example, it's sound to use
+`union_transmute::<[T; BIG], [T; SMALL]>(…)` to read a prefix of an array.
+Similarly, given a SIMD type like `#[repr(C, align(16))] struct AlignedF32x3(Simd<f32, 3>);`,
+it would be sound (at least based on the current plans for `Simd`) to `union_transmute`
+back and forth between `AlignedF32x3` and `[f32; 3]`, despite their different sizes.
+
+And of course some things are trivially sound, like `union_transmute::<T, ()>`
+as that would read zero bytes, which of course works.  (There's no need to use
+`union_transmute` for that, though, since it's better spelled `mem::forget`.)
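+
+As a small illustration of the array-prefix case, the sketch below reuses the
+union-based definition under a local `union_transmute` name (a stand-in, since
+no such function exists in `core` yet) to read the first two elements of a
+four-element array.  The `first_two` helper is made up for this example.
+
+```rust
+use std::mem::ManuallyDrop;
+
+/// Local stand-in for the proposed `mem::union_transmute`.
+unsafe fn union_transmute<T, U>(t: T) -> U {
+    #[repr(C)]
+    union Transmute<A, B> {
+        a: ManuallyDrop<A>,
+        b: ManuallyDrop<B>,
+    }
+    // Write through field `a`, then read the same bytes back through `b`.
+    let u = unsafe { Transmute { a: ManuallyDrop::new(t) }.b };
+    ManuallyDrop::into_inner(u)
+}
+
+/// Hypothetical helper: reads the first two elements of a four-element array.
+fn first_two(big: [u8; 4]) -> [u8; 2] {
+    // SAFETY: `[u8; 2]` is no larger than `[u8; 4]`, every byte read is
+    // initialized, and any bit pattern is a valid `u8`.
+    unsafe { union_transmute::<[u8; 4], [u8; 2]>(big) }
+}
+```
+
+Calling `first_two([1, 2, 3, 4])` gives `[1, 2]`; the trailing two bytes are
+simply not read.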
+
+## `transmute`
+
+The `transmute` function does the same thing as `union_transmute` when it compiles,
+but adds the restriction that the input and output types must have the same size.
+
+It's essentially this:
+```rust
+const unsafe fn transmute<T, U>(t: T) -> U {
+    const { assert!(size_of::<T>() == size_of::<U>()) };
+    unsafe { union_transmute::<T, U>(t) }
+}
+```
+
+This has its own name because it's particularly common that when transmuting
+you're *expecting* the two types to be the same size, and it's helpful both to
+communicate that to the reader and let the compiler help double-check it.
+
+For example, `transmute::<[u32; N], [u64; M]>` is only going to be sound when
+the sizes match (i.e. when N = 2×M), so might as well have that checked at
+compile-time instead of potentially letting something unsound sneak in.
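+
+The size check can be tried out with a stand-in that works on stable today.
+The sketch below assumes a hypothetical `size_checked_transmute` helper, built
+on the existing `transmute_copy` rather than on the proposed `union_transmute`,
+that performs the same `const { … }` assertion; `pack` is likewise made up for
+the example.
+
+```rust
+use std::mem::{size_of, transmute_copy, ManuallyDrop};
+
+/// Hypothetical stand-in for the proposed `transmute`.
+unsafe fn size_checked_transmute<T, U>(t: T) -> U {
+    // Evaluated once per monomorphization; a mismatch is a compile error.
+    const { assert!(size_of::<T>() == size_of::<U>()) };
+    // Ownership moves into the result, so `t` must not be dropped here.
+    let t = ManuallyDrop::new(t);
+    // SAFETY: the sizes match; validity of the bits is the caller's obligation.
+    unsafe { transmute_copy::<ManuallyDrop<T>, U>(&t) }
+}
+
+/// `[u32; 4]` and `[u64; 2]` are both 16 bytes, so the assertion passes.
+fn pack(words: [u32; 4]) -> [u64; 2] {
+    // SAFETY: every bit pattern is a valid `u64`.
+    unsafe { size_checked_transmute(words) }
+}
+```
+
+Changing the return type of `pack` to `[u64; 3]` would make the assertion fail
+when that instantiation is compiled, not when `size_checked_transmute` itself is
+type-checked.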
+
+Using a const-assert this way does mean that some calls to `transmute` that can
+never actually work will not be rejected at declaration time, only sometime later
+when the function in question is actually used by something else.
+
+To mitigate that, there are a number of lints:
+
+- `deny`-by-default lints for things where the compiler knows the sizes are
+  different, such as `transmute::<u32, u64>`.
+- `warn`-by-default lints for things where it's *possible* that the size will
+  match, but it's still suspicious, such as `transmute::<[u32; N], u32>` or
+  `transmute::<[u64; N], [u32; N]>` where only one monomorphization can work.
+
+The full complement of such lints is not listed here as they're regularly updated
+to catch more definitely-wrong cases and be smart enough to prove more things
+as *not* being suspicious.
+
+> 📜 Historical Note 📜
+>
+> In previous versions of Rust, `transmute` was actually a hard error when the
+> compiler couldn't *prove* that the types were the same size.  This was limiting
+> in practice, as humans are smarter than the rules we're willing to run during
+> type checking -- this is why `unsafe` code exists at all, really -- and meant
+> that people needed workarounds.
+>
+> All those cases that were previously caught produce lints instead, now, with
+> the possible exception of things that were errors before only from the compiler
+> being insufficiently smart.  For example, `transmute::<[[u32; N]; 2], [u64; N]>`
+> was previously rejected despite those two types always having the same size
+> for any monomorphization, so it might not lint now.
+
+
+# Reference-level explanation
+[reference-level-explanation]: #reference-level-explanation
+
+Because lint details are non-normative, in some sense the implementation is trivial.
+Just add `union_transmute` and change `transmute` as above in `core`.
+
+## Possible implementation approach
+
+Today we already have a `transmute_unchecked` intrinsic in rustc which doesn't
+have a compile-time size check, but *is* still documented as UB if the sizes
+don't match.  That intrinsic can be changed to be defined as the union version,
+perhaps with fallback MIR using that exact definition, and used to implement
+both of the library functions.  Those library functions would probably also
+add some UbChecks (which aren't possible today as `mem::transmute` is a
+re-exported intrinsic, not an actual function).
+
+The change to make `mem::transmute` an ordinary function would need to update
+the existing `check_transmutes` in typeck to instead be a lint that looks for
+calls to `#[rustc_diagnostic_item = "transmute"]`.  (That diagnostic
+item already exists.)  For starters the same logic could be used as a
+deny-by-default lint, as that is closest to today's diagnostics, with any splitting done
+separately over time at the discretion of the diagnostics experts.
+
+This should be straightforward in CTFE and codegen as well.  Once lowered such
+that we have locals for the source and destination, this can be implemented by
+just copying over the number of bytes in the shorter value (plus uninitializing
+the upper bytes in the target, if it's bigger).
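+
+As a rough model of that lowering (plain Rust over byte buffers, not actual
+rustc or CTFE code; the `lower_union_transmute` name is invented here), the
+copy-then-uninitialize step might look like:
+
+```rust
+use std::mem::MaybeUninit;
+
+/// Illustrative model only: copies the common prefix of `src` into `dst`
+/// and marks any remaining bytes of `dst` as uninitialized.
+/// Returns how many bytes were copied.
+fn lower_union_transmute(src: &[u8], dst: &mut [MaybeUninit<u8>]) -> usize {
+    let n = src.len().min(dst.len());
+    for (d, s) in dst[..n].iter_mut().zip(&src[..n]) {
+        d.write(*s);
+    }
+    // If the destination is bigger, its upper bytes carry no value.
+    for d in &mut dst[n..] {
+        *d = MaybeUninit::uninit();
+    }
+    n
+}
+```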
+
+For example, in cg_clif the general-case copy currently uses the destination size
+<https://github.com/rust-lang/rust/blob/6d091b2baa33698682453c7bb72809554204e434/compiler/rustc_codegen_cranelift/src/value_and_place.rs#L641>
+but it could use the min of the source and destination size.
+
+In cg_ssa the general case for SSA values actually already supports this
+<https://github.com/rust-lang/rust/blob/6d091b2baa33698682453c7bb72809554204e434/compiler/rustc_codegen_ssa/src/mir/rvalue.rs#L309-L316>
+and thus would just need the earlier "just emit `unreachable` if the sizes don't
+match" check removed to reach it.
+
+There are some other cases that will need more care, like transmuting a larger
+`BackendRepr::Scalar` to a smaller `BackendRepr::Memory` where the current code
+would do an OOB write if unchanged, but nothing particularly troublesome is expected.
+
+The internal changes (to codegen and similar) would probably happen first so they
+could be implemented and tested before doing the publicly-visible switchover.
+
+
+# Drawbacks
+[drawbacks]: #drawbacks
+
+- The more transmute-related functions we add the more people might feel encouraged
+  to use them, even if we'd rather not.
+- Lots of people don't like post-mono errors, and would rather Rust never have them.
+- This is still massively `unsafe`, so doesn't solve the biggest problems.
+- Weird crimes using transmute to check sizes without ever actually running the
+  transmute might not get caught by the linting or post-mono check.
+
+
+# Rationale and alternatives
+[rationale-and-alternatives]: #rationale-and-alternatives
+
+## Aren't the hard errors better than post-mono ones, on transmute?
+
+Well, there are two big reasons to prefer post-mono here:
+
+1. By being post-mono, it's 100% accurate.  Sure, if we could easily be perfectly
+   accurate earlier in the pipeline that would be nice, but since it's layout-based
+   that's extremely difficult at best because of layering.  (For anything related
+   to coroutines that's particularly bad.)
+2. Making the pre-mono checks *hard* errors, rather than lints, brings a bunch more
+   breaking-change concerns.  Any added smarts that allow something to compile need to be around
+   *forever* (as removing them would be breaking), and similarly the exact details
+   of those checks need to be updated in the specification.  Those changes also
+   impact the MSRV of libraries that depend on them.  Whereas putting those smarts
+   into *lints* instead means that cap-lints applies and we can detect new issues
+   without worrying about back-compat.  And the details of lints don't need to be
+   written down in the specification either.
+
+## Do we really need the `union_transmute`?
+
+Not strictly, no.  We could continue to say that the only "official" kind of
+transmute is one where the sizes are definitely equal at runtime.
+
+That said, the SIMD case was most persuasive to the RFC author.  If we chose to
+offer an `AlignedSimd<T, N>` that rounds up to some implementation-chosen multiple
+in order to offer more alignment (than a `PackedSimd<T, N>` would), it would be
+quite convenient to have a name and primitive operation for the kind of transmute
+that would work for both directions of `[T; N]` ⇄ `AlignedSimd<T, N>`.
+
+Plus using `union`s for type punning like this is something that people already
+do, so having a name for it helps make what's happening more obvious, plus gives
+a place for us to provide better documentation and linting when they do use it.
+
+
+# Prior art
+[prior-art]: #prior-art
+
+Unknown.
+
+
+# Unresolved questions
+[unresolved-questions]: #unresolved-questions
+
+During implementation:
+- Should MIR's `CastKind::Transmute` retain its equal-size precondition?
+
+For nightly and continuing after stabilization:
+- What exactly are the correct lints to have about these functions?
+
+
+# Future possibilities
+[future-possibilities]: #future-possibilities
+
+Nothing new foreseen.  Hopefully the safe-transmute project will continue to
+make progress and help people use `mem::(union_)transmute` less going forward.
+