Assertions cleanups, annotate unreachable code #13242

MisterDA · 2024-06-17T12:22:55Z

This PR sports small cleanups around assertions in the runtime.

Use a definition of unreachable() to mark unreachable code where it makes sense, and allow C compilers to optimize the code.
Split conjunctions of CAMLassert(A && B); into CAMLassert(A); CAMLassert(B) to better identify which part of the assertion fails.
Some CAMLassert can be replaced with static_assert.

(using the unreachable annotation will later be useful to optimize ocamlc too!)

~~No change entry needed, I think.~~
cc @NickBarnes

runtime/array.c

dustanddreams · 2024-06-17T12:27:43Z

runtime/caml/misc.h

+  #elif defined(_MSC_VER)
+    #define unreachable() (__assume(0))
+  #else
+    #define unreachable() (abort())


I'd rather see this defined as

Suggested change

#define unreachable() (abort())

#define unreachable() (assert(0))

so that, should it ever fire, this gets recognized as an assertion failure rather than a mysterious SIGABRT.

(And yes, given how eager clang and gcc are at arm-wrestling the C specs in order to win at picobenchmarks, I am expecting such unreachable code paths to be reachable when compiled with these compilers at some of their optimization levels).

But what if C assertions are disabled with NDEBUG? The code path may then be missing a return statement since there won't be a noreturn function to end it.
Similarly, if we used CAMLassert instead, it would result in an empty statement in non-debug runtimes, triggering the same problem.
If I keep an assertion and add a return statement, I'll probably get warnings about an unreachable return statement after the unreachable annotation…

There is no simple way to solve this without making every compiler combination happy. You probably should not remove the few return following unreachable then.

The compilers differ too much in behavior… but in my test they complain about missing returns rather that unreachable code. I've changed my code to use CAMLassert here.

runtime/extern.c

gasche · 2024-06-17T12:38:51Z

I am nervous about unreachable. If I understand correctly, before we used CAMLassert(0), which means "in debug mode have a clean error, otherwise do nothing", and then we would return something if the type was not void. Now unreachable means "reaching here is undefined behavior".

In the cases where we know for sure that this place is dead code, unreachable is fine. But if there are scenarios of misuse where these places do get reached, typically a programming error in our runtime or in FFI code, then users are going to get weird segfaults instead of "a clean error in debug mode, and some non-crashing fallback otherwise". This could be a degradation, and ensuring that it's fine requires a careful look at the code.

MisterDA · 2024-06-17T14:28:01Z

I am nervous about unreachable. If I understand correctly, before we used CAMLassert(0), which means "in debug mode have a clean error, otherwise do nothing", and then we would return something if the type was not void. Now unreachable means "reaching here is undefined behavior".

That is my understanding too.

In the cases where we know for sure that this place is dead code, unreachable is fine. But if there are scenarios of misuse where these places do get reached, typically a programming error in our runtime or in FFI code, then users are going to get weird segfaults instead of "a clean error in debug mode, and some non-crashing fallback otherwise". This could be a degradation, and ensuring that it's fine requires a careful look at the code.

These are valid concerns that I'll try to address. I've remembered in the meantime that GCC can warn whenever a switch statement has an index of enumerated type and lacks a case for one or more of the named codes of that enumeration (see -Wswitch, included in -Wall). This alleviates a bit the need for the unreachable annotation, clarifies the code, and adds a bit of type safety (yay!). Now, we need to convince ourselves that values outside of the enumeration space cannot sneak in, and I think that's achievable.

The diff of bigarray.h has changed to extract masks and layout position from the enumerations, but I don't think this changes the effective interface.

dustanddreams · 2024-06-17T14:30:48Z

These are valid concerns that I'll try to address. I've remembered in the meantime that GCC can warn whenever a switch statement has an index of enumerated type and lacks a case for one or more of the named codes of that enumeration (see -Wswitch, included in -Wall). This alleviates a bit the need for the unreachable annotation, clarifies the code, and adds a bit of type safety (yay!). Now, we need to convince ourselves that values outside of the enumeration space cannot sneak in, and I think that's achievable.

I'm willing to bet the CI will expose at least one compiler which will nevertheless emit warnings for this (its name probably starts with "MS" and ends with "VC").

MisterDA · 2024-06-18T07:40:34Z

I'm a bit disappointed by my last endeavor. GCC, clang, and MSCV have oh-so-subtly different behaviors regarding unreachable warnings and switch exhaustiveness check. This makes the code somewhat more verbose than I've previously hoped.
I all cases, we don't expect values outside of the possible range of the enumeration, and using default: unreachable() covers that and the compilers' wrath.

runtime/interp.c

dustanddreams

Aside the previous remark, I'm satisfied with the changes as they are now.

shindere · 2024-06-19T10:04:36Z

Antonin Décimo (2024/06/19 01:58 -0700):

In the meantime I'm tempted to leave this as-is an revisit later, if that's fine.

I'll trust you on that, @dustanddreams?

shindere · 2024-06-19T10:05:01Z

Miod Vallat (2024/06/19 02:02 -0700):

@dustanddreams commented on this pull request. > @@ -1418,7 +1418,7 @@ do_resume: { #ifndef THREADED_CODE default: #ifdef _MSC_VER - __assume(0); + unreachable(); That's fine.

Ah sorry, didn't see this.

shindere · 2024-06-19T10:09:46Z

Ah, this now needs to be rebased to resolve the conflict in Changes.

Feel free to ping me when done.

MisterDA · 2024-06-20T06:25:26Z

Rebase is done. Maybe @gasche still has comments?

gasche · 2024-06-20T06:41:27Z

My comment remains that if some of those unreachable() are in fact reachable in case of programming bugs by the user, then I would prefer something that fails cleanly than nasty undefined behavior, especially in debug mode -- but why not all the time?

I only looked at the new version briefly but my impression is that it suffers from the same problem. For example:

you use explicit casts for bigarray masks (and that's nice), but unless I am very C-confused this does not bring any additional dynamic safety, and a programming error in the runtime or in the user FFI code could still bring an unexpected value there?
in the bytecode interpreter you point out one place where UB-undefined may bring performance benefits, but if I understand correctly this is the place that handles unknown opcodes; it occurs from time to time that the interpreter is fed an unknown opcode (for example when we add new bytecode instructions and forget to update the magic number), and a user-friendly error in this situations is very helpful. To me this calls for a "this is a cold place that is very rarely reached" annotation, not a "do whatever you want" annotation.

runtime/interp.c

MisterDA · 2024-06-21T12:56:18Z

My comment remains that if some of those unreachable() are in fact reachable in case of programming bugs by the user, then I would prefer something that fails cleanly than nasty undefined behavior, especially in debug mode -- but why not all the time?

For debug mode, there's a possible alternative implementation where we trap the program if it reaches an unreachable annotation.

I only looked at the new version briefly but my impression is that it suffers from the same problem.

You're right, but...

you use explicit casts for bigarray masks (and that's nice), but unless I am very C-confused this does not bring any additional dynamic safety, and a programming error in the runtime or in the user FFI code could still bring an unexpected value there?

I can convince myself that the runtime currently cannot fall in the default cases. It's true that it's true that this doesn't prevent us from future errors. As for user FFI code, to me this example shares a lot in common with converting the sum type ocaml representation to and from a C array of integers (think socket flags, or caml_convert_flag_list). We usually don't bound-check them and just access the array at the offset represented by the tag. We cannot protect against a C array being shorter than the type exposed on the OCaml side, or having wrong values. Here too I think we could forget about these programming errors.

in the bytecode interpreter you point out one place where UB-undefined may bring performance benefits, but if I understand correctly this is the place that handles unknown opcodes; it occurs from time to time that the interpreter is fed an unknown opcode (for example when we add new bytecode instructions and forget to update the magic number), and a user-friendly error in this situations is very helpful. To me this calls for a "this is a cold place that is very rarely reached" annotation, not a "do whatever you want" annotation.

On the other hand the error case for the interpreter seems to be only handled for the non-threaded code interpreter (debug mode or MSVC). Am I reading the code wrong? What happens when (threaded-code) ocamlrun encounters a bad opcode at the moment?

Adding unreachable annotations in switch cases is supposed to help the compiler optimize the jumps. I'll try to provide some benchmarks later if we still need more motivation.

gasche · 2024-06-21T13:19:58Z

Here too I think we could forget about these programming errors.

I see no benefit to making our system less usable/debuggable in this way. (In particular I very much doubt that there is any measurable performance benefit -- in the bigarray stuff.) Why do you want to do this? My impression (but I may be wrong and I'm not judging in any way) is that you think that unreachable() is the standard/common/good-practice thing to do in C. But we would benefit more from a clean fatal-error than from undefined behavior.

In terms of gcc intrisics, I think we want __builtin_trap rather than __builtin_unreachable.

Adding unreachable annotations in switch cases is supposed to help the compiler optimize the jumps. I'll try to provide some benchmarks later if we still need more motivation.

Again, this could be done with a hint that says "this code path is unlikely" and then follows with a fatal error, we don't need to introduce more undefined behaviors in our programs to get code-generation benefits.

Rather than using C23 or C++'s unreachable annotation, we prefer trapping the program execution when an unreachable path is taken. MSVC currently has a bug with unreachable code analysis and its __fastfail intrinsic, thus the noreturn wrapper function. https://developercommunity.visualstudio.com/t/C-Code-Analysis-should-understand-that/10665570

Makes it easier to identify which part of the assertion has failed.

In some cases an compilers, this allows to remove the unreachable annotation, and the compiler checks that all cases are covered. In general, it also makes it easier to reason about the space of values.

MisterDA · 2024-06-24T08:52:07Z

Thank you for the pointers, that was really helpful. I've changed the definition of CAMLunreachable to trap if the compiler supports it, or use CAMLassert(0) if not (the current state).

gasche · 2024-06-26T15:59:53Z

runtime/interp.c

@@ -1417,13 +1417,9 @@ do_resume: {

 #ifndef THREADED_CODE
    default:
-#ifdef _MSC_VER
-      __assume(0);
-#else


This change may have a negative performance impact, I'm not sure. Have you tested it?

(The test that I have in mind: use a dumb microbenchmark that does something compute-heavy for long enough, and compare the performance using ocamlrun before and after the change. If you don't see any noticeable performance difference then we should be fine.)

dustanddreams reviewed Jun 17, 2024

View reviewed changes

MisterDA marked this pull request as draft June 17, 2024 14:34

MisterDA force-pushed the assertions branch from ec2d5a0 to a393574 Compare June 18, 2024 07:31

MisterDA marked this pull request as ready for review June 18, 2024 08:02

dustanddreams reviewed Jun 19, 2024

View reviewed changes

runtime/interp.c Outdated Show resolved Hide resolved

dustanddreams approved these changes Jun 19, 2024

View reviewed changes

MisterDA force-pushed the assertions branch from a393574 to 7aef5ee Compare June 19, 2024 12:47

MisterDA force-pushed the assertions branch 2 times, most recently from 68ce0f6 to 3bbb449 Compare June 21, 2024 12:24

dustanddreams reviewed Jun 21, 2024

View reviewed changes

runtime/interp.c Show resolved Hide resolved

MisterDA force-pushed the assertions branch from 3bbb449 to 39392b9 Compare June 21, 2024 12:41

MisterDA added 7 commits June 24, 2024 10:43

Use CAMLunreachable() in unreachable switch cases

821b948

Use C11 static_assert instead of CAMLassert with static expressions

2890656

Split CAMLassert conjunctions

09ce5d5

Makes it easier to identify which part of the assertion has failed.

Rely on the C compiler exhaustiveness check switch on enums

f035f83

In some cases an compilers, this allows to remove the unreachable annotation, and the compiler checks that all cases are covered. In general, it also makes it easier to reason about the space of values.

Always error if a bad opcode is found in non-threaded code interp

9755cc3

Update changes

29e32e6

MisterDA force-pushed the assertions branch from 39392b9 to 29e32e6 Compare June 24, 2024 08:50

gasche approved these changes Jun 26, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Assertions cleanups, annotate unreachable code #13242

Assertions cleanups, annotate unreachable code #13242

MisterDA commented Jun 17, 2024 •

edited

Loading

dustanddreams Jun 17, 2024

MisterDA Jun 17, 2024

dustanddreams Jun 17, 2024

MisterDA Jun 18, 2024

gasche commented Jun 17, 2024

MisterDA commented Jun 17, 2024

dustanddreams commented Jun 17, 2024

MisterDA commented Jun 18, 2024

dustanddreams left a comment

shindere commented Jun 19, 2024 via email

shindere commented Jun 19, 2024 via email

shindere commented Jun 19, 2024

MisterDA commented Jun 20, 2024

gasche commented Jun 20, 2024

MisterDA commented Jun 21, 2024

gasche commented Jun 21, 2024

MisterDA commented Jun 24, 2024

gasche Jun 26, 2024

gasche Jun 26, 2024

	#define unreachable() (abort())
	#define unreachable() (assert(0))

Assertions cleanups, annotate unreachable code #13242

Are you sure you want to change the base?

Assertions cleanups, annotate unreachable code #13242

Conversation

MisterDA commented Jun 17, 2024 • edited Loading

dustanddreams Jun 17, 2024

Choose a reason for hiding this comment

MisterDA Jun 17, 2024

Choose a reason for hiding this comment

dustanddreams Jun 17, 2024

Choose a reason for hiding this comment

MisterDA Jun 18, 2024

Choose a reason for hiding this comment

gasche commented Jun 17, 2024

MisterDA commented Jun 17, 2024

dustanddreams commented Jun 17, 2024

MisterDA commented Jun 18, 2024

dustanddreams left a comment

Choose a reason for hiding this comment

shindere commented Jun 19, 2024 via email

shindere commented Jun 19, 2024 via email

shindere commented Jun 19, 2024

MisterDA commented Jun 20, 2024

gasche commented Jun 20, 2024

MisterDA commented Jun 21, 2024

gasche commented Jun 21, 2024

MisterDA commented Jun 24, 2024

gasche Jun 26, 2024

Choose a reason for hiding this comment

gasche Jun 26, 2024

Choose a reason for hiding this comment

MisterDA commented Jun 17, 2024 •

edited

Loading