Segfault fixes and deterministic multithreading #126

nh2 · 2019-10-14T23:03:47Z

See #125, and the commit messages.

The direct call to `make` in `BUILD_COMMAND` used until now forced subprojects to build with `-j1` and produced the message `warning: jobserver unavailable: using -j1. Add '+' to parent make rule.` See https://gitlab.kitware.com/cmake/cmake/issues/16273.

Until now, a mask with a 255 in its border could be generated. Such a mask later failed the `valid_mask` assertions or, if assertions are disabled in the build, caused segmentation faults due to invalid memory accesses. See the example in the added comment for a condition under which this could happen.

The key insight is that a point lying exactly on the "border" between two pixels, say between pixel N and N+1, counted as occupying pixel N+1. So the triangle { (1,1), (1,2), (2,1) } in a 3x3 image would result in the mask

    64  64  64
    64 255 255
    64 255  64

(notice the triangle of 255s), instead of the correct mask

    64  64  64
    64 255  64
    64  64  64

This commit fixes it by adding the condition that the last `x = width-1` and `y = height-1` must not count as `inside` the triangle. It also improves related assertions in a few places.
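The 3x3 example can be reproduced with a small stand-alone sketch. This is not the code from the commit: `Point`, `edge`, `inside_or_on_edge` and `rasterize_mask` are hypothetical names, and the inside test (integer pixel coordinates checked against the triangle, with points on an edge counted as inside) is an assumption chosen to match the masks described above.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical names; only the values 64 and 255 come from the example masks.
constexpr std::uint8_t kOutside = 64;
constexpr std::uint8_t kInside = 255;

struct Point {
    double x;
    double y;
};

// Twice the signed area of triangle (a, b, p). The sign tells on which
// side of the edge a->b the point p lies; zero means p is on the edge line.
double edge(Point a, Point b, Point p) {
    return (b.x - a.x) * (p.y - a.y) - (b.y - a.y) * (p.x - a.x);
}

// True if p lies inside the triangle or exactly on one of its edges
// (assumption: on-edge points count as inside, as in the buggy example).
bool inside_or_on_edge(Point a, Point b, Point c, Point p) {
    const double w0 = edge(a, b, p);
    const double w1 = edge(b, c, p);
    const double w2 = edge(c, a, p);
    const bool has_neg = w0 < 0 || w1 < 0 || w2 < 0;
    const bool has_pos = w0 > 0 || w1 > 0 || w2 > 0;
    return !(has_neg && has_pos);
}

// Fills a row-major width*height mask. With exclude_last_row_and_column
// set, pixels with x == width-1 or y == height-1 never count as inside.
std::vector<std::uint8_t> rasterize_mask(int width, int height,
                                         Point a, Point b, Point c,
                                         bool exclude_last_row_and_column) {
    std::vector<std::uint8_t> mask(
        static_cast<std::size_t>(width) * static_cast<std::size_t>(height),
        kOutside);
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            const Point p{static_cast<double>(x), static_cast<double>(y)};
            bool inside = inside_or_on_edge(a, b, c, p);
            if (exclude_last_row_and_column &&
                (x == width - 1 || y == height - 1)) {
                inside = false;
            }
            if (inside) {
                mask[static_cast<std::size_t>(y) * width + x] = kInside;
            }
        }
    }
    return mask;
}
```

Calling `rasterize_mask(3, 3, {1, 1}, {1, 2}, {2, 1}, false)` puts 255s into the last row and column, while passing `true` leaves only the centre pixel set, so the border stays free of 255s.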

When adding to vectors (e.g. using `push_back()`), `ordered`, in contrast to `critical`, ensures that operations happen in the same order as in a single-threaded run, so the result is the same on every run of the program. This makes the results deterministic: running with the same inputs produces byte-identical outputs, independent of threading.

I have checked using `time` on a 6-core machine that the changes have no significant impact on performance; this is expected because the critical regions are very small (usually adding small pointers to vectors).

One location still uses `critical`, because `omp ordered` cannot be used inside `pragma omp parallel`, only inside `pragma omp parallel for`. That location is likely written this way because the thread-local variable `projected_face_view_infos` is intended as a per-thread intermediate buffer; more effort is needed to work out how its contents can be put back into order. I've added a TODO for this. I haven't yet observed nondeterminism due to this, but I may just have been lucky.
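The difference between the two constructs can be illustrated with a minimal sketch. The function names `collect_critical` and `collect_ordered` and the squared-index workload are made up for illustration and do not appear in this PR; the block compiles with or without OpenMP enabled (for example `-fopenmp` on GCC or Clang), and without it the pragmas are simply ignored.

```cpp
#include <vector>

// Hypothetical example: each iteration computes a value and appends it.
// With `critical`, appends are mutually exclusive but may happen in any
// order, so the element order can differ from run to run.
std::vector<int> collect_critical(int n) {
    std::vector<int> out;
    #pragma omp parallel for
    for (int i = 0; i < n; ++i) {
        const int value = i * i;  // independent work, may run in parallel
        #pragma omp critical
        out.push_back(value);
    }
    return out;
}

// With `ordered`, the marked statement runs in loop-iteration order, so
// the vector has the same contents as after a single-threaded run.
// The loop directive needs the `ordered` clause for this to be valid.
std::vector<int> collect_ordered(int n) {
    std::vector<int> out;
    #pragma omp parallel for ordered
    for (int i = 0; i < n; ++i) {
        const int value = i * i;  // independent work, may run in parallel
        #pragma omp ordered
        out.push_back(value);
    }
    return out;
}
```

Only the statement under `ordered` is serialised in iteration order; the work before it can still overlap across threads, which is consistent with the small performance impact reported above when the ordered region is just a `push_back()`.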

nh2 added 3 commits October 14, 2019 19:45

This was referenced Oct 14, 2019

Segfault and valgrind errors #125

Open

Fully deterministic output #124

Open
