Modify testrender to work with triangle meshes #1865

fpsunflower · 2024-09-15T00:12:52Z

Description

testrender was originally envisioned as a tiny example renderer that should only handle spheres/planes. Over time several groups have expressed the wish for it to handle arbitrary geometry instead. This PR replaces the sphere and quad primitives with triangle meshes.

I am making use of the (embedded) rapidobj library to load models in the .obj format. The original .xml scene format remains, so that you can combine several models together as well as declare and assign the osl shader networks you want to these meshes. For backwards compatibility, spheres and planes are still supported via tessellation (you specify how many subdivisions you want). In the case of .obj scenes, the shader assignment can be done either by the mesh name or the material name.

I am submitting this to get the review kickstarted. The history of commits includes some fairly large test scenes that should be squashed away to go into the main repo. Files over 50Mb are recommended to use git lfs, but we probably want to figure that out as a different task (possibly handle it via a separate repo).

The handling of derivatives is not totally correct yet, but the behavior is compatible with the previous (incorrect) handling of derivatives we had before.

Tests

Existing tests are passing (I will push updated reference images shortly).

Checklist:

I have read the contribution guidelines.
I have updated the documentation, if applicable.
I have ensured that the change is tested somewhere in the testsuite (adding new test cases if necessary).
My code follows the prevailing code style of this project. If I haven't
already run clang-format v17 before submitting, I definitely will look at
the CI test that runs clang-format and fix anything that it highlights as
being nonconforming.

Signed-off-by: Chris Kulla <[email protected]>

…inimal set of functions to have lit renders Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

… existing materials Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…of all triangles that share the same shader Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…de version which is not implemented in testrender Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…bals to visualize several fields of the shader globals, add basic dPdu and dPdv math Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…ntation for sphere primitives in previous version Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…ay bounces growing to infinity Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…p assist in manual scene setup Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…s contain the ray origin Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

…noise level Signed-off-by: Chris Kulla <[email protected]>

Signed-off-by: Chris Kulla <[email protected]>

steenax86 · 2024-09-20T20:14:08Z

Line 62 in src/testrender/CMakeLists.txt has fp-model=precise already, and the precise model has fhonor-nans on by default. I don't see fp-model=precise for the non-Intel clang compiler, which maybe why the assert is not firing for clang. There is a nan calculation happening somewhere higher up--binID is very odd, and depends on the nan-valued center via Box3.

Could you add fp-model=precise for a clang compilation in the CMakeList.txt and check for nans.

(Probably not needed, but for purposes of debug only, you could add fno-honor-nans and fno-honor-infinities This should disallow optimizations that assume results/operands are not nans. )

Signed-off-by: Chris Kulla <[email protected]>

lgritz · 2024-09-20T21:26:01Z

Crossing my fingers that this will fix the icx issue.

Chris, can you also look at the GPU test failure? (It's a build failure, not a run failure.)

lgritz · 2024-09-20T21:26:29Z

Oh, N/M, the GPU one is where you said that Tim's fix immediately after will address it, right?

tgrant-nv · 2024-09-20T21:41:30Z

Yes, I'll open a PR for the OptiX fixes after this PR is merged.

lgritz · 2024-09-20T21:42:00Z

looks like it's still hitting assertions

lgritz · 2024-09-20T21:44:52Z

I am very tempted to merge this as-is, knowing that there will be icx test failures (or maybe you'd prefer to disable the render tests just for icx, for now?) and assume that we'll get to the bottom of it and soon have what will probably be a very small follow-on PR to directly address the issue. I think failing tests for one compiler is a lesser evil that entering Dev Days with this unmerged, since I think some people want to do work on testrender and it seems pointless for them to start with the obsolete code. What do you think, @fpsunflower ?

fpsunflower · 2024-09-20T22:42:19Z

I agree. I think its worth merging now and we can investigate further as a follow up.

Signed-off-by: Chris Kulla <[email protected]>

lgritz

LGTM.

lgritz · 2024-09-20T23:55:08Z

OK, merging!

As a reminder, there are two outstanding issues here:

The icx test, alone, is failing by hitting assertions in the BVH code and we don't understand why yet. Expect some debugging and a follow-up PR to address the issue. Bug in the new bvh code? Need some kind of clamp to deal with compiler-induced floating point slop? Legit icx compiler bug? Will be exciting to find out!
The GPU test is failing to build. We understand that Tim Grant's immediate follow-on PR that adds OptiX functionality should take care of that.

lgritz · 2024-09-20T23:56:46Z

@timgrant You're up next!

AlexMWells · 2024-09-24T22:37:10Z

src/testrender/scene.cpp

+    for (int y = 0; y < H; y++) {
+        float t = float(y + 0.5f) / float(H);
+        float z = cosf(t * float(M_PI));
+        float q = sqrtf(1 - z * z);


float q = sqrtf(1 - z * z);
So if z > 1.0 by even 1ulp, then sqrtf could result in a NAN.
The result of cosf could allow a certain # of bits accuracy which means it could have results slightly greater than 1.0f.
Please try clamping the input to 0.f
float q = sqrtf(std::max(0.f, 1 - z * z));

and see if you NAN issues go away.

Sometimes in C++ I really miss OSL's "debug_nan" feature that will insert extra code around all math to 100% guarantee identifying the line in which a NaN first creeps in.

I was just did source code review to find that, C library does not guarantee perfect accuracy, so some clamping is needed if feeding values into a NaN producing operation.

Would someone be willing to try that and re-enable icpx CI?

Yeah, I will try right now!

This is a lesson we know well, that's why OIIO::safe_sqrt() exists!

running now...

I believe this is a smart change regardless, so I will submit the PR.

But I'm afraid it doesn't solve the problem. A number of tests still either time out (800 seconds!) or hit assertions. Only for icx.

steenax86 · 2024-10-03T00:41:25Z

Diagnosed the issue to be undefined behavior in the [] operator in float center = bbox.center()[bestAxis] in bvh.cpp's build_bvh function. IMath library's[] operator on a Vec3<float> dereferences only the first component, here float x, and every access beyond that (to y, or z) is out of bounds. Refining the fix and should be able to submit a PR shortly.

fpsunflower · 2024-10-03T01:48:46Z

Thanks for tracking that down! It's too bad that icx miscompiles this particular idiom. Also, I believe the same operator is used in the ray/triangle test, so perhaps there are more bugs lurking once this first one is fixed?

fpsunflower · 2024-10-03T01:51:37Z

I'm also wondering what the ideal way of expressing this idiom would be for icx? Is there a solution we could recommend for the Imath folks that would make all compilers happy?

lgritz · 2024-10-03T02:36:56Z

Wow, great detective work!

That idiom is unfortunately used in a number of places in OpenEXR. I feel like I've run into that problem before, back when we were realizing that it had a hard time vectorizing those classes. I thought I remembered fixing it on the openexr side, but maybe not?

What's the best way to fix that will not be UB? We should fix it in OpenEXR for sure.

lgritz · 2024-10-03T02:37:38Z

I mean Imath, not OpenEXR.

lgritz · 2024-10-03T02:38:30Z

Unfortunately, we also need a workaround on the OSL side, since we can't count on an Imath version being newer than whenever we can get it fixed there.

steenax86 · 2024-10-03T17:08:38Z

It was an interesting experience! XD. I examined values of all locals in build_bvh for icc 2022.0.0 and icx 2023.1.0. Turns out the value of center (for binID calculation) never changed and caused incorrect values to be propagated. Bringing bbox.center() on the stack and then indexing into bestAxis, solved the issue but then identified the core problem.

In it's current form, Imath's [] operator is breaking icx's strict aliasing rules.
The fix is a workaround in OSL. Replaced [] operator with a local getter to return vec3.x, vec3.y, and vec3.z, via depending on bestAxis value.

I will do a pass on all testrender files (including ray/triangle tests) to replace all occurrences of Imath::Box3f [] with the new getter.

fpsunflower · 2024-10-03T19:59:24Z

Replacing that memory access with a switch statement (I'm guessing?) seems like it will make things slower. I'll be curious to see the performance impact of the change.

Is there any flag we could pass to icx for it to match the aliasing rules of the other compilers?

lgritz · 2024-10-03T20:56:30Z

Could you use a trick like

union shim {
    Box3D b;
    float f[6];
};

Box3f thebox;
float component = ((const shim*)&thebox)->f[i];

Or is that also the same kind of UB?

AlexMWells · 2024-10-03T20:58:36Z

@fpsunflower , Imath breaks C/C++ language aliasing rules, so its not an icx specific issue.
https://github.com/AcademySoftwareFoundation/Imath/blob/main/src/Imath/ImathVec.h#L1593

template <class T>
IMATH_HOSTDEVICE IMATH_CONSTEXPR14 inline T&
Vec3<T>::operator[] (int i) IMATH_NOEXCEPT
{
    return (&x)[i]; // NOSONAR - suppress SonarCloud bug report.
}

When an address of an object is taken, it is illegal/undefined behavior to use that pointer to access memory outside the sizeof the object. In this case the object is float x, yes it happens to be a data member, but the object type that lifetime is marked for is 4 bytes of a float. In practice what this means is a temporary Vec3 with x, y, z is considered as only having x be used and y and z might be removed. Later when &x[i] happens with i > 0, the data being accessed won't be there, undefined behavior.

A simple correctness fix for this would be to take the address of the Vec3 object itself to mark all 12 bytes as potentially used, then cast to float to index.

template <class T>
IMATH_HOSTDEVICE IMATH_CONSTEXPR14 inline T&
Vec3<T>::operator[] (int i) IMATH_NOEXCEPT
{
    return reinterpret_cast<float *>(this)[i]
}

There are many good reasons to not use dynamic indices, but that is a longer conversation

lgritz · 2024-10-03T21:10:38Z

We are literally talking about this and looking at this ticket at this moment in the OpenEXR/Imath TSC meeting.

fpsunflower · 2024-10-04T08:37:10Z

The reinterpret_cast version looks good to me if it workss, though I believe its technically also UB, so is using a union. Its a shame there isn't a good way to do this in C++.

As far as this particular issue goes, we could switch from using Imath::Vec3 to using a plain old float[3] maybe? We would loose the overloaded operators but it might not be that bad for the handful of use cases that need indexing.

cary-ilm · 2024-10-10T21:46:14Z

Belatedly catching up on this. I reposted the issue at AcademySoftwareFoundation/Imath#446 for wider discussion, comments there are welcome!

fpsunflower added 27 commits September 14, 2024 16:33

Implement basic .obj file loading and triangle intersection

744d410

Signed-off-by: Chris Kulla <[email protected]>

Replace quad and sphere prims with tesselated version and implement m…

4f7c9f5

…inimal set of functions to have lit renders Signed-off-by: Chris Kulla <[email protected]>

Add stanford bunny test scene

68d581d

Signed-off-by: Chris Kulla <[email protected]>

Introduce type for triangle indices, minor cleanups

f3adf2a

Signed-off-by: Chris Kulla <[email protected]>

Keep track of shaderids and add a mechanism for obj files to refer to…

1444c1a

… existing materials Signed-off-by: Chris Kulla <[email protected]>

Add next-event estimation with random triangle picking

a779a06

Signed-off-by: Chris Kulla <[email protected]>

Add option to show normals, set surface area field based on the area …

7afd489

…of all triangles that share the same shader Signed-off-by: Chris Kulla <[email protected]>

Simplify random light selection logic, use uniform sampling for now

d64150b

Signed-off-by: Chris Kulla <[email protected]>

Fix resolution attribute parsing

0388cd1

Signed-off-by: Chris Kulla <[email protected]>

Run render tests using the optimized variant only, and skip the bitco…

c85a812

…de version which is not implemented in testrender Signed-off-by: Chris Kulla <[email protected]>

Track surface area per mesh instead of per shader

8e94515

Signed-off-by: Chris Kulla <[email protected]>

Update scene files to mark shaders as lights instead of objects

58ed24d

Signed-off-by: Chris Kulla <[email protected]>

Add shading normals and texture coordinates

8a68bba

Signed-off-by: Chris Kulla <[email protected]>

Add uvs to sphere primitives, matching previous implementation

aaa6d88

Signed-off-by: Chris Kulla <[email protected]>

Flip triangle winding on spheres, generalize show_normals to show_glo…

dc8f311

…bals to visualize several fields of the shader globals, add basic dPdu and dPdv math Signed-off-by: Chris Kulla <[email protected]>

Fix sphere parameterization to match previous implementation

3db0704

Signed-off-by: Chris Kulla <[email protected]>

Scale light powers by 4 to account for incorrect surface area impleme…

adc8e08

…ntation for sphere primitives in previous version Signed-off-by: Chris Kulla <[email protected]>

Fix sphere poles from changed parameterization

9ca6da8

Signed-off-by: Chris Kulla <[email protected]>

Ensure light sample hits are evaluated at the selected sample point

13afad7

Signed-off-by: Chris Kulla <[email protected]>

Limit number of bounces on burley diffuse furnace test to avoid runaw…

3dbb009

…ay bounces growing to infinity Signed-off-by: Chris Kulla <[email protected]>

Add info message with render time

7ccb516

Signed-off-by: Chris Kulla <[email protected]>

Add message for failed shader names lookup when loading models to hel…

5913ab3

…p assist in manual scene setup Signed-off-by: Chris Kulla <[email protected]>

Add MacOS finder files to .gitignore

0658552

Signed-off-by: Chris Kulla <[email protected]>

Speedup BVH traversal by being more accurate in cases where both boxe…

f3f4261

…s contain the ray origin Signed-off-by: Chris Kulla <[email protected]>

Add shaderball preview scene

b1ae25b

Signed-off-by: Chris Kulla <[email protected]>

Remove bunny and shaderpreview tests

fe97fea

Signed-off-by: Chris Kulla <[email protected]>

Remove references to removed tests

aa3ed58

Signed-off-by: Chris Kulla <[email protected]>

fpsunflower force-pushed the testrender-triangles branch from db90714 to aa3ed58 Compare September 15, 2024 00:18

fpsunflower added 2 commits September 14, 2024 18:07

Update oren-nayar test with quad light instead of sphere to preserve …

1f3e103

…noise level Signed-off-by: Chris Kulla <[email protected]>

Update uv test with quad light instead of sphere to preserve noise level

860ef2f

Signed-off-by: Chris Kulla <[email protected]>

Try tweaking ICC compiler options for testrender

b250534

Signed-off-by: Chris Kulla <[email protected]>

Revert ICC compiler option changes

bcd93ec

Signed-off-by: Chris Kulla <[email protected]>

lgritz approved these changes Sep 20, 2024

View reviewed changes

lgritz merged commit 78e5392 into AcademySoftwareFoundation:main Sep 20, 2024
20 of 22 checks passed

tgrant-nv mentioned this pull request Sep 21, 2024

Fix testrender OptiX build #1869

Merged

4 tasks

AlexMWells reviewed Sep 24, 2024

View reviewed changes

cary-ilm mentioned this pull request Oct 10, 2024

Undefined behavior in Vec::operator[] AcademySoftwareFoundation/Imath#446

Open

fpsunflower deleted the testrender-triangles branch October 28, 2024 03:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modify testrender to work with triangle meshes #1865

Modify testrender to work with triangle meshes #1865

fpsunflower commented Sep 15, 2024

steenax86 commented Sep 20, 2024

lgritz commented Sep 20, 2024

lgritz commented Sep 20, 2024

tgrant-nv commented Sep 20, 2024

lgritz commented Sep 20, 2024

lgritz commented Sep 20, 2024

fpsunflower commented Sep 20, 2024

lgritz left a comment

lgritz commented Sep 20, 2024

lgritz commented Sep 20, 2024

AlexMWells Sep 24, 2024

lgritz Sep 24, 2024

AlexMWells Sep 24, 2024

lgritz Sep 24, 2024

lgritz Sep 24, 2024

lgritz Sep 24, 2024

steenax86 commented Oct 3, 2024 •

edited

Loading

fpsunflower commented Oct 3, 2024

fpsunflower commented Oct 3, 2024

lgritz commented Oct 3, 2024

lgritz commented Oct 3, 2024

lgritz commented Oct 3, 2024

steenax86 commented Oct 3, 2024

fpsunflower commented Oct 3, 2024

lgritz commented Oct 3, 2024

AlexMWells commented Oct 3, 2024

lgritz commented Oct 3, 2024

fpsunflower commented Oct 4, 2024

cary-ilm commented Oct 10, 2024

Modify testrender to work with triangle meshes #1865

Modify testrender to work with triangle meshes #1865

Conversation

fpsunflower commented Sep 15, 2024

Description

Tests

Checklist:

steenax86 commented Sep 20, 2024

lgritz commented Sep 20, 2024

lgritz commented Sep 20, 2024

tgrant-nv commented Sep 20, 2024

lgritz commented Sep 20, 2024

lgritz commented Sep 20, 2024

fpsunflower commented Sep 20, 2024

lgritz left a comment

Choose a reason for hiding this comment

lgritz commented Sep 20, 2024

lgritz commented Sep 20, 2024

AlexMWells Sep 24, 2024

Choose a reason for hiding this comment

lgritz Sep 24, 2024

Choose a reason for hiding this comment

AlexMWells Sep 24, 2024

Choose a reason for hiding this comment

lgritz Sep 24, 2024

Choose a reason for hiding this comment

lgritz Sep 24, 2024

Choose a reason for hiding this comment

lgritz Sep 24, 2024

Choose a reason for hiding this comment

steenax86 commented Oct 3, 2024 • edited Loading

fpsunflower commented Oct 3, 2024

fpsunflower commented Oct 3, 2024

lgritz commented Oct 3, 2024

lgritz commented Oct 3, 2024

lgritz commented Oct 3, 2024

steenax86 commented Oct 3, 2024

fpsunflower commented Oct 3, 2024

lgritz commented Oct 3, 2024

AlexMWells commented Oct 3, 2024

lgritz commented Oct 3, 2024

fpsunflower commented Oct 4, 2024

cary-ilm commented Oct 10, 2024

steenax86 commented Oct 3, 2024 •

edited

Loading