Encoding float16 lossless appears to produce artifacts for specific values #3881

Skielex · 2024-10-07T21:04:09Z

Describe the bug
When using lossless float16 encoding, specific ranges of binary values appear to be corrupted. Specifically, it appears that all values with a binary representation corresponding to the uint16 ranges [512:1023], [31745:32767], [33280:33793], and [64513:65534] are changed during an encode-decode cycle.

To Reproduce
Steps to reproduce the behavior:

Encode an image with any float16 value with a binary representation corresponding to the uint16 ranges [512:1023], [31745:32767], [33280:33793], and [64513:65534].
Decode the image.
Check that the image data has changed.

Expected behavior
Values should not change during a lossless encode-decode cycle. I've tested float32 encoding with all possible values and it works as expected.

Screenshots
Affected value ranges in yellow:

Bit values (yellow = True) for all 65536 float16 values with affected ranges between red and blue lines.

Environment

OS: Ubuntu WSL2 on Windows (+ Windows, AMD64 and ARM64, see JPEGXL lossless float16 is not lossless cgohlke/imagecodecs#114)
Compiler version: GCC 11 (I assume, it's a Python 3.10 extension)
CPU type: AMD Ryzen 5900X
libjxl version: 0.11.0 (according to this)
cjxl/djxl version string: Cannot test with cjxl/djxl as they don't appear to support any formats that allow float16.

Additional context
I found this issue using the imagecodes Python package. Original issue is cgohlke/imagecodecs#114.

As mentioned in the issue linked above, I've not been able to test for the issue using cjxl/djxl or GIMP due to a lack of float16 support.

The text was updated successfully, but these errors were encountered:

jyrkialakuijala · 2024-10-11T11:11:35Z

I didn't look at the code, just speculating about this from a belief-based viewpoint.

The format itself stores these as integers and does prediction and other processing as if they were integers -- and it is highly unlikely that there is an issue there.

The phenomena could be due to some of the following:
-Inf, +Inf, NaN, negative zero vs. positive zero, Denormalized Numbers, other approximations used in 16 bit floating point calculations in the calling code rather than in JPEG XL

Skielex · 2024-10-11T21:07:46Z

I don't think the issue is related to special float values, although they too are affected, for two reasons:

The first affected region of values consists of float16 values between 3.0517578125e-05 and 6.097555160522461e-05, which become values between 0.0 and 6.091594696044922e-05 after encode-decode.
The float32 encode-decode does not suffer from this issue. I've tested all possible float32 binary values and they were all preserved with lossless encoding.

The true uint16 representation of the values in the first region are:

512,  513,  514, ..., 1021, 1022, 1023

However, after encode-decode they become:

0, 2, 4, ..., 1018, 1020, 1022

The next value, 1024 (6.103515625e-05 as float16), is preserved.

kmilos · 2024-10-23T15:14:06Z

It looks indeed like the differences are in the special inf/NaN and subnormal numbers.

I don't think the issue is related to special float values

The true uint16 representation of the values in the first region are:

512, 513, 514, ..., 1021, 1022, 1023

But these are (half of the) positive subnormal numbers in float16 binary representation.

[31745:32767]

These are "positive" NaNs.

[33280:33793]

These are (half of the) negative subnormals.

[64513:65534]

These are "negative" NaNs.

kmilos mentioned this issue Oct 8, 2024

jxl - check option combinations & bit depth darktable-org/darktable#17487

Open

mo271 added bug Something isn't working encoder Related to the libjxl encoder unrelated to 1.0 Things that need not be done before the 1.0 version milestone labels Oct 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Encoding float16 lossless appears to produce artifacts for specific values #3881

Encoding float16 lossless appears to produce artifacts for specific values #3881

Skielex commented Oct 7, 2024

jyrkialakuijala commented Oct 11, 2024 •

edited

Loading

Skielex commented Oct 11, 2024 •

edited

Loading

kmilos commented Oct 23, 2024 •

edited

Loading

Encoding float16 lossless appears to produce artifacts for specific values #3881

Encoding float16 lossless appears to produce artifacts for specific values #3881

Comments

Skielex commented Oct 7, 2024

jyrkialakuijala commented Oct 11, 2024 • edited Loading

Skielex commented Oct 11, 2024 • edited Loading

kmilos commented Oct 23, 2024 • edited Loading

jyrkialakuijala commented Oct 11, 2024 •

edited

Loading

Skielex commented Oct 11, 2024 •

edited

Loading

kmilos commented Oct 23, 2024 •

edited

Loading