Add horizontal predictor support #240

spoutn1k · 2024-08-03T19:25:12Z

While investigating #237, I came to the realization that the crate lacked predictor support at encoding. This PR attemps to merge the feature in the most efficient way.

TiffEncoder gains a fluent interface with_predictor allowing users to easily set their preferred method. Using this predictor with the LZW compression makes its average compression ratio go from 0.85:1 (yes, the compressed image is bigger) to 1.3:1.

With those changes underway, the clunkyness of the current interface was just too strong. Having the compression as a generic type on the ImageEncoder methods serves no purpose other than complicating the code; methods were duplicated x and x_with_compression to account for the type. Adding the predictor to this interface was out of the question: x x_with_compression x_with_predictor x_with_compression_and_predictor ? No thanks.

TiffEncoder also gains a with_compression fluent interface to set the desired file compression.

The final interface looks as such:

let mut encoder = TiffEncoder::<std::fs::File>::new(compressed)?
        .with_predictor(Predictor::Horizontal)
        .with_compression(Compression::Lzw);

    encoder.write_image::<colortype::RGB8>(
        photo.width(),
        photo.height(),
        photo.as_rgb8().expect("Wrong image format"),
    )?;

-    let mut encoder = TiffEncoder::<std::fs::File>::new(compressed)?;
+    let mut encoder = TiffEncoder::<std::fs::File>::new(compressed)?
+        .with_predictor(Predictor::Horizontal)
+        .with_compression(Compression::Lzw);
+
+    encoder.write_image::<colortype::RGB8>(
-    encoder.write_image_with_compression::<colortype::RGB8, Compresion::Lzw>(
         photo.width(),
         photo.height(),
-        Compression::Lzw,
         photo.as_rgb8().expect("Wrong image format"),
+    )?;

This means the *_with_compression methods are gone from the TiffEncoder interface, which means breaking changes.

spoutn1k · 2024-08-18T16:43:41Z

Any feedback ? If the breaking changes are not welcome I can try and work around them but I was able to build image-rs/image with no issues

spoutn1k · 2024-08-19T18:42:36Z

Fixed the grammar incompatible with rust 1.61 ! Please re-run the tests I cannot do it manually :(

spoutn1k · 2024-09-14T12:50:14Z

Any feedback ? I am using this to compress my images using rust instead of an external tool and if it is helpful to me I'm sure it could help someone else.

kornelski · 2024-09-18T21:34:46Z

Creating a new Vec per row is wasteful. You could pass &mut Vec to the predictor and have it append pixels directly to the final buffer.

spoutn1k · 2024-09-19T04:32:25Z

Well maybe but the method this function is used in does not allow for modification of the buffer it passes:

image-tiff/src/encoder/mod.rs

Lines 467 to 490 in 9508118

    
               pub fn write_strip(&mut self, value: &[T::Inner]) -> TiffResult<()> 
        
               where 
        
                   [T::Inner]: TiffValue, 
        
               { 
        
                   let samples = self.next_strip_sample_count(); 
        
                   if u64::try_from(value.len())? != samples { 
        
                       return Err(io::Error::new( 
        
                           io::ErrorKind::InvalidData, 
        
                           "Slice is wrong size for strip", 
        
                       ) 
        
                       .into()); 
        
                   } 
        
                   // Write the (possible compressed) data to the encoder. 
        
                   let offset = match self.predictor { 
        
                       Predictor::None => self.encoder.write_data(value)?, 
        
                       Predictor::Horizontal => { 
        
                           let predicted: Vec<T::Inner> = value 
        
                               .chunks(self.row_samples as usize) 
        
                               .flat_map(|row| T::horizontal_predict(row).into_iter()) 
        
                               .collect(); 
        
                           self.encoder.write_data(predicted.as_slice())? 
        
                       } 
        
                       _ => unreachable!(),

So there would be copy at some point or another. I can look into changing that but I wanted to limit my changes.

I can allocate a new buffer of the same size as image but that seems also like a doubling of the memory, or keep allocating one vec per row and write it directly to the encoder.

kornelski · 2024-09-19T10:37:28Z

You're calling collect() there which creates a Vec anyway.
Don't use flat_map + collect(). Make Vec::with_capacity() and append to it

spoutn1k · 2024-09-19T16:46:00Z

[T::Inner] does not have Clone, so if I want to create the Vec outside the predictor function, I will need to add it ...

I pushed a version that does not collect and writes strips as soon as they are predicted. If returning a Vec is still a performance concern, I will update the ColorType trait to do so.

kornelski · 2024-09-20T00:59:47Z

Can you check that the tests actually work?

I've replaced the predictor code with nothing:

Predictor::Horizontal => {
                0
}

and cargo test --all still passes.

kornelski · 2024-09-20T01:22:18Z

This is what I mean: spoutn1k#1

spoutn1k · 2024-09-20T08:25:32Z

You are right I did not implement tests. Will do that. I tested it by compressing images and opening them with image editors.

Avoid per-row allocations

spoutn1k · 2024-09-20T10:23:42Z

Images are corrupted from e00fa6b. Looks like the encoder does not like writing strip by strip.

spoutn1k · 2024-09-21T19:40:48Z

Most of the round-trip tests fail for datatypes bigger than u8. After some testing and manually compressing images it seems the implementation of the "deprediction" in the decoder is the culprit.

spoutn1k · 2024-09-21T20:52:57Z

Fixes #237 and #247 !

kornelski · 2024-09-23T18:45:52Z

Thank you

spoutn1k · 2024-09-23T19:36:32Z

Thank you !

spoutn1k added 5 commits August 3, 2024 20:12

Added predictor for colortype

cbdfc35

Add predictor docs

d3e3554

Remove unnecessary compression generic argument

829d433

Add predictor/compression sanity checks

3ffc54a

Update tests

ef80908

Implement default manually for 1.61 compat

9508118

spoutn1k mentioned this pull request Sep 18, 2024

LZW compression expands files #237

Closed

Avoid collecting a flat_map

e00fa6b

Avoid per-row allocations

25b22b5

Merge pull request #1 from image-rs/spoutn1k-predictor

9a349fe

Avoid per-row allocations

Add predictor tests

c1557c2

spoutn1k force-pushed the predictor branch from 5b3ac29 to c1557c2 Compare September 20, 2024 10:34

spoutn1k added 2 commits September 20, 2024 13:35

Allow useless but valid prediction on uncompressed images

3479224

Write predicted buffer at once

9c3a51b

Fix horizontal prediction decoding

1c03328

spoutn1k mentioned this pull request Sep 21, 2024

Decoding horizontally-predicted images of more than 8bit per channel fails #247

Closed

kornelski approved these changes Sep 23, 2024

View reviewed changes

kornelski merged commit e28ad56 into image-rs:master Sep 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add horizontal predictor support #240

Add horizontal predictor support #240

spoutn1k commented Aug 3, 2024 •

edited

Loading

spoutn1k commented Aug 18, 2024

spoutn1k commented Aug 19, 2024

spoutn1k commented Sep 14, 2024

kornelski commented Sep 18, 2024

spoutn1k commented Sep 19, 2024 •

edited

Loading

kornelski commented Sep 19, 2024

spoutn1k commented Sep 19, 2024 •

edited

Loading

kornelski commented Sep 20, 2024

kornelski commented Sep 20, 2024

spoutn1k commented Sep 20, 2024 •

edited

Loading

spoutn1k commented Sep 20, 2024

spoutn1k commented Sep 21, 2024

spoutn1k commented Sep 21, 2024

kornelski commented Sep 23, 2024

spoutn1k commented Sep 23, 2024

Add horizontal predictor support #240

Add horizontal predictor support #240

Conversation

spoutn1k commented Aug 3, 2024 • edited Loading

spoutn1k commented Aug 18, 2024

spoutn1k commented Aug 19, 2024

spoutn1k commented Sep 14, 2024

kornelski commented Sep 18, 2024

spoutn1k commented Sep 19, 2024 • edited Loading

kornelski commented Sep 19, 2024

spoutn1k commented Sep 19, 2024 • edited Loading

kornelski commented Sep 20, 2024

kornelski commented Sep 20, 2024

spoutn1k commented Sep 20, 2024 • edited Loading

spoutn1k commented Sep 20, 2024

spoutn1k commented Sep 21, 2024

spoutn1k commented Sep 21, 2024

kornelski commented Sep 23, 2024

spoutn1k commented Sep 23, 2024

spoutn1k commented Aug 3, 2024 •

edited

Loading

spoutn1k commented Sep 19, 2024 •

edited

Loading

spoutn1k commented Sep 19, 2024 •

edited

Loading

spoutn1k commented Sep 20, 2024 •

edited

Loading