Corruption if input is not a size multiple of the max_source_symbols #5

ThomasHabets · 2023-11-25T19:11:41Z

Sorry I can't share code (out of my control), but here's a description. Hopefully reproducible from my description. I'm working on getting the code approved for release.

Versions

raptor-code 1.0.5
rust: 1.66.0 (this was actually found a while ago, but I filed the bug with the wrong project)
Linux: Ubuntu 22.04.2 LTS

What I was doing

I took the example code:

let source_data: Vec<u8> = vec![1,2,3,4,5,6,7,8,9,10,11,12];
let max_source_symbols = 4;
let nb_repair = 3;

let mut encoder = raptor_code::SourceBlockEncoder::new(&source_data, max_source_symbols);
let n = encoder.nb_source_symbols() + nb_repair;

for esi in 0..n as u32 {
    let encoding_symbol = encoder.fountain(esi);
    //TODO transfer symbol over Network
    // network_push_pkt(encoding_symbol);
}

And I set the source_data to be 3684 bytes (a random example file), and max_source_symbols to 19 (chosen in order to get ~200 byte chunks, which is a requirement to me).

This produced a bunch of 194 byte chunks.

When decoding (I took example from the same place):

let encoding_symbol_length = 194;
let source_block_size = 19; // Number of source symbols in the source block
let mut n = 0u32;
let mut decoder = raptor_code::SourceBlockDecoder::new(source_block_size);

while decoder.fully_specified() == false {
    //TODO replace the following line with pkt received from network
    let (encoding_symbol, esi) = (vec![0; encoding_symbol_length],n);
    decoder.push_encoding_symbol(&encoding_symbol, esi);
    n += 1;
}

let source_block_size = encoding_symbol_length  * source_block_size;
let source_block = decoder.decode(source_block_size as usize);

I set encoding_symbol_length to 194 and source_block_size = 19 (per above), and ran it. It almost works perfectly. First of all of course the file is two bytes too big. I expected this, since 19*194 is 3686. I expected two null bytes at the end, though, which would be truncatable. But what actually happens is that the two extra null bytes are one each at the end of the last two blocks. That is, one at position 3686, and one at position 3492 (194 bytes earlier).

This seems like a bug to me. Surely giving non-multiple input should still produce the correct output?

Workaround

I successfully worked around this by padding the input itself to 3686 bytes. Not sure if it needs to be a multiple of 194 or 19. After doing that a simple truncation to 3684 produces perfect output.

The text was updated successfully, but these errors were encountered:

ypo · 2023-11-26T08:49:16Z

Thanks for reporting this.

I have updated the example in the readme, the variable source_block_size was used for 2 different purposes, so I have modified that.

The source_block_length parameter should actually be the total size of your block (3684 bytes)

let source_block = decoder.decode(3684);

I have added a new automatic test with your use-case. See

raptor/tests/raptor.rs

Line 152 in dc1bfa2

pub fn test_onthefly_3k_repair3_loss10_issue5() {

ypo added a commit that referenced this issue Nov 26, 2023

Issue #5 : improve example, add on-the-fly test

e7959f9

ypo added a commit that referenced this issue Nov 26, 2023

Issue #5 : update readme

dc1bfa2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Corruption if input is not a size multiple of the max_source_symbols #5

Corruption if input is not a size multiple of the max_source_symbols #5

ThomasHabets commented Nov 25, 2023 •

edited

Loading

ypo commented Nov 26, 2023

Corruption if input is not a size multiple of the max_source_symbols #5

Corruption if input is not a size multiple of the max_source_symbols #5

Comments

ThomasHabets commented Nov 25, 2023 • edited Loading

Versions

What I was doing

Workaround

ypo commented Nov 26, 2023

ThomasHabets commented Nov 25, 2023 •

edited

Loading