Skip to content

Commit

Permalink
Fix checksumming.md (#35)
Browse files Browse the repository at this point in the history
  • Loading branch information
jonahgao authored Jan 14, 2024
1 parent d013429 commit 8f9954c
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion content/en/docs/File Format/Data Pages/checksumming.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,4 +3,4 @@ title: "Checksumming"
linkTitle: "Checksumming"
weight: 7
---
Column chunks are composed of pages written back to back. The pages share a common header and readers can skip over page they are not interested in. The data for the page follows the header and can be compressed and/or encoded. The compression and encoding is specified in the page metadata.
Pages of all kinds can be individually checksummed. This allows disabling of checksums at the HDFS file level, to better support single row lookups. Checksums are calculated using the standard CRC32 algorithm - as used in e.g. GZip - on the serialized binary representation of a page (not including the page header itself).

0 comments on commit 8f9954c

Please sign in to comment.