Fast and efficient content-defined chunking for data deduplication. Java implementation of FastCDC as library.
-
Updated
Sep 21, 2023 - Java
Fast and efficient content-defined chunking for data deduplication. Java implementation of FastCDC as library.
Fast content-defined chunking in Go.
DedupBench is a benchmarking tool for data chunking techniques used in data deduplication. DedupBench is designed for extensibility, allowing new chunking techniques to be implemented with minimal additional code.
CDCZip is a deduplication filter meant for use as long range match finder preprocessor for use alongside other compression algorithms.
Find (partial content) duplicate files.
Print FastCDC rolling hash chunks and checksums.
Add a description, image, and links to the content-defined-chunking topic page so that developers can more easily learn about it.
To associate your repository with the content-defined-chunking topic, visit your repo's landing page and select "manage topics."