Releases: maxharlow/csvmatch
v1.16
- Intelligently handle files with Windows-style newline characters
- Remove the requirement to install with sudo
v1.15
- Speed up automatic character encoding detection in large files
- Show progress bar (except for Bilenko fuzzy matches)
v1.14
- Fixes bug whereby columns with the same name from two different files would be coalesced
- Adds Jaro-Winkler fuzzy matching algorithm
v1.13
- Adds argument to specify fuzzy match threshold (currently only used by Levenshtein)
- Adds argument to show which version of CSV Match is running(!)
- Splits out functionality from command line interface to allow testing
- A suite of tests
- Readme improvements
v1.12
- Renames --strip-nonalpha to --ignore-nonalpha
- Adds argument to convert non-Latin characters to their Latin equivalents when comparing
- Adds option to include the matching degree in the output
- Increases the default threshold for a match when using Levenshtein from 40% to 60%
- Moves to a better Metaphone implementation
- Force Unix-style line endings on all platforms
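
To illustrate what raising the Levenshtein threshold from 40% to 60% means in practice, here is a minimal sketch. It assumes similarity is computed as one minus the edit distance divided by the longer string's length; that normalisation, and the names `levenshtein`, `similarity`, and `is_match`, are assumptions for illustration and not taken from CSV Match itself.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # deletion
                cur[j - 1] + 1,            # insertion
                prev[j - 1] + (ca != cb),  # substitution (free if equal)
            ))
        prev = cur
    return prev[-1]


def similarity(a, b):
    """Assumed normalisation: 1.0 means identical, 0.0 means nothing shared."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1 - levenshtein(a, b) / longest


def is_match(a, b, threshold=0.6):
    """Hypothetical helper: accept a pair when similarity meets the threshold."""
    return similarity(a, b) >= threshold
```

Under this assumed scoring, "kitten" and "sitting" are three edits apart over seven characters, a similarity of about 57%. That pair would pass a 40% threshold but fail a 60% one, so the higher default trades some recall for fewer false matches.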
v1.11
- Adds output header shortcuts
- Fixes Levenshtein failing on Unicode characters
- Adds argument to allow file encodings to be specified
- Improved documentation
v1.10
- Adds argument to specify which columns are included in the output
v1.9
- Adds argument to specify the join type: defaults to an inner join, with options for left outer, right outer, and full outer
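
The four join types differ only in which unmatched rows are kept. The sketch below shows this on exact key matches; the function `join_rows` and its `how` values are hypothetical names chosen for illustration and do not reflect CSV Match's own code or argument spelling.

```python
def join_rows(left, right, key, how="inner"):
    """Join two lists of dicts on `key`.

    `how` is one of "inner", "left", "right", "full" (illustrative names).
    Returns (left_row, right_row) pairs; None marks a missing side.
    """
    matched_right = set()
    pairs = []
    for l_row in left:
        hits = [i for i, r_row in enumerate(right) if r_row[key] == l_row[key]]
        for i in hits:
            matched_right.add(i)
            pairs.append((l_row, right[i]))
        # Left and full outer joins keep left rows that matched nothing.
        if not hits and how in ("left", "full"):
            pairs.append((l_row, None))
    # Right and full outer joins keep right rows that matched nothing.
    if how in ("right", "full"):
        for i, r_row in enumerate(right):
            if i not in matched_right:
                pairs.append((None, r_row))
    return pairs
```

With a left file containing "a" and "b" and a right file containing "b" and "c", an inner join yields only the "b" pair, a left outer join also keeps "a", a right outer join also keeps "c", and a full outer join keeps all three.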
v1.8
- Fixes titles filter
- Adds more titles
v1.7
- Adds an argument to filter out terms from a given list of regular expressions
- Adds an argument to filter out common name titles (Mr, Ms, etc)
- Improved documentation