Releases: maxharlow/csvmatch

v1.16

22 Jun 20:31
  • Intelligently handle files with Windows-style newline characters
  • Remove the requirement to install with sudo

v1.15

19 Apr 22:13
  • Speed up automatic character encoding detection in large files
  • Show progress bar (except for Bilenko fuzzy matches)

v1.14

11 Apr 16:52
  • Fixes a bug where columns with the same name in two different files were coalesced
  • Adds Jaro-Winkler fuzzy matching algorithm

v1.13

03 Dec 21:38
  • Adds argument to specify fuzzy match threshold (currently only used by Levenshtein)
  • Adds argument to show which version of CSV Match is running(!)
  • Splits out functionality from command line interface to allow testing
  • Adds a suite of tests
  • Improves the readme

v1.12

28 Apr 21:41
  • Renames --strip-nonalpha to --ignore-nonalpha
  • Adds argument to convert non-Latin characters to their Latin equivalents when comparing
  • Adds option to include the matching degree in the output
  • Increases the default threshold for a match when using Levenshtein from 40% to 60%
  • Moves to a better Metaphone implementation
  • Forces Unix-style line endings on all platforms
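
As a rough illustration of what a percentage threshold means for Levenshtein matching, here is a minimal Python sketch. The `similarity` normalisation (one minus the edit distance divided by the length of the longer string) and all function names are assumptions made for this example; CSV Match may compute its matching degree differently.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions, and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Assumed normalisation: 1.0 for identical strings, 0.0 for nothing shared."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1 - levenshtein(a, b) / longest


def is_match(a: str, b: str, threshold: float = 0.6) -> bool:
    """Treat two values as matching when similarity reaches the threshold."""
    return similarity(a, b) >= threshold
```

Under this assumed normalisation, "kitten" and "sitting" have an edit distance of 3 and a similarity of about 0.57, so they would count as a match at a 40% threshold but not at 60%.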

v1.11

18 Apr 10:55
  • Adds output header shortcuts
  • Fixes Levenshtein failing on Unicode characters
  • Adds argument to allow file encodings to be specified
  • Improved documentation

v1.10

05 Mar 22:08
  • Adds argument to specify which columns are included in the output

v1.9

01 Dec 22:31
  • Adds argument to specify the join type: defaults to an inner join, with options for left outer, right outer, and full outer
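
The four join types differ only in what happens to rows that have no match. The following Python sketch shows the idea on lists of dictionaries; the `join` function, its `how` values, and the exact-equality matching are illustrative assumptions, not CSV Match's own interface.

```python
def join(left, right, key, how="inner"):
    """Pair up rows from two lists of dicts that share the same value for `key`.

    `how` is one of 'inner', 'left-outer', 'right-outer', 'full-outer'
    (hypothetical names). Unmatched rows are paired with None.
    """
    matched_right = set()
    pairs = []
    for left_row in left:
        partners = [i for i, right_row in enumerate(right)
                    if right_row[key] == left_row[key]]
        for i in partners:
            pairs.append((left_row, right[i]))
            matched_right.add(i)
        # Outer joins on the left side keep left rows with no partner.
        if not partners and how in ("left-outer", "full-outer"):
            pairs.append((left_row, None))
    # Outer joins on the right side keep right rows that were never matched.
    if how in ("right-outer", "full-outer"):
        for i, right_row in enumerate(right):
            if i not in matched_right:
                pairs.append((None, right_row))
    return pairs
```

With a left file containing names "a" and "b" and a right file containing "b" and "c", an inner join yields one pair, the left and right outer joins yield two each, and the full outer join yields three.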

v1.8

01 Dec 22:27
  • Fixes titles filter
  • Adds more titles

v1.7

01 Dec 22:01
  • Adds an argument to filter out terms from a given list of regular expressions
  • Adds an argument to filter out common name titles (Mr, Ms, etc.)
  • Improved documentation
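
Filtering terms before comparison can be pictured as removing every regular expression match from a value and then tidying the whitespace. The sketch below is an assumption-laden illustration: `TITLE_PATTERNS` is a hypothetical list, not the set of titles CSV Match ships with, and `filter_terms` is not part of its interface.

```python
import re

# Hypothetical patterns for a few common titles, each with an optional full stop.
TITLE_PATTERNS = [r"\bmr\b\.?", r"\bms\b\.?", r"\bmrs\b\.?", r"\bdr\b\.?"]


def filter_terms(value: str, patterns: list[str]) -> str:
    """Remove every match of each pattern, ignoring case, then collapse spaces."""
    for pattern in patterns:
        value = re.sub(pattern, "", value, flags=re.IGNORECASE)
    return " ".join(value.split())
```

For example, `filter_terms("Mr. John Smith", TITLE_PATTERNS)` gives `"John Smith"`, so that value could then be compared against a plain "John Smith" in the other file.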