Commit
Improve CsvReader to pad rows with missing columns with empty columns instead of skipping them.
Behaves similarly to gbif dwca-io, see https://github.com/CatalogueOfLife/data/issues/785
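The padding behaviour described in the commit message can be illustrated with a minimal Python sketch. This is not the CsvReader code from this commit (which is not shown here); the function names `pad_row` and `read_rows` are hypothetical, and taking the expected width from the header line is an assumption made for the example.

```python
from typing import Iterable, List, Tuple


def pad_row(row: List[str], expected_columns: int) -> List[str]:
    """Pad a short row with empty strings; rows that are wide enough pass through."""
    missing = expected_columns - len(row)
    if missing > 0:
        return row + [""] * missing
    return row


def read_rows(lines: Iterable[str], delimiter: str = "\t") -> Tuple[List[str], List[List[str]]]:
    """Split delimited lines and pad short rows instead of skipping them.

    Assumption: the first line is a header and defines the expected width.
    """
    it = iter(lines)
    header = next(it).rstrip("\n").split(delimiter)
    rows = [pad_row(line.rstrip("\n").split(delimiter), len(header)) for line in it]
    return header, rows


header, rows = read_rows([
    "id\tparent\tname\trank\n",
    "1\t\tFungi\n",           # one trailing column missing
    "2\t1\tPodila\tgenus\n",  # complete row
])
```

A reader that skips short rows would drop the first data row here entirely; padding keeps it and leaves the missing trailing value empty.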
Showing 11 changed files with 122 additions and 19 deletions.
@@ -0,0 +1,3 @@
Test dwca data for UNITE with broken parentNameUsageID records,
but for which there is a working acceptedNameUsageID record.
https://github.com/CatalogueOfLife/data/issues/785
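To make the scenario in this note concrete, the following Python sketch finds records whose parentNameUsageID does not resolve while their acceptedNameUsageID does. It assumes that "broken" means the parent ID matches no taxonID in the same file; the function name and the sample IDs (`A` to `D`, `X`, `Y`) are hypothetical and are not taken from the test data.

```python
from typing import Dict, List


def broken_parent_but_accepted(records: List[Dict[str, str]]) -> List[str]:
    """Return taxonIDs with a dangling parent ID but a resolvable accepted ID."""
    known_ids = {r["taxonID"] for r in records}
    hits = []
    for r in records:
        parent = r.get("parentNameUsageID") or ""
        accepted = r.get("acceptedNameUsageID") or ""
        # A parent is treated as broken when it is set but matches no taxonID.
        if parent and parent not in known_ids and accepted in known_ids:
            hits.append(r["taxonID"])
    return hits


# Hypothetical records for illustration only.
records = [
    {"taxonID": "A", "parentNameUsageID": "", "acceptedNameUsageID": ""},
    {"taxonID": "B", "parentNameUsageID": "A", "acceptedNameUsageID": ""},
    {"taxonID": "C", "parentNameUsageID": "X", "acceptedNameUsageID": "B"},
    {"taxonID": "D", "parentNameUsageID": "Y", "acceptedNameUsageID": ""},
]
flagged = broken_parent_but_accepted(records)
```

Only record `C` qualifies in this sample: `D` also has a dangling parent but no accepted record to fall back on.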
@@ -0,0 +1,16 @@
<archive xmlns="http://rs.tdwg.org/dwc/text/" metadata="eml.xml">
  <core encoding="UTF-8" fieldsTerminatedBy="\t" linesTerminatedBy="\n" fieldsEnclosedBy="" ignoreHeaderLines="1" rowType="http://rs.tdwg.org/dwc/terms/Taxon">
    <files>
      <location>taxon.csv</location>
    </files>
    <id index="0" />
    <field index="1" term="http://rs.tdwg.org/dwc/terms/taxonID"/>
    <field index="2" term="http://rs.tdwg.org/dwc/terms/parentNameUsageID"/>
    <field index="3" term="http://rs.tdwg.org/dwc/terms/acceptedNameUsageID"/>
    <field index="4" term="http://rs.tdwg.org/dwc/terms/scientificName"/>
    <field index="5" term="http://rs.tdwg.org/dwc/terms/taxonRank"/>
    <field index="6" term="http://purl.org/dc/terms/references"/>
    <field index="7" term="http://purl.org/dc/terms/taxonRemarks"/>
  </core>
</archive>
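The descriptor above declares eight columns (the id plus fields 1 to 7), which is the width a reader would need in order to know how far to pad a short row. As a rough sketch, and not the parser used in this repository, the column layout can be extracted with Python's standard `xml.etree.ElementTree`; the function name `core_columns` and the choice to key columns by the last path segment of the term URI are assumptions for illustration.

```python
import xml.etree.ElementTree as ET
from typing import Dict, Tuple

# Copy of the meta.xml shown above (raw string keeps \t and \n literal).
META_XML = r"""<archive xmlns="http://rs.tdwg.org/dwc/text/" metadata="eml.xml">
  <core encoding="UTF-8" fieldsTerminatedBy="\t" linesTerminatedBy="\n" fieldsEnclosedBy="" ignoreHeaderLines="1" rowType="http://rs.tdwg.org/dwc/terms/Taxon">
    <files>
      <location>taxon.csv</location>
    </files>
    <id index="0" />
    <field index="1" term="http://rs.tdwg.org/dwc/terms/taxonID"/>
    <field index="2" term="http://rs.tdwg.org/dwc/terms/parentNameUsageID"/>
    <field index="3" term="http://rs.tdwg.org/dwc/terms/acceptedNameUsageID"/>
    <field index="4" term="http://rs.tdwg.org/dwc/terms/scientificName"/>
    <field index="5" term="http://rs.tdwg.org/dwc/terms/taxonRank"/>
    <field index="6" term="http://purl.org/dc/terms/references"/>
    <field index="7" term="http://purl.org/dc/terms/taxonRemarks"/>
  </core>
</archive>"""

NS = {"dwc": "http://rs.tdwg.org/dwc/text/"}


def core_columns(meta_xml: str) -> Tuple[str, Dict[int, str]]:
    """Return the core data file location and a column index -> short term name map."""
    core = ET.fromstring(meta_xml).find("dwc:core", NS)
    columns: Dict[int, str] = {}
    id_el = core.find("dwc:id", NS)
    if id_el is not None:
        columns[int(id_el.get("index"))] = "id"
    for field in core.findall("dwc:field", NS):
        index = field.get("index")
        if index is None:
            continue  # field without a column position
        # Keep only the last path segment of the term URI as a short name.
        columns[int(index)] = field.get("term").rsplit("/", 1)[-1]
    location = core.find("dwc:files/dwc:location", NS).text
    return location, columns


location, columns = core_columns(META_XML)
expected_width = max(columns) + 1
```

Here `expected_width` is 8, so any data row with fewer than eight values would be a candidate for padding.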
@@ -0,0 +1,12 @@
id taxonID parentNameUsageID acceptedNameUsageID scientificName taxonRank references taxonRemarks
SH0887892.09FU SH0887892.09FU 763571 SH0879619.10FU SH0887892.09FU species hypothesis dx.doi.org/10.15156/BIO/SH0887892.09FU 1.5%
SH0879619.10FU SH0879619.10FU SH0097986.10FU SH0879619.10FU species hypothesis dx.doi.org/10.15156/BIO/SH0879619.10FU 1.5%
SH0097986.10FU SH0097986.10FU 763571 SH0097986.10FU species hypothesis dx.doi.org/10.15156/BIO/SH0097986.10FU 3.0%
SH0763571.10FU SH0763571.10FU SH0023252.10FU SH0763571.10FU species hypothesis dx.doi.org/10.15156/BIO/SH0763571.10FU 1.5%
763571 763571 760612 Podila humilis (Linnem. ex W. Gams) Vandepol & Bonito Species
760612 760612 119792 Podila Stajich, Vandepol & Bonito Genus
119792 119792 119237 Mortierellaceae Luerss. Family
119237 119237 672213 Mortierellales Caval.-Sm. Order
672213 672213 671028 Mortierellomycetes Doweld Class
671028 671028 118934 Mortierellomycota Tedersoo, Sánchez-Ramírez, Kõljalg, Bahram, Döring, Schigel, T. May, M. Ryberg & Abarenkov Phylum
118934 118934 Fungi R.T. Moore Kingdom
@@ -0,0 +1,3 @@
Test dwca data for UNITE with broken parentNameUsageID records,
but for which there is a working acceptedNameUsageID record.
https://github.com/CatalogueOfLife/data/issues/785
@@ -0,0 +1,16 @@
<archive xmlns="http://rs.tdwg.org/dwc/text/" metadata="eml.xml">
  <core encoding="UTF-8" fieldsTerminatedBy="\t" linesTerminatedBy="\n" fieldsEnclosedBy="" ignoreHeaderLines="1" rowType="http://rs.tdwg.org/dwc/terms/Taxon">
    <files>
      <location>taxon.csv</location>
    </files>
    <id index="0" />
    <field index="1" term="http://rs.tdwg.org/dwc/terms/taxonID"/>
    <field index="2" term="http://rs.tdwg.org/dwc/terms/parentNameUsageID"/>
    <field index="3" term="http://rs.tdwg.org/dwc/terms/acceptedNameUsageID"/>
    <field index="4" term="http://rs.tdwg.org/dwc/terms/scientificName"/>
    <field index="5" term="http://rs.tdwg.org/dwc/terms/taxonRank"/>
    <field index="6" term="http://purl.org/dc/terms/references"/>
    <field index="7" term="http://purl.org/dc/terms/taxonRemarks"/>
  </core>
</archive>
@@ -0,0 +1,12 @@
id taxonID parentNameUsageID acceptedNameUsageID scientificName taxonRank references taxonRemarks
SH0887892.09FU SH0887892.09FU 763571 SH0879619.10FU SH0887892.09FU species hypothesis dx.doi.org/10.15156/BIO/SH0887892.09FU 1.5%
SH0879619.10FU SH0879619.10FU SH0097986.10FU SH0879619.10FU species hypothesis dx.doi.org/10.15156/BIO/SH0879619.10FU 1.5%
SH0097986.10FU SH0097986.10FU 763571 SH0097986.10FU species hypothesis dx.doi.org/10.15156/BIO/SH0097986.10FU 3.0%
SH0763571.10FU SH0763571.10FU SH0023252.10FU SH0763571.10FU species hypothesis dx.doi.org/10.15156/BIO/SH0763571.10FU 1.5%
763571 763571 760612 Podila humilis (Linnem. ex W. Gams) Vandepol & Bonito Species
760612 760612 119792 Podila Stajich, Vandepol & Bonito Genus
119792 119792 119237 Mortierellaceae Luerss. Family
119237 119237 672213 Mortierellales Caval.-Sm. Order
672213 672213 671028 Mortierellomycetes Doweld Class
671028 671028 118934 Mortierellomycota Tedersoo, Sánchez-Ramírez, Kõljalg, Bahram, Döring, Schigel, T. May, M. Ryberg & Abarenkov Phylum
118934 118934 Fungi R.T. Moore Kingdom