You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Aug 26, 2022. It is now read-only.
Jan sent the ALA a report from a validator (this one I think), which has the majority of the datasets being rejected with "BASIS_OF_RECORD_INVALID". Given that the Darwin Core spec doesn't give an exhaustive list of valid basisOfRecord values, I had to look through the codebase here to find a link to the list that may be the cause of the validation failures, which I had not seen before:
In the basisOfRecord case, the ALA has been offering GBIF verbatim data from data providers. The majority of them do not include basisOfRecord themselves, so it is understandable in this case that if GBIF requires it to be present that many of the ALA datasets won't be suitable in their current form. (Leaving a discussion of whether ALA offers its processed data to GBIF instead, for another day)
It would be ideal if there was a web page somewhere (the wiki here could work) that provides descriptions of the reasons for failures, including links to any controlled vocabularies that this validator is using in addition to the Darwin Core spec.
The text was updated successfully, but these errors were encountered:
After Jan sent ALA the report from the validator last week, I added the description for "BASIS_OF_RECORD_INVALID" referencing the vocabulary http://rs.gbif.org/vocabulary/dwc/basis_of_record.xml which complies with the Darwin Core Type Vocabulary. @cgendreau tells me the GBIF Data Validator actually uses the vocabulary from the GBIF API, however, be aware this vocabulary includes non-accepted values that shouldn't be used - see gbif/gbif-api#14
Hi @kbraak, Thanks, evaluation_types.md is what I was looking for.
My analysis of the basisOfRecord issue so far is that we have typically added basisOfRecord using default values, but those default values don't modify the "basisOfRecord" field that we send to GBIF. Instead they are stored in a "processed" field, next to basisOfRecord, called "basisOfRecord.p". I have filed an issue on biocache-store to track that part: AtlasOfLivingAustralia/biocache-store#212
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Jan sent the ALA a report from a validator (this one I think), which has the majority of the datasets being rejected with "BASIS_OF_RECORD_INVALID". Given that the Darwin Core spec doesn't give an exhaustive list of valid basisOfRecord values, I had to look through the codebase here to find a link to the list that may be the cause of the validation failures, which I had not seen before:
http://rs.gbif.org/vocabulary/dwc/basis_of_record.xml
In the basisOfRecord case, the ALA has been offering GBIF verbatim data from data providers. The majority of them do not include basisOfRecord themselves, so it is understandable in this case that if GBIF requires it to be present that many of the ALA datasets won't be suitable in their current form. (Leaving a discussion of whether ALA offers its processed data to GBIF instead, for another day)
It would be ideal if there was a web page somewhere (the wiki here could work) that provides descriptions of the reasons for failures, including links to any controlled vocabularies that this validator is using in addition to the Darwin Core spec.
The text was updated successfully, but these errors were encountered: