EML.XML validation not supported or broken? #98
Comments
The first issue could be due to changes between EML versions, where license has been changed to licensed. Perhaps the validation will work as a DwC archive with only EML, but not as an XML file? |
What's the EML version? We support version 1.0, 1.0.1, 1.0.2 and 1.1 at present, and have a longer term task to update to the newer versions. (The version is probably in the first few lines of the EML). |
eml-2.1.1. Then I suppose the first issue is not fixed by changing license to licensed either, which was introduced with version 2. |
@tmcelrath I'm trying to figure out the best way to proceed on this. I could help with building an EML file using the IPT that you could work with as your starting point instead? Would that be helpful? Also note that when I tried to load your eml.xml file into XMLNotepad it gave this error: |
Idea, can you send me a properly formatted EML file that has been built by the IPT? That way I can compare with what I have. We used the same text on our last upload in 2015 so I don't have anything to compare it to. |
@tmcelrath I will resolve this: SpeciesFileGroup/taxonworks#2986, and try to validate from there. |
^ Please attach the very latest version of the EML that is supported to this issue if possible. |
Now I am confused. Here is EML from a very recent ALA file upload to GBIF. It appears to use 2.1. Perhaps our issue is something else.
<?xml version="1.0" encoding="utf-8"?>
<eml:eml xmlns:d="eml://ecoinformatics.org/dataset-2.1.0" xmlns:eml="eml://ecoinformatics.org/eml-2.1.1" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/terms/" xsi:schemaLocation="eml://ecoinformatics.org/eml-2.1.1 http://rs.gbif.org/schema/eml-gbif-profile/1.1/eml-gbif-profile.xsd" system="ALA-Registry" scope="system" xml:lang="en">
<dataset>
... |
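For anyone comparing files, the version information discussed here can be read off the root element. The following is a minimal sketch, not part of the validator: it assumes the EML namespace on the root tag and the `xsi:schemaLocation` attribute carry the versions, as in the ALA header quoted in this thread. The function name `describe_eml_root` and the shortened `SAMPLE` document are made up for illustration.

```python
import xml.etree.ElementTree as ET

XSI = "http://www.w3.org/2001/XMLSchema-instance"

# Hypothetical, shortened document modelled on the ALA header quoted in this thread.
SAMPLE = (
    '<eml:eml xmlns:eml="eml://ecoinformatics.org/eml-2.1.1" '
    'xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" '
    'xsi:schemaLocation="eml://ecoinformatics.org/eml-2.1.1 '
    'http://rs.gbif.org/schema/eml-gbif-profile/1.1/eml-gbif-profile.xsd" '
    'system="ALA-Registry" scope="system" xml:lang="en">'
    "<dataset/></eml:eml>"
)

def describe_eml_root(xml_text):
    """Return the root element's namespace and its schemaLocation entries."""
    root = ET.fromstring(xml_text)
    # ElementTree writes a qualified tag as "{namespace}localname".
    if root.tag.startswith("{"):
        namespace, local = root.tag[1:].split("}", 1)
    else:
        namespace, local = "", root.tag
    # schemaLocation is a whitespace-separated list: namespace, then schema URL.
    schema_location = root.get("{%s}schemaLocation" % XSI, "").split()
    return {"root": local, "namespace": namespace, "schema_location": schema_location}

info = describe_eml_root(SAMPLE)
print(info["namespace"])        # EML namespace, which includes the EML version
print(info["schema_location"])  # second entry points at the GBIF profile schema
```

Running this on two files side by side shows whether they declare the same EML namespace and the same GBIF profile schema.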
@tmcelrath I believe the problem is the file you supplied is a frankenstein of old and new, for example the section you merged re |
I think I found our main problem:
The validator is likely running the second. We'll update our generator to use that and go from there. |
Yes, |
@mjy and @tmcelrath were you able to solve this issue? |
Yes, we were. @mjy what exactly was the solution in the XML stuff at the top of the file? |
A GBIF publisher is experiencing issues with validation of his eml.xml file in the data validator.
Data validation result here:
Regardless, it was able to parse out three issues with the .eml file
1. "The licence can not be parsed, is not supported by GBIF or is simply missing"
<eml:license>
<eml:licenseName>CC-BY 4.0</eml:licenseName>
<eml:url>https://creativecommons.org/licenses/by/4.0/</eml:url>
</eml:license>
Is there something wrong with that?
2. I also get "The description of the dataset is missing or too short", which I don't understand.
and lastly, for the EML file:
3. "The EML document does not validate against the schema"
ERROR
cvc-complex-type.3.2.2: Attribute 'core_scope' is not allowed to appear in element 'eml:eml'.
cvc-complex-type.4: Attribute 'scope' must appear on element 'eml:eml'.
cvc-complex-type.2.4.a: Invalid content was found starting with element '{"eml://ecoinformatics.org/eml-2.1.1":dataset}'. One of '{dataset}' is expected.
Not sure what any of that means. I built this EML file from our old EML file that worked and a few other places.
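The three schema errors above can be read as: the root element carries an attribute named core_scope that the schema does not know, it lacks the scope attribute, and the dataset child is written with the eml: namespace prefix where the schema expects an unprefixed dataset. The following is a rough sketch of a pre-check for just those three conditions. It is not a schema validation and not how the GBIF validator works; the function name `check_eml_root` and both sample documents are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = "eml://ecoinformatics.org/eml-2.1.1"

# Hypothetical documents: BAD reproduces the three reported conditions, GOOD avoids them.
BAD = '<eml:eml xmlns:eml="%s" core_scope="system"><eml:dataset/></eml:eml>' % NS
GOOD = '<eml:eml xmlns:eml="%s" scope="system"><dataset/></eml:eml>' % NS

def check_eml_root(xml_text):
    """Flag the three root-level conditions named in the schema errors above."""
    root = ET.fromstring(xml_text)
    problems = []
    if "core_scope" in root.attrib:
        problems.append("attribute 'core_scope' is present on the root element")
    if "scope" not in root.attrib:
        problems.append("attribute 'scope' is missing from the root element")
    for child in root:
        # A qualified child shows up as "{namespace}dataset" rather than "dataset".
        if child.tag.startswith("{") and child.tag.rsplit("}", 1)[-1] == "dataset":
            problems.append("'dataset' is namespace-qualified; an unqualified 'dataset' is expected")
    return problems

for label, doc in (("BAD", BAD), ("GOOD", GOOD)):
    print(label, check_eml_root(doc))
```

A file that passes this check can still fail full validation against the schema; the check only covers the conditions listed in the error output.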