Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extend the DCAT to CKAN mapping #31

Open
1 of 2 tasks
TBoonX opened this issue Apr 13, 2021 · 10 comments
Open
1 of 2 tasks

Extend the DCAT to CKAN mapping #31

TBoonX opened this issue Apr 13, 2021 · 10 comments
Assignees
Labels
WP4.2 Entwicklung und Test der DCAT-basierten Synchronisation und Aggregation

Comments

@TBoonX
Copy link
Member

TBoonX commented Apr 13, 2021

We need to extend and change the mapping because we need more metadata and some are wrong.

  • The urls provided in uri and guid could not be opened (ERR_INVALID_SIGNED_EXCHANGE) - should be the DCAT predicate dcat:landingPage. The value should be an URI to the NOMAD Archive in which a human could have a look on the datesets. Atm the dataset URI is used by CKAN instead.
  • Support of DOI: Datasets should be connected to DOI, minimum via http://purl.org/dc/terms/identifier at dcat:Dataset as an URI or literal
@TBoonX
Copy link
Member Author

TBoonX commented Apr 27, 2021

I started to work on a custom profile, but I was not able yet to enable it. I have no clue where to continue and will come to this task when I have one.

@TBoonX
Copy link
Member Author

TBoonX commented May 5, 2021

I found the issue. I will continue with this task

@TBoonX
Copy link
Member Author

TBoonX commented May 7, 2021

The DCAT Extension is using the api differently from what we thought, thus either the endpoints have to be changed or part of the extension has to be rewritten.
(only the /catalog route is used)

@TBoonX
Copy link
Member Author

TBoonX commented May 19, 2021

FHI did adopt the NOMAD DCAT API and I did adopt the code. The harvest is good enough for now. I will redo the harvest on the server when the NOMAD API gets its new change.

@TBoonX TBoonX added the WP4.2 Entwicklung und Test der DCAT-basierten Synchronisation und Aggregation label Jun 29, 2021
@TBoonX
Copy link
Member Author

TBoonX commented Feb 3, 2022

@markus1978 is DOI (see description of this issue) included in the RDF return? If yes, how? and if not, do you have the time to work on it in the next month or so?
Thanks.

@markus1978
Copy link

No, currently I don't see any DOI in our mapping to DCAT

@TBoonX
Copy link
Member Author

TBoonX commented Feb 15, 2022

No, currently I don't see any DOI in our mapping to DCAT

Thanks for the info. I have an issue about verifying that DOIs are also read from the DCAT interface: #36 There is also a list of datasets which have DOIs.
@markus1978 I had the impression that this is quite important, thus please discuss this with Carsten or so. If the DOI is also in the DCAT interface, then the change on my side will be quite fast.

@yoavnash is my impression correct, that DSMS will not provide DOIs in the lifetime of this project? If yes then you could think about at least supporting it via your interface in order to report this at least.

@yoavnash
Copy link
Member

Matthias suggests that we do it via TIB. I will send an email to Tatyana asking her if it's possible.

@markus1978
Copy link

I added "something" to the nomad dcat API.

I could not use identifier on our dcat:Dataset objects. our dcat:Dataset objects are calculations. Calculations do not get DOIs on nomad. Our users curate many calculations into larger nomad datasets. Those have a DOI. Basically a DOI for a nomad calculation (or dcat:Dataset) is not unique. I added dcat:Distribution to those calculations that have a DOI (via a nomad dataset). The distribution is representing the nomad dataset and is using the respective DOI as an identifier. Note that only the minority of nomad calculations are part of a nomad dataset with DOI. Here is one example: https://nomad-lab.eu/prod/v1/dcat/datasets/zzZhsOkL-rbZHLOLUZswRPqEp-Uw?format=turtle

Also note the different api prefix. This and all future changes will only be available for nomad v1 (https://nomad-lab.eu/prod/v1). The old version (https://nomad-lab.eu/prod/rae) is still running though and will probably be running for a few more months. At some point it will be replaced by a simple redirect (301) towards the new URL.

@TBoonX
Copy link
Member Author

TBoonX commented Apr 19, 2022

@markus1978 Thank you for the information and the code change!
I will now use the new API. The DOI is already shown in our CKAN instance.
I have a change request for the distributions:

  • Use a better title of the DOI distribution (the title should indicate that this distribution is about the parent datasets DOI)
  • Use also a media typ (dcat:mediaType) for DOI - text/html would fit if you link to a website (for DOI there is no data type)
  • Use http://purl.org/dc/terms/description if the title gets too long or some information is not expressed via the other triples

If you want to see how it looks like in CKAN: https://stream-dataspace.net/dataset/brtati2 (note: old NOMAD DCAT API used )

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
WP4.2 Entwicklung und Test der DCAT-basierten Synchronisation und Aggregation
Projects
None yet
Development

No branches or pull requests

3 participants