Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Map csvw:TableSchema and RML #9

Open
VladimirAlexiev opened this issue Oct 4, 2024 · 3 comments
Open

Map csvw:TableSchema and RML #9

VladimirAlexiev opened this issue Oct 4, 2024 · 3 comments
Labels
question Further information is requested

Comments

@VladimirAlexiev
Copy link

VladimirAlexiev commented Oct 4, 2024

https://kg-construct.github.io/rml-io/spec/docs/#logical-source-in-rml uses CSVW but only to describe the location and dialect of a Table.
In a dataspace scenario we use TableSchema to describe some csv, to then provision ingest to a timeseries database, while enabling discovery and other semantic integration approaches.
CSVW is a nice focused spec dedicated to CSV semantic description, so I think it's worth examining it, mapping it to RML, and maybe finding some features to add to RML.

The CSVW Manifest is JSON (also JSON-LD) and it has one "defect": it maps columns not to RDF props but to URL templates. So maybe we should retool to use RML instead of CSVW.

@VladimirAlexiev VladimirAlexiev changed the title Map csvw:TableSchema Map csvw:TableSchema and RML Oct 4, 2024
@DylanVanAssche
Copy link
Contributor

Hi!

It is certainly possible to use TableSchema as data access description. RML-IO is made in such a way that you can pick your data access descriptions. However, we don't have to describe them all in the specification.

@DylanVanAssche DylanVanAssche added the question Further information is requested label Dec 5, 2024
@VladimirAlexiev
Copy link
Author

@DylanVanAssche But isn't it useful to define a CSVW "reference formulation"?

@DylanVanAssche
Copy link
Contributor

@VladimirAlexiev That's indeed the plan: defining a CSVW reference formulation, but in the RML IO Registry.
The idea is to keep a simple set of reference formulations in RML Core, RML IO like JSON, CSV, XML, RDB. Any other reference formulation is also welcome, but defined in the RML IO Registry to avoid that processors must implement 20 reference formulations to comply with the RML IO specification (which will never happen of course as it becomes so complex).

@DylanVanAssche DylanVanAssche transferred this issue from kg-construct/rml-io Dec 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants