-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Packaging and compression of files in a distribution #746
Conversation
Why is the range dct:MediaTypeOrExtent instead of dct:MediaType? |
I still think that 2 or 4 properties would be cleaner than just the one.
|
I am happy to consider the more complex solution with multiple properties, if that is what the group wants. In response to @jakubklimek I would say that 'cleaner' is not the main objective here -- I would say that a more complex solution should be chosen if it is essential and a simpler solution does not work. |
@smrgeoinfo I made the range |
I was assuming that for a packaged distribution, one could use dct:conformsTo to specify the packaging convention (e.g. bagit, OASIS OpenDocument Package Format...). In those schemes there is a manifest and metadata document in the package that details the internal structure and file formats. |
@smrgeoinfo That may apply to distributions complying with the package formats you mention. However, we could have a more loose package. If we had for instance a tar or zip file with 100 CSV files, one per day, all with the same schema, this schema (CSV on the Web JSON-LD descriptor) would go to |
@makxdekkers You are right about the simple cases. However, I would say that double compression and similar techniques are often enough just bad practice. I often see compressed single files and packaged (and compressed) sets of files with the same schema where it makes sense (splitting larger files into smaller ones or just reducing size of file on disk). However, when I come across a zip file containing many other zip files, it is often because the publisher does not know what they are doing. The question seems to be where do we draw the line. |
Can I ask the members of the WG to vote for one of the two alternatives:
Thanks. |
@makxdekkers How does the vote take place? My suggestion would be one comment for each of the proposals (single property vs two properties) and then use thumbs-up for the vote. |
Proposal 1: use a single property, |
Proposal 2: use two properties, e.g. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- These new properties also need to be expressed in the RDF file (i.e. dcat.ttl)
- Also add an entry in the change-log (Annex E)
# Conflicts: # dcat/index.html
Added the compressFormat and packageFormat properties in the text and in the TTL
@makxdekkers I merged the other branch into this PR @dr-shorthair if you could have a final check and merge, we resolved this on the call today: https://www.w3.org/2019/02/20-dxwgdcat-minutes.html#x13 |
the remaining work on this is to add the examples |
Merging for now, noting that instance examples also needed. |
Hmmm. This merge was probably premature. I don't think we have a decision recorded to support it. My bad. And not competent to unwind ... |
We did decide about this on the call yesterday: https://www.w3.org/2019/02/20-dxwgdcat-minutes.html#x13 |
Phew. It was resolved after I left the meeting and before I read the minutes :-) No reverts needed. |
I added text for the packaging/compression of files in the distribution. I simplified the proposal by only adding one property:
dcat:wrapFormat
.I could have created four properties
dcat:compressionFormat
,dcat:compressionMediaType
,dcat:packagingFormat
anddcat:packagingMediaType
, but I thought that would make the solution much too complex. I would think that it would be fairly trivial for the system to decompress or unpack using a single property, and the duplication between mediaType and format seemed unnecessary.