Definitions... #14
Replies: 2 comments 1 reply
-
My understanding of the original scope of the project was to provide a framework for "automated extraction of metadata" from non-FAIR filetypes. The idea was that you'd at least be able to query a database to answer what quantities (such as 'position', 'time', or 'velocity') are stored in your black-box-files, and ideally getting a way of getting at them. This at least lets you know what data you have and then formulate a way how to calculate 'velocity' from your 'position' and 'time'. This "calculation" step is in my view out of scope of this project. This of course runs into the issue of "what constitutes data/metadata", which is a difficult discussion. Things like "filename" and "author" are clearly metadata, perhaps also matrix/dataset sizes and even date ranges or locations. Things like "units" are debatable. My suggestion was (at least for As "extractor" I currently understand a software that's able to read data from the supported filetypes; and understands what that data is on some level. We tried to avoid the usage of the word "parser", as that implies some level of semantic translation, as opposed to just understanding the topology of the files in their native nomenclature. Providing a framework for exchange of any of this understanding (i.e. a part of the "metadata") would be tremendously useful. Finally, I have recently started thinking of this project as "metadata of extractors" (interoperability of tools) rather than "extractors of metadata" (interoperability of and ontology in filetypes). I don't think we can have one without the other. |
Beta Was this translation helpful? Give feedback.
-
I think an example might make it clearer. So you think it would be: executable spectraIron.xyz -o peaks
# returns: {"x":[1,3,6], "y":[10,14,4]} Or would it be: executable spectraIron.xyz -o neXus
# returns nexus file spectraIron.h5 |
Beta Was this translation helpful? Give feedback.
-
After recapping the 2nd meeting, I noticed that we should better define the word "extractor". Do we mean:
Terms like 'energy' are well defined. I think we should watch out that we understand the same thing for common words that are not as defined.
Beta Was this translation helpful? Give feedback.
All reactions