From f89540a943a40174a014700925e980e6eadbe9af Mon Sep 17 00:00:00 2001 From: Florian Wetzels <36967183+floWetzels@users.noreply.github.com> Date: Mon, 28 Oct 2024 16:46:59 +0100 Subject: [PATCH 1/5] Updated ARC related profiles --- docs/pages/resources/profiles.md | 112 +++++-------------------------- 1 file changed, 16 insertions(+), 96 deletions(-) diff --git a/docs/pages/resources/profiles.md b/docs/pages/resources/profiles.md index 92e9b46e..e2b719d8 100644 --- a/docs/pages/resources/profiles.md +++ b/docs/pages/resources/profiles.md @@ -322,108 +322,28 @@ evolved from this profile to conform with RO-Crate 1.1's ## ARC RO-Crate profile -A profile of RO-Crate for [Annotated Research Contexts](https://nfdi4plants.org/content/learn-more/annotated-research-context.html) (ARC), developed by [DataPLANT](https://nfdi4plants.org/). +The [ARC RO-Crate profile](https://github.com/nfdi4plants/arc-ro-crate-profile) describes [Annotated Research Contexts](https://arc-rdm.org/) (ARC), developed by [DataPLANT](https://nfdi4plants.org/). An ARC consists of [ISA](https://isa-specs.readthedocs.io/en/latest/isamodel.html) metadata describing the experimental setup and computational workflows given in CWL. -The current [profile](https://github.com/nfdi4plants/ARC-specification/blob/main/ARC%20specification.md#appendix-conversion-of-arcs-to-ro-crates) requires the crate to follow the ISA Investigation profile on the top level. -In the future, the ARC profile will be extended to not only cover the ISA part of an ARC, but also computational workflows, following the existing profiles for this kind of data. +The current profile is a combination of the [ISA RO-Crate profile](https://github.com/nfdi4plants/isa-ro-crate-profile) and the [ARC CWL RO-Crate profile](https://github.com/nfdi4plants/arc-cwl-ro-crate-profile), see details below. +It requires the crate to follow the ISA Investigation profile on the top level. +The investigation object contains studies and assays from the ISA model, as well as computational workflows. +The core entity of both sub-profiles is the type `LabProcess` defined by [bioschmeas](https://bioschemas.org/) and used to describe processes leading to data files in a parametrized way. How such an RO-Crate can be generated from an ARC is described in the [arc-to-rocrate](https://github.com/nfdi4plants/arc-to-rocrate) repository, which also contains scripts to perform the conversion. -## ISA Profile +## ISA RO-Crate Profile -A profile of RO-Crate for experimental data in plant sciences that is described by metadata following the [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html). +The [ISA RO-Crate profile](https://github.com/nfdi4plants/arc-to-rocrate/blob/main/profiles/investigation.md) describes experimental data in plant sciences that is annotated by metadata following the [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html). Such datasets consist of three types of data entities: `Investigation`, `Study` and `Assay`. -The [profile](https://github.com/nfdi4plants/arc-to-rocrate/blob/main/profiles/investigation.md) adds requirements of the crate such that the data folders match the Investigation, Study and Assay objects of the [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html). - -The profile here describes the top-level `Investigation` object (a dataset) and contained datasets following the `Study` and `Assay` profiles. -Profiles for other included types can be found in the [full version](https://github.com/nfdi4plants/isa-ro-crate-profile). - -### ISA Investigation Profile - -An `Investigation` object describes the top-level meatadata of a scientific investigation, e.g. descriptions of the context, the title, authors and publications (see [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html) for details). -It SHOULD contain further datasets that follow the `Study` profile. - -- [`Dataset`](http://schema.org/dataset) - - [`identifier`](http://schema.org/identifier): [`Text`](https://schema.org/Text) or [`URL`](https://schema.org/URL) (required) - - [`headline`](http://schema.org/headline): [`Text`](https://schema.org/Text) (required) - - [`description`](http://schema.org/description): [`Text`](https://schema.org/Text) (required) - - [`additionalType`](https://schema.org/additionalType): [`Text`](https://schema.org/Text) (required) - - - [`creator`](http://schema.org/creator): [`Person`](https://schema.org/Person) (recommended) - - [`mentions`](http://schema.org/mentions): [`DefinedTermSet`](https://schema.org/DefinedTermSet) (recommended) - - - [`dateCreated`](http://schema.org/dateCreated): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (optional) - - [`datePublished`](http://schema.org/datePublished): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (optional) - - [`citation`](http://schema.org/citation): [`ScholarlyArticle`](https://schema.org/ScholarlyArticle) (optional) - - [`disambiguatingDescription`](http://schema.org/disambiguatingDescription): [`Text`](https://schema.org/Text) (optional) - - [`hasPart`](http://schema.org/hasPart): [`Dataset`](http://schema.org/dataset) (optional) - -### ISA Study Profile - -A `Study` contains information on the subject under study, its characteristics and any treatments applied(see [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html) for details). -It contexualizes further datasets that follow the `Assay` profile. - -- [`Dataset`](http://schema.org/dataset) - - [`identifier`](http://schema.org/identifier): [`Text`](https://schema.org/Text) or [`URL`](https://schema.org/URL) (required) - - [`headline`](http://schema.org/headline): [`Text`](https://schema.org/Text) (required) - - [`additionalType`](https://schema.org/additionalType): [`Text`](https://schema.org/Text) (required) - - [`creator`](http://schema.org/creator): [`Person`](https://schema.org/Person) (required) - - - [`hasPart`](https://schema.org/hasPart): [`Dataset`](https://schema.org/Dataset) or [`File`](https://schema.org/MediaObject) (recommended) - - [`about`](https://schema.org/about): [`LabProcess`](https://bioschemas.org/LabProcess) (recommended) - - [`description`](https://schema.org/description): [`Text`](https://schema.org/Text) (recommended) - - [`dateCreated`](https://schema.org/dateCreated): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (recommended) - - [`dateModified`](https://schema.org/dateModified): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (recommended) - - - [`datePublished`](https://schema.org/datePublished): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (optional) - - [`citation`](https://schema.org/citation): [`ScholarlyArticle`](https://schema.org/ScholarlyArticle) (optional) - - [`comment`](https://schema.org/comment): [`Comment`](https://schema.org/Comment) (optional) - -### ISA Assay Profile - -An `Assay` contains information about a test performed either on material taken from a subject or on a whole initial subject(see [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html) for details). - -- [`Dataset`](http://schema.org/dataset) - - [`additionalType`](https://schema.org/additionalType): [`Text`](https://schema.org/Text) or [`URL`](https://schema.org/URL) (required) - - [`creator`](https://schema.org/creator): [`Person`](https://schema.org/Person) (required) - - [`identifier`](https://schema.org/identifier): [`Text`](https://schema.org/Text) or [`URL`](https://schema.org/URL) (required) - - [`headline`](https://schema.org/headline): [`Text`](https://schema.org/Text) (required) - - [`about`](https://schema.org/about): [`LabProcess`](https://bioschemas.org/LabProcess) (required) - - [`measurementMethod`](https://schema.org/measurementMethod): [`URL`](https://schema.org/URL) or [`DefinedTerm`](https://schema.org/DefinedTerm) (required) - - [`measurementTechnique`](https://schema.org/measurementTechnique): [`URL`](https://schema.org/URL) or [`DefinedTerm`](https://schema.org/DefinedTerm) (required) - - - [`hasPart`](https://schema.org/hasPart): [`File`](https://schema.org/MediaObject) (recommended) - - [`description`](https://schema.org/description): [`Text`](https://schema.org/Text) (recommended) - - [`variableMeasured`](https://schema.org/variableMeasured): [`Text`](https://schema.org/Text) or [`PropertyValue`](https://schema.org/PropertyValue) (recommended) - - [`dateModified`](https://schema.org/dateModified): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (recommended) - - - [`dateCreated`](https://schema.org/dateCreated): [`Date`](https://schema.org/Date) or [`DateTime`](https://schema.org/DateTime) (optional) - - [`citation`](https://schema.org/citation): [`ScholarlyArticle`](https://schema.org/ScholarlyArticle) (optional) - - [`comment`](https://schema.org/comment): [`Comment`](https://schema.org/Comment) (optional) - - +The profile adds requirements of the crate such that the data folders match the Investigation, Study and Assay objects of the [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html). +The description of experimental processes that produce the data uses the [bioschmeas](https://bioschemas.org/) type `LabProcess` as its core entity. +It describes how data and samples are derived from each other in a graph model based on formalized parameters as key-value pairs. +Profiles for all included types can be found in the [full version](https://github.com/nfdi4plants/isa-ro-crate-profile). + +## ARC CWL RO-Crate Profile + +A [ARC CWL RO-Crate profile](https://github.com/nfdi4plants/arc-cwl-ro-crate-profile) extends the existing Worflow Run Crate in an ARC-specific context. +It uses the same process graph types (`LabProtocol`,`LabProcess`) as the ISA profile to further annotate parameters of workflows (equivalent to protocols in ISA) and runs (equivalent to processes in ISA). ## Electronic Lab Notebook (ELN) From 11e264f6878e62ed2d6390de587f4374611d3fd9 Mon Sep 17 00:00:00 2001 From: Florian Wetzels <36967183+floWetzels@users.noreply.github.com> Date: Wed, 30 Oct 2024 09:31:40 +0100 Subject: [PATCH 2/5] Update docs/pages/resources/profiles.md Co-authored-by: Eli Chadwick --- docs/pages/resources/profiles.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/pages/resources/profiles.md b/docs/pages/resources/profiles.md index e2b719d8..a0a9c487 100644 --- a/docs/pages/resources/profiles.md +++ b/docs/pages/resources/profiles.md @@ -327,7 +327,7 @@ An ARC consists of [ISA](https://isa-specs.readthedocs.io/en/latest/isamodel.htm The current profile is a combination of the [ISA RO-Crate profile](https://github.com/nfdi4plants/isa-ro-crate-profile) and the [ARC CWL RO-Crate profile](https://github.com/nfdi4plants/arc-cwl-ro-crate-profile), see details below. It requires the crate to follow the ISA Investigation profile on the top level. The investigation object contains studies and assays from the ISA model, as well as computational workflows. -The core entity of both sub-profiles is the type `LabProcess` defined by [bioschmeas](https://bioschemas.org/) and used to describe processes leading to data files in a parametrized way. +The core entity of both sub-profiles is the type `LabProcess` defined by [Bioschemas](https://bioschemas.org/), which is used to describe processes leading to data files in a parametrized way. How such an RO-Crate can be generated from an ARC is described in the [arc-to-rocrate](https://github.com/nfdi4plants/arc-to-rocrate) repository, which also contains scripts to perform the conversion. From 4c46f95f236b98fb5d11465821220ba65c508e80 Mon Sep 17 00:00:00 2001 From: Florian Wetzels <36967183+floWetzels@users.noreply.github.com> Date: Wed, 30 Oct 2024 09:31:52 +0100 Subject: [PATCH 3/5] Update docs/pages/resources/profiles.md Co-authored-by: Eli Chadwick --- docs/pages/resources/profiles.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/pages/resources/profiles.md b/docs/pages/resources/profiles.md index a0a9c487..4d8f7b6b 100644 --- a/docs/pages/resources/profiles.md +++ b/docs/pages/resources/profiles.md @@ -336,7 +336,7 @@ How such an RO-Crate can be generated from an ARC is described in the [arc-to-ro The [ISA RO-Crate profile](https://github.com/nfdi4plants/arc-to-rocrate/blob/main/profiles/investigation.md) describes experimental data in plant sciences that is annotated by metadata following the [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html). Such datasets consist of three types of data entities: `Investigation`, `Study` and `Assay`. The profile adds requirements of the crate such that the data folders match the Investigation, Study and Assay objects of the [ISA model](https://isa-specs.readthedocs.io/en/latest/isamodel.html). -The description of experimental processes that produce the data uses the [bioschmeas](https://bioschemas.org/) type `LabProcess` as its core entity. +The description of experimental processes that produce the data uses the [Bioschemas](https://bioschemas.org/) type `LabProcess` as its core entity. It describes how data and samples are derived from each other in a graph model based on formalized parameters as key-value pairs. Profiles for all included types can be found in the [full version](https://github.com/nfdi4plants/isa-ro-crate-profile). From d4bec52a6718faef4ed35330e180c9783bfd5c38 Mon Sep 17 00:00:00 2001 From: Florian Wetzels <36967183+floWetzels@users.noreply.github.com> Date: Wed, 30 Oct 2024 09:31:58 +0100 Subject: [PATCH 4/5] Update docs/pages/resources/profiles.md Co-authored-by: Eli Chadwick --- docs/pages/resources/profiles.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/pages/resources/profiles.md b/docs/pages/resources/profiles.md index 4d8f7b6b..e1c6b724 100644 --- a/docs/pages/resources/profiles.md +++ b/docs/pages/resources/profiles.md @@ -342,7 +342,7 @@ Profiles for all included types can be found in the [full version](https://githu ## ARC CWL RO-Crate Profile -A [ARC CWL RO-Crate profile](https://github.com/nfdi4plants/arc-cwl-ro-crate-profile) extends the existing Worflow Run Crate in an ARC-specific context. +The [ARC CWL RO-Crate profile](https://github.com/nfdi4plants/arc-cwl-ro-crate-profile) extends the existing [Workflow Run Crate profile](https://w3id.org/ro/wfrun/workflow/) in an ARC-specific context. It uses the same process graph types (`LabProtocol`,`LabProcess`) as the ISA profile to further annotate parameters of workflows (equivalent to protocols in ISA) and runs (equivalent to processes in ISA). ## Electronic Lab Notebook (ELN) From 813ac38383ff707ba7e0929ef184162457bf9b26 Mon Sep 17 00:00:00 2001 From: Florian Wetzels <36967183+floWetzels@users.noreply.github.com> Date: Wed, 30 Oct 2024 09:32:06 +0100 Subject: [PATCH 5/5] Update docs/pages/resources/profiles.md Co-authored-by: Eli Chadwick --- docs/pages/resources/profiles.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/pages/resources/profiles.md b/docs/pages/resources/profiles.md index e1c6b724..afc9920a 100644 --- a/docs/pages/resources/profiles.md +++ b/docs/pages/resources/profiles.md @@ -343,7 +343,7 @@ Profiles for all included types can be found in the [full version](https://githu ## ARC CWL RO-Crate Profile The [ARC CWL RO-Crate profile](https://github.com/nfdi4plants/arc-cwl-ro-crate-profile) extends the existing [Workflow Run Crate profile](https://w3id.org/ro/wfrun/workflow/) in an ARC-specific context. -It uses the same process graph types (`LabProtocol`,`LabProcess`) as the ISA profile to further annotate parameters of workflows (equivalent to protocols in ISA) and runs (equivalent to processes in ISA). +It uses the same process graph types (`LabProtocol`,`LabProcess`) as the [ISA profile](#isa-ro-crate-profile) to further annotate parameters of workflows (equivalent to protocols in ISA) and runs (equivalent to processes in ISA). ## Electronic Lab Notebook (ELN)