Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ena upload tool #56

Open
wants to merge 26 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
26 commits
Select commit Hold shift + click to select a range
ee08e57
Added a copy of the ENA templates for the upload tool
Sep 20, 2023
2d262fc
Renamed the template files
Sep 21, 2023
7cf4e52
Updated the mandatory information of the sources template
Sep 21, 2023
2da57de
Remove all the optional attributes from the sources template
Sep 21, 2023
3bb2e56
Change Name of the source template
Sep 21, 2023
75e5e65
Change the name of the sample collection template
Sep 21, 2023
6078382
Change name and level of the nucleic extraction template
Sep 21, 2023
a50c6ab
Change metadata of the Library preparation template
Sep 21, 2023
8606b69
change attribute names of the library construction template
Sep 21, 2023
31736cf
Update metadata of the nucleic acid sequencing template
Sep 21, 2023
8247f59
Change attributes of the nucleic acid sequencing template
Sep 21, 2023
1d3447a
Change the metadata of the sequence assembly template
Sep 21, 2023
0a27712
Archiving the `nucleic extraction` and `sequence assembly` template
Sep 21, 2023
b58e555
Merge all separate templates into one aggregated template json
Sep 22, 2023
e52ac08
Updated the templates for ENA Upload
Oct 3, 2023
a695e9f
Adjusting templates
Oct 21, 2023
55c8cce
changed 'submission_date' to 'submission date'
Nov 8, 2023
8fd1787
Change submission date back to submission_date
kdp-cloud Nov 19, 2023
d3711d3
Add seed file for DataHub extended metadata
kdp-cloud Nov 19, 2023
72c2b6b
Change data type
kdp-cloud Nov 19, 2023
3f4d92f
Change datatype name to "Registered Sample List"
kdp-cloud Nov 30, 2023
cf9ddff
Replace "ontology" dataType by "Controlled Vocabulary"
kdp-cloud Nov 30, 2023
22a4bc0
Fix template
kdp-cloud Feb 16, 2024
1f12857
Clean-up ena templates
kdp-cloud Feb 16, 2024
12d7d4b
Remove duplicate 'protocol' ISA tag
kdp-cloud Jan 7, 2025
bad9421
Merge branch 'main' into ena_upload_tool
kdp-cloud Jan 7, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
249 changes: 249 additions & 0 deletions templates/ena_upload_tool/archive/library construction.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,249 @@
{
"metadata": {
"name": "library construction ENA Upload Tool",
"group": "ena modified for upload tool",
"group_order": 4,
"temporary_name": "4_ena_custom_library_construction",
"version": "1.0.0",
"isa_config": "genome_seq_default_v2015-07-02",
"isa_measurement_type": "genome sequencing",
"isa_technology_type": "nucleotide sequencing",
"isa_protocol_type": "library construction",
"repo_schema_id": "ENA General experiment information 2021",
"organism": "any",
"level": "assay - material"
},
"data": [
{
"iri": null,
"name": "Input",
"description": "Existing Samples in DataHub that represent input for this protocol.",
"dataType": "SEEK Sample Multi",
"required": "true",
"isaTag": null
},
{
"iri": null,
"name": "library construction",
"description": "type of assay or experimental step performed.",
"required": true,
"ontology": null,
"dataType": "String", //need to be autofilled with name of selected SOP for that block
"CVList": null, //SOPtittle
"isaTag": "protocol"
},
{
"iri": null,
"name": "library_construction_protocol",
"description": "Free form text describing the protocol by which the sequencing library was constructed.",
"dataType": "String",
"required": false,
"ontology": null,
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "title",
"description": "Short text that can be used to call out experiment records in searches or in displays.",
"required": true,
"ontology": null,
"dataType": "String",
"CVList": null,
"isaTag": "otherMaterial_characteristic"
},
{
"iri": null,
"name": "design_description",
"description": "The design of the library including details of how it was constructed. ",
"required": true,
"ontology": null,
"dataType": "String",
"CVList": null,
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "library_source",
"description": "The LIBRARY_SOURCE specifies the type of source material that is being sequenced. (SRA 1.2 documentation)",
"dataType": "Controlled Vocabulary",
"required": true,
"ontology": null,
"CVList": [
"GENOMIC",
"GENOMIC SINGLE CELL",
"TRANSCRIPTOMIC",
"TRANSCRIPTOMIC SINGLE CELL",
"METAGENOMIC",
"METATRANSCRIPTOMIC",
"SYNTHETIC",
"VIRAL RNA",
"OTHER"
],
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "library_strategy",
"description": "Sequencing technique intended for this library (SRA 1.2 documentation)",
"dataType": "Controlled Vocabulary",
"required": true,
"ontology": null,
"CVList": [
"WGS",
"WGA",
"WXS",
"RNA-Seq",
"ssRNA-seq",
"miRNA-Seq",
"ncRNA-Seq",
"FL-cDNA",
"EST",
"Hi-C",
"ATAC-seq",
"WCS",
"RAD-Seq",
"CLONE",
"POOLCLONE",
"AMPLICON",
"CLONEEND",
"FINISHING",
"ChIP-Seq",
"MNase-Seq",
"DNase-Hypersensitivity",
"Bisulfite-Seq",
"CTS",
"MRE-Seq",
"MeDIP-Seq",
"MBD-Seq",
"Tn-Seq",
"VALIDATION",
"FAIRE-seq",
"SELEX",
"RIP-Seq",
"ChIA-PET",
"Synthetic-Long-Read",
"Targeted-Capture",
"Tethered Chromatin Conformation Capture",
"OTHER",
"NOMe-Seq",
"ChM-Seq",
"GBS"
],
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "library_selection",
"description": "Whether any method was used to select for or against, enrich, or screen the material being sequenced. (SRA 1.2 documentation)",
"dataType": "Controlled Vocabulary",
"required": true,
"ontology": null,
"CVList": [
"RANDOM",
"PCR",
"RANDOM PCR",
"RT-PCR",
"HMPR",
"MF",
"repeat fractionation",
"size fractionation",
"MSLL",
"cDNA",
"cDNA_randomPriming",
"cDNA_oligo_dT",
"PolyA",
"Oligo-dT",
"Inverse rRNA",
"Inverse rRNA selection",
"ChIP",
"ChIP-Seq",
"MNase",
"DNase",
"Hybrid Selection",
"Reduced Representation",
"Restriction Digest",
"5-methylcytidine antibody",
"MBD2 protein methyl-CpG binding domain",
"CAGE",
"RACE",
"MDA",
"padlock probes capture method",
"other",
"unspecified"
],
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "library_layout",
"description": "specifies whether to expect single, paired, or other configuration of reads. In the case of paired reads, information about the relative distance and orientation is specified.",
"dataType": "Controlled Vocabulary",
"required": false,
"ontology": null,
"CVList": [
"SINGLE",
"PAIRED"
],
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "insert_size",
"description": "Insert size for paired reads.",
"dataType": "String",
"required": false,
"ontology": null,
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "library_name",
"description": "Name given to the generated library",
"dataType": "String",
"title": true,
"required": true,
"ontology": null,
"CVList": null,
"isaTag": "otherMaterial"
},
{
"iri": null,
"name": "submission date",
"description": "Date in which experiment was submitted to ENA.",
"dataType": "String",
"required": false,
"ontology": null,
"CVList": null,
"isaTag": "otherMaterial_characteristic"
},
{
"iri": null,
"name": "status",
"description": "Status of submission to ENA.",
"dataType": "String",
"required": false,
"ontology": null,
"CVList": [
"add",
"added",
"modify",
"modified",
"cancel",
"cancelled",
"release",
"released"
],
"isaTag": "otherMaterial_characteristic"
},
{
"iri": null,
"name": "accession",
"description": "ENA experiment accession code. Provided by ENA after submission.",
"dataType": "String",
"required": false,
"ontology": null,
"CVList": null,
"isaTag": "otherMaterial_characteristic"
}
]
}
57 changes: 57 additions & 0 deletions templates/ena_upload_tool/archive/nucleic acid extraction.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,57 @@
{
"metadata": {
"name": "nucleic acid extraction ENA Upload Tool",
"group": "ena modified for upload tool",
"group_order": 3,
"temporary_name": "3_ena_custom_nucleic_acid_extraction",
"version": "1.0.0",
"isa_config": "genome_seq_default_v2015-07-02",
"isa_measurement_type": "genome sequencing",
"isa_technology_type": "nucleotide sequencing",
"isa_protocol_type": "nucleic acid extraction",
"repo_schema_id": "ENA General experiment information 2021",
"organism": "any",
"level": "assay - material"
},
"data": [
{
"iri": null,
"name": "Input",
"description": "Existing Samples in DataHub that represent input for this protocol.",
"dataType": "SEEK Sample Multi",
"required": "true",
"isaTag": null
},
{
"iri": null,
"name": "nucleic acid extraction",
"description": "type of assay or experimental step performed.",
"required": true,
"ontology": null,
"dataType": "String", //need to be autofilled with name of selected SOP for that block
"CVList": , //SOPtittle
"isaTag": "protocol"
},
{
"iri": null,
"name": "sample volume or weight for DNA extraction",
"description": "Volume (mL) or weight (g) of sample processed for DNA extraction",
"required": false,
"ontology": null,
"dataType": "String", // Original regex "(0|((0\\.)|([1-9][0-9]*\\.?))[0-9]*)([Ee][+-]?[0-9]+)?",
"unit": null, //Original unit options ["mL","g"],
"isaTag": "parameter_value"
},
{
"iri": null,
"name": "Extract Name",
"description": "User-defined names for each portion of extracted material.",
"dataType": "String",
"title": true,
"required": true,
"ontology": null,
"CVList": null,
"isaTag": "otherMaterial"
}
]
}
Loading