-
Notifications
You must be signed in to change notification settings - Fork 4
Similarity
- Open Targets similar diseases (based on targets in common) relation-shared-target.json.gz relation-shared-target.tsv.gz
- Open Targets similar targets (based on diseases in common)
Filename: json format relation-shared-target.json.gz
Number of records: 557,122
Filename: tsv format relation-shared-target.tsv.gz
Number of rows x columns: 557,122 rows x 10 columns
Open Targets introduced the concept of similarity on diseases based on the number of shared targets. For instance, if you look at the Asthma disease profile page, the section "similar diseases (based on targets in common)" displays similar diseases sharing the same target.
For instance, clicking on "Chronic Obstructive Pulmonary Disease" will display a list of shared targets.
The data in relation-shared-target.json.gz is a dump of this information and 2 examples of similar diseases are given below:
{
"sort": [
"relation-shared-target#DOID_0050890-EFO_0000584"
],
"_type": "relation-shared-target",
"_routing": "DOID_0050890",
"_source": {
"shared_targets": [
"ENSG00000164400"
],
"object": {
"id": "EFO_0000584",
"links": {
"targets_count": 18
},
"label": "infectious meningitis"
},
"scores": {
"overlap": 0.22265818869331525
},
"counts": {
"shared_count": 1,
"union_count": 20
},
"id": "DOID_0050890-EFO_0000584",
"subject": {
"id": "DOID_0050890",
"links": {
"targets_count": 3
},
"label": "synucleinopathy"
}
},
"_score": null,
"_index": "18.06_relation-data",
"_id": "DOID_0050890-EFO_0000584"
}
{
"sort": [
"relation-shared-target#DOID_0050890-EFO_0003030"
],
"_type": "relation-shared-target",
"_routing": "DOID_0050890",
"_source": {
"shared_targets": [
"ENSG00000164400"
],
"object": {
"id": "EFO_0003030",
"links": {
"targets_count": 7
},
"label": "abscess"
},
"scores": {
"overlap": 0.33106914536924065
},
"counts": {
"shared_count": 1,
"union_count": 9
},
"id": "DOID_0050890-EFO_0003030",
"subject": {
"id": "DOID_0050890",
"links": {
"targets_count": 3
},
"label": "synucleinopathy"
}
},
"_score": null,
"_index": "18.06_relation-data",
"_id": "DOID_0050890-EFO_0003030"
}
The data in relation-shared-target.tsv.gz contains the following columns:
"id", "object.id", "object.label", "object.links.targets_count", "subject.id", "subject.label", "subject.links.targets_count", "scores.overlap", "shared_targets", "counts.shared_count", "counts.union_count"
Similarity on targets is based on the number of shared diseases. For instance, if you look at the NOD2 gene profile page, the section "similar targets (based on diseases in common)" displays similar targets sharing the same disease.
Files in json format:
- relation-shared-disease-0.json.gz
- relation-shared-disease-1.json.gz
- relation-shared-disease-2.json.gz
- relation-shared-disease-3.json.gz
- relation-shared-disease-4.json.gz
- relation-shared-disease-5.json.gz
- relation-shared-disease-6.json.gz
- relation-shared-disease-7.json.gz
Files in tsv format:
- relation-shared-disease-0.tsv.gz
- relation-shared-disease-1.tsv.gz
- relation-shared-disease-2.tsv.gz
- relation-shared-disease-3.tsv.gz
- relation-shared-disease-4.tsv.gz
- relation-shared-disease-5.tsv.gz
- relation-shared-disease-6.tsv.gz
- relation-shared-disease-7.tsv.gz
The data in relation-shared-disease-0.tsv.gz contains the following columns:
"id", "object.id", "object.label", "object.links.diseases_count", "subject.id", "subject.label", "subject.links.diseases_count", "scores.overlap", "shared_diseases", "counts.shared_count", "counts.union_count"