Skip to content
This repository has been archived by the owner on May 5, 2021. It is now read-only.

Similarity

Miguel Carmona edited this page Jul 2, 2018 · 6 revisions

Index of files

Open Targets similar diseases (based on targets in common)

Filename: json format relation-shared-target.json.gz

Number of records: 557,122

Filename: tsv format relation-shared-target.tsv.gz

Number of rows x columns: 557,122 rows x 10 columns

Open Targets introduced the concept of similarity on diseases based on the number of shared targets. For instance, if you look at the Asthma disease profile page, the section "similar diseases (based on targets in common)" displays similar diseases sharing the same target. Similar diseases for Asthma

For instance, clicking on "Chronic Obstructive Pulmonary Disease" will display a list of shared targets.

Shared targets between Asthma and Chronic Obstructive Pulmonary Disease

The data in relation-shared-target.json.gz is a dump of this information and 2 examples of similar diseases are given below:

{
  "sort": [
    "relation-shared-target#DOID_0050890-EFO_0000584"
  ],
  "_type": "relation-shared-target",
  "_routing": "DOID_0050890",
  "_source": {
    "shared_targets": [
      "ENSG00000164400"
    ],
    "object": {
      "id": "EFO_0000584",
      "links": {
        "targets_count": 18
      },
      "label": "infectious meningitis"
    },
    "scores": {
      "overlap": 0.22265818869331525
    },
    "counts": {
      "shared_count": 1,
      "union_count": 20
    },
    "id": "DOID_0050890-EFO_0000584",
    "subject": {
      "id": "DOID_0050890",
      "links": {
        "targets_count": 3
      },
      "label": "synucleinopathy"
    }
  },
  "_score": null,
  "_index": "18.06_relation-data",
  "_id": "DOID_0050890-EFO_0000584"
}
{
  "sort": [
    "relation-shared-target#DOID_0050890-EFO_0003030"
  ],
  "_type": "relation-shared-target",
  "_routing": "DOID_0050890",
  "_source": {
    "shared_targets": [
      "ENSG00000164400"
    ],
    "object": {
      "id": "EFO_0003030",
      "links": {
        "targets_count": 7
      },
      "label": "abscess"
    },
    "scores": {
      "overlap": 0.33106914536924065
    },
    "counts": {
      "shared_count": 1,
      "union_count": 9
    },
    "id": "DOID_0050890-EFO_0003030",
    "subject": {
      "id": "DOID_0050890",
      "links": {
        "targets_count": 3
      },
      "label": "synucleinopathy"
    }
  },
  "_score": null,
  "_index": "18.06_relation-data",
  "_id": "DOID_0050890-EFO_0003030"
}

The data in relation-shared-target.tsv.gz contains the following columns:

"id", "object.id", "object.label", "object.links.targets_count", "subject.id", "subject.label", "subject.links.targets_count", "scores.overlap", "shared_targets", "counts.shared_count", "counts.union_count"

Similar targets (based on diseases in common)

Similarity on targets is based on the number of shared diseases. For instance, if you look at the NOD2 gene profile page, the section "similar targets (based on diseases in common)" displays similar targets sharing the same disease.

Targets similar to NOD2

Files in json format:

Files in tsv format:

The data in relation-shared-disease-0.tsv.gz contains the following columns:

"id", "object.id", "object.label", "object.links.diseases_count", "subject.id", "subject.label", "subject.links.diseases_count", "scores.overlap", "shared_diseases", "counts.shared_count", "counts.union_count"