Generate report for Reactome #24

goodb · 2018-09-25T19:22:57Z

Show what GO classes can be inferred - show where they match existing annotations, where they differ, and where they differ if they are 'deeper'.

goodb · 2018-11-15T18:10:01Z

This report will be generated based on the go-cam models meant to be as semantically correct with respect to Reactome objectives as possible. This means that it will not be based on models that have been adapted specifically to support curation objectives from the GO. For example, complexes will not be removed or decomposed as Reactome is interested in these.

goodb · 2018-11-15T18:21:21Z

Columns for report:
Reactome id
Reactome label
Reactome node type [complex, reaction, pathway]
Reactome asserted GO classifications (both accessions and terms)
Rule-based GO classifications
OWL-inferred GO classifications

goodb · 2018-11-15T18:50:56Z

Note that the report should have one row per unique reactome id..

goodb · 2018-11-26T06:41:24Z

manual_plus_inferred_mapping.txt

Here is the current report on inferred classes for Reactome entities. Ping @deustp01 @ukemi @thomaspd @cmungall
(Note that it does contain multiple rows for single Reactome ids when those entities are mapped to both a biological process and a molecular function and when the same reaction appears in multiple pathways. e.g. R-HSA-156678 shows up in both situations. @deustp01 depending on if/how you would like to use this information I can adapt the output.

Notes on run:

Inference is based on September 2018 version of GOPlus, arachne_2.12, v1.2, October 4 version of human Reactome
Reactome mapping is done with the 'Direct Import' configuration (complexes, locations are not removed), as opposed to NoctuaCuration configuration.

goodb · 2018-11-26T19:08:06Z

Adding ping to connect @fabregat to this thread.

ukemi · 2019-03-21T16:12:50Z

Hi @goodb,

I was just taking a look at the report you enclosed above. It looks like some of the data is about Reactome Disease models. We are filtering those in the loads, aren't we?

We certainly don't want annotations from GO_CAM models that represent the disease state.

ukemi · 2019-03-21T16:13:09Z

Eg.

Biological Process Defective ABCD1 causes adrenoleukodystrophy (ALD) R-HSA-5684045
Molecular Function Defective ABCD1 does not transfer LCFAs from cytosol to peroxisomal matrix R-HSA-5684043
Biological Process RNF mutants show enhanced WNT signaling and proliferation R-HSA-5340588
Molecular Function RNF43 frameshift mutants show enhanced WNT siganling R-HSA-5340587

deustp01 · 2019-03-21T17:00:30Z

Would the "disease" attribute or Reactome physical entities and events be useful as a filter - for GO, you'd only want instances for which the attribute value is NULL?

ukemi · 2019-03-21T17:09:15Z

We need to learn more from you about the Reactome disease annotations. Are whole pathways labeled as disease pathways? Since GO only annotates 'normal' biology we wouldn't want pathways that represent a disease state. Mind you in the long run it would be fascinating to do the semantic comparison of the 'disease' pathways versus the 'healthy' pathways.

goodb · 2019-03-21T17:20:13Z

@ukemi (first note that a lot has changed since that run, so the inference report is likely going to be very different now).

Right now the code does not do any filtering of the disease models. It looks like the BioPAX for these models isn't really complete enough to develop a proper model anyway. e.g. looking at 'Defective ABCD1 causes adrenoleukodystrophy (ALD)' there isn't any structured data about the disease, the mutant genes, or the relation to the normal pathway in the BioPAX export. If we ever do want to turn this information into GO-CAMs (which I personally think would be very valuable for building analyses) we'd need some work on their end to improve the BioPAX or we'd need to access the data another way. (Ping @deustp01 )

For now I'll add the disease filter to the converter. See #58

goodb · 2019-03-21T17:23:40Z

@deustp01 I don't see any 'disease' information coming through the BioPAX. If there was such a tag, that would be very helpful. My plan was to make use of the Disease pathway hierarchy and ignore any of the subpathways there. Just visually from your browser that looks like it ought to work.

ukemi · 2019-03-21T17:26:05Z

(which I personally think would be very valuable for building analyses)
me too!

deustp01 · 2019-03-21T19:05:21Z

We need to learn more from you about the Reactome disease annotations. Are whole pathways labeled as disease pathways? Since GO only annotates 'normal' biology we wouldn't want pathways that represent a disease state. Mind you in the long run it would be fascinating to do the semantic comparison of the 'disease' pathways versus the 'healthy' pathways.

The relationship between disease pathways and their normal counterparts is complicated, and doesn't work very well for us. A plan to revise it substantially is in the works but it will be a really large effort so it's not clear when it is going to happen.

Meanwhile, there are about three versions of disease pathway: loss-of-function, a variant gene encodes a nonfunctional protein or no protein at all so any reaction dependent on that protein fails (phenylketonuria, adrenoleukodystrophy); gain-of-function, a variant gene has a novel function so a reaction dependent on the protein is altered (constitutively active mutant forms of signaling proteins); a pathogen introduces novel alien proteins into a human cell and those proteins mediate novel reactions with no normal human counterpart.

In every case though, the basic unit of disease annotation is a disease pathway containing one or more disease reactions involving abnormal proteins and possibly abnormal molecules of other sorts, e.g., lipopolysaccharide. If a disease reaction has a normal counterpart, that is noted. All loss-of-function annotations point to the reaction that would have happened if the normal protein had been available, for example. But I'm not sure how any of this is represented in the BioPax export. Try getting a BioPax download of an individual disease pathway and see what it contains.

goodb · 2019-03-21T19:10:57Z

I looked at one and basically none of that information comes through - just the reactions involved and their participants. Even the mutant gene in the one I looked at was not there. So.. if and when we want to go down this road we will need to think through how to do it. Perhaps another case for working on a BioPAX level 4...

deustp01 · 2019-03-21T19:28:48Z

Grasping at another straw here, do the modified-residue attributes of protein (entity with accessioned sequence) instances come through, specifically ones of the genetically modified residue subclass? That's how we annotate the sequence variants that differentiate a mutant disease protein from its canonical UniProt normal counterpart. Any protein with a non-null genetically modified residue attribute is a disease protein, and any reaction involving that protein is a disease reaction.

goodb · 2019-03-21T20:55:19Z

We do get BioPAX "ModificationFeature" annotations on the mutants. These are linked to a SequenceSite and a SequenceModificationVocabulary annotation (e.g. L-arginine removal) which in turn is xrefed to something with db MOD and an id like MOD:01632 .

Getting to this info. is possible but a bit complex. My impression is that it would be easier and perhaps more consistent if we just use the disease subtree to filter these out for now.

deustp01 · 2019-03-21T21:17:51Z

The disease subtree should be equally reliable.

ukemi added Reactome2GO:curation Reactome2GO:Software labels May 12, 2022

kltm added this to Reactome2GO: Manual Review of Reactome/GO-CAM Human Pathways and Derivatives Aug 22, 2024

kltm moved this to To do in Reactome2GO: Manual Review of Reactome/GO-CAM Human Pathways and Derivatives Aug 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate report for Reactome #24

Generate report for Reactome #24

goodb commented Sep 25, 2018

goodb commented Nov 15, 2018

goodb commented Nov 15, 2018

goodb commented Nov 15, 2018

goodb commented Nov 26, 2018

goodb commented Nov 26, 2018

ukemi commented Mar 21, 2019

ukemi commented Mar 21, 2019

deustp01 commented Mar 21, 2019

ukemi commented Mar 21, 2019

goodb commented Mar 21, 2019

goodb commented Mar 21, 2019

ukemi commented Mar 21, 2019

deustp01 commented Mar 21, 2019

goodb commented Mar 21, 2019 •

edited

Loading

deustp01 commented Mar 21, 2019

goodb commented Mar 21, 2019

deustp01 commented Mar 21, 2019

Generate report for Reactome #24

Generate report for Reactome #24

Comments

goodb commented Sep 25, 2018

goodb commented Nov 15, 2018

goodb commented Nov 15, 2018

goodb commented Nov 15, 2018

goodb commented Nov 26, 2018

goodb commented Nov 26, 2018

ukemi commented Mar 21, 2019

ukemi commented Mar 21, 2019

deustp01 commented Mar 21, 2019

ukemi commented Mar 21, 2019

goodb commented Mar 21, 2019

goodb commented Mar 21, 2019

ukemi commented Mar 21, 2019

deustp01 commented Mar 21, 2019

goodb commented Mar 21, 2019 • edited Loading

deustp01 commented Mar 21, 2019

goodb commented Mar 21, 2019

deustp01 commented Mar 21, 2019

goodb commented Mar 21, 2019 •

edited

Loading