Skip to content

DorresteinLaboratory/Bioactive_Molecular_Networks

Repository files navigation

Bioactive Molecular Networks

The bioactive molecular network workflow integrates MS/MS molecular networking and bioassay-guided fractionation into the concept of bioactive molecular networking. The workflow relies on open bioinformatic tools, such MZmine2 or Optimus (using OpenMS), a Jupyter notebook, and the GNPS web-platform (http://gnps.ucsd.edu). The code is released as a Jupyter notebook for easiness and reproducibility. The jupyter notebook has been prepared by Dr. Ricardo Silva (UCSD).

Citations

Bioactivity-Based Molecular Networking for the Discovery of Drug Leads in Natural Product Bioassay-Guided Fractionation. Nothias, L.-F.; Nothias-Esposito, M.; da Silva, R.; Wang, M.; Protsyuk, I.; Zhang, Z.; Sarvepalli, A.; Leyssen, P.; Touboul, D.; Costa, J.; Paolini J., Alexandrov T., Litaudon M., Dorrestein, P.; et al. J. Nat. Prod. 81, 4, 758–767, (2018).

Feature-based molecular networking in the GNPS analysis environment. Nothias, L.-F., Petras, D., Schmid, R. et al. Nat. Methods 17, 905–908 (2020).

Graphical abstract

Table of contents

Try online with Binder

You can quickly launch the notebooks of this repository using Binder by clicking this badge : Binder

Installation and system requirements:

Install MZmine2 or Optimus

Jupyter notebook and R:

The Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text. Note that the notebook is available as an R script also.

GNPS and Cytoscape:

  • You will need to install Cytoscape (minimum version 3.4).

Running the bioactive molecular networking workflow procedure

NB: Template files for each steps are available in the folder TEMPLATE_FILE

The bioactive molecular networking workflow

Step 1 - MZmine2 or Optimus LC-MS/MS processing, and bioassay data

The step 1 will output:

  • a feature quantification table (features_quantification_matrix.csv) that contains the aligned list of features and their intensity accross the fractions analyzed by LC-MS/MS
  • a .MGF file containing the MS/MS spectral summary.

MZmine2:

Process your data following instructions described in the GNPS feature-based molecular networking workflow (FBMN) documentation. Note that two options in the "Peak list row filter" are required "Keep only features with MS/MS scan (GNPS)", 'Reset the peak number". By default, the bioactive molecular networks Jupyter notebook requires a table exported with the 'Export to CSV' module in MZmine, that includes 'Export row ID', 'Export row m/z', 'Export row retention time', and 'Peak area'. See the template provided.

Optimus:

Refers to the documentation on https://github.com/MolecularCartography/Optimus Note that the option 'MS/MS filter' has to be activated.

Other processing tools supported:

Note that the workflow can be adapted to any other tools available in the GNPS feature-based molecular networking workflow (FBMN) documentation. In these cases, you might have to convert the feature table into the needed format. Note that feature quantification table converted in the proper format can be downloaded from the FBMN results page.

Add the value of the bioassay experiments for each sample analyzed by LC-MS/MS in the table (features_quantification_matrix.csv to features_quantification_matrix_edited_bioactivity.csv)

NB: alternatively any other quantitative variable can be used instead (pH, metabolite, concentration, ....)

Step 2 - Running the Jupyter notebook:

The step 2 will calculate the Pearson correlation score and the its significance for every detected features, against the result of the bioassay, and output a table that can imported into Cytoscape:

NB: note that the notebook is available as an R script as well (Bioactive_Molecular_Networks_v1.1_MZmine2.r).

  • Upload the Jupyter notebook Bioactive_molecular_networks_v1.0.ipynb in your Jupyter environment.
  • Upload your MZmine or Optimus input file (features_quantification_matrix_edited_bioactivity.csv) in the notebook folder.
  • In your Jupyter environment, open the Bioactive_molecular_networks_v1.0.ipynb Jupyter notebook.
  • Make sure the R kernel is selected in the notebook.
  • In the notebook, specify the path to the input table. If you used an MZmine output table, start at cell 1. If you use an Optimus output table, skip cells 1-3 and start directly at cell 4.
  • Run the Jupyter notebook by clicking in menu Cell/Run All, and wait for the notebook to process (should be less than 30 sec). Informations are provided as comments in each cells.
  • Download the output file features_quantification_matrix_transposed_with_significant_correlation_pvalue_corrected.csv

Step 3 - GNPS and Cytoscape

  • Go on Global Natural Product Social Molecular Networking web-platform http://gnps.ucsd.edu
  • Run a Data Analysis job following the GNPS feature-based molecular networking (FBMN) workflow. Important: the molecular networks will be generated from the .MGF file export with MZmine2 or Optimus, and MS-Cluster will not be used. See the FBMN documentation and video tutorials for how to use GNPS.
  • Import your molecular networks in Cytoscape following GNPS documentation https://ccms-ucsd.github.io/GNPSDocumentation/.
  • In Cytoscape, import the output table of the Jupyter notebook (features_quantification_matrix_transposed_with_significant_correlation_pvalue_corrected.csv) into your molecular networks.
  • In Cytoscape, use the Select function to select the nodes with significant bioactivity (typical threshold would be r2 > 0.8 and p-value < -0.02).
  • In Cytoscape, bypass the style of these nodes as needed (the node size or node shape can be used to visualize). Piechart diagrams can be used to display the distribution of an ion accross samples.

Figure from the manuscript Nothias et al 2018

Tools employed in the workflow

GNPS: Wang, M.; Carver, J. J.; Phelan, V. V.; Sanchez, L. M.; Garg, N.; Peng, Y.; Nguyen, D. D.; Watrous, J.; Kapono, C. A.; Luzzatto-Knaan, T.; et al. Sharing and Community Curation of Mass Spectrometry Data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 2016, 34 (8), 828–837. https://www.nature.com/articles/nbt.3597

MZmine: Pluskal, T.; Castillo, S.; Villar-Briones, A.; Oresic, M. MZmine 2: Modular Framework for Processing, Visualizing, and Analyzing Mass Spectrometry-Based Molecular Profile Data. BMC Bioinformatics 2010, 11, 395. https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-11-395

OpenMS: Röst, H. L.; Sachsenberg, T.; Aiche, S.; Bielow, C.; Weisser, H.; Aicheler, F.; Andreotti, S.; Ehrlich, H.-C.; Gutenbrunner, P.; Kenar, E.; et al. OpenMS: A Flexible Open-Source Software Platform for Mass Spectrometry Data Analysis. Nat. Methods 2016, 13 (9), 741–748. https://www.nature.com/articles/nmeth.3959

Optimus: Protsyuk, I.; Melnik, A. V.; Nothias, L.-F.; Rappez, L.; Phapale, P.; Aksenov, A. A.; Bouslimani, A.; Ryazanov, S.; Dorrestein, P. C.; Alexandrov, T. 3D Molecular Cartography Using LC-MS Facilitated by Optimus and ’Ili Software. Nat. Protoc. 2018, 13 (1), 134–154. https://www.nature.com/articles/nprot.2017.122

Cytoscape: Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N. S.; Wang, J. T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003, 13 (11), 2498–2504. https://www.ncbi.nlm.nih.gov/pubmed/14597658

Other processing tools if used with the FBMN workflow.

Help and Troubleshooting

Licence

The Jupyter notebooks available on this GitHub are available under the MIT Open licence. https://github.com/DorresteinLaboratory/Bioactive_Molecular_Networks/blob/master/LICENSE.md

About

Bioactive Molecular Networks Project

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published