Skip to content

xomicsdatascience/zoDIAq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

zoDIAq

Introduction

Uses an MIT License

zoDIAq (derived from "csodiaq" acronym, Cosine Similarity Optimization for DIA qualitative and quantitative analysis) is a software tool for the analysis of direct infusion shotgun proteome analysis (DISPA) data and data independent acquisition (DIA) data.

Click here for the paper published in Analytical Chemistry.

Paper Title Header

Instructions

Installation

All installation instructions are done from the command line. Download and use of zoDIAq requires access to the git and pip packages. For system-specific instructions on using the command line or installing these packages, see the wiki page.

Accessing the GUI

From the command line enter zodiaq gui to start the GUI. All instructions in this README file pertain to use of the GUI. Instructions for use from the command line can be accessed by entering zodiaq -h.

Peptide Identification

Peptide Identification GUI Picture

Files:

  1. DIA Data File: This is a required field. Choose at least one data file from a DISPA or DIA data run.
  2. Library File: This is a required field. Choose a reference library file that is in TraML (.tsv or .csv) or MGF format. zoDIAq treats .tsv files as tab-deliminated and .csv as comma-deliminated. Some library formats do not follow this standard (ie, .csv files that are tab-deliminated) and will require adjustment by the end user.
  3. Outfile Directory: This is a required field. Choose a folder that the output files should go into.

Settings:

Note that it is recommended that the default value is used for each of these settings.

  1. Pre-correction m/z difference tolerance for peak matches (in ppm): Default value is 30. If you're not sure what would make an ideal setting, go ahead and leave this blank, but check the histogram box below. If the resulting histogram doesn't have a normal distribution (a peak), consider using a wider tolerance.
  2. Correction: Default is checked. This value resets the PPM tolerance range based on an initial scan. It reduces the liklihood of identifying false positives.
  3. Corrective Standard Deviations: Only available if the correction box is checked. Default is to customize the corrected tolerance to the distribution of the histogram, excluding noise around the peak. Entering a value here instead sets the tolerance to a standard deviation of the distribution.
  4. Create Histogram: Only available if the correction box is checked. Default is checked. Generates a histogram to visualize the corrective PPM tolerance used in the corrected analysis.

Scoring and FDR Filtering

Screenshot 2023-12-12 at 5 03 11 PM

Files:

  1. zoDIAq Scoring Output Directory: This is a required field. Choose the directory created by the Peptide Identification output of zoDIAq.

Settings:

Note that it is recommended that the default value is used for each of these settings.

  1. protein quantification method (if applicable): Two protein quantification methods are available. The first default option, maxlfq, is based on the maxLFQ method. The second option, sum, is a simplified method that quantifies proteins by summing relevant peptides identified in all samples. More details regarding the implementation of these methods can be found on the wiki.
  2. minimum number of matching peptides between samples (only for 'maxlfq' method): The maxLFQ method requires comparing each sample to every other sample. This variable dictates how many matching quantified peptides need to have been found between two samples for them to be considered a match.

Targeted Peptide Reanalysis

Screenshot 2023-12-01 at 10 25 53 AM

Files:

  1. zoDIAq Identification Output Directory: This is a required field. Choose the directory created by the Scoring and FDR Filtering output of zoDIAq.

Settings:

  1. Maximum Number of peptides per protein: By default, no protein analysis is completed. If a number is provided, the program will provide a mass spectrometer file targetting the given number of peptides for each protein.
  2. Proximity m/z values should be to a bin value: To avoid increased cost for targetting every peptide individually, specific m/z bins are targeted for overlapping peptides. This setting indicates how close a peptide's m/z value should be to the "center" of a bin.
  3. Include Heavy Isotopes for SILAC protocol: For targeted reanalysis of peptides using a SILAC protocol, the heavy isotopes for chosen peptides will be included as well.

Citations

About

Cosine Similarity Optimization for DIA qualitative and quantitative analysis

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages