Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Identification workflow needed. #346

Closed
6 tasks
ypriverol opened this issue Jan 21, 2024 · 0 comments · Fixed by #351
Closed
6 tasks

Identification workflow needed. #346

ypriverol opened this issue Jan 21, 2024 · 0 comments · Fixed by #351
Assignees
Labels
enhancement New feature or request
Milestone

Comments

@ypriverol
Copy link
Member

ypriverol commented Jan 21, 2024

Description of feature

First use case: We are doing some research about how to control FDR when integrating multiple datasets at quantms.org at protein level. We currently have a method mad_decoy that we are trying to improve now with the entrapment approach. For the new method, we need to study the distribution of peptide probabilities for each identification.

Second use case: In addition, we want to explore for AI-dataset generation the impact of rank 1,2,3 PSMs in search engines. Most of the AI-dataset for spectra prediction training are based on rank 1 spectra. We want to release dataset with rank 1, rank 2, and 3 spectra for exploring the impact on AI prediction methods.

Third use case: ID at large-scale for spectra library generation.

This subworkflow will help used to provide a solution for peptide identification outside the quant part. It is actually related with #345. I recommend the following @daichengxin @jpfeuffer @timosachsenberg :

  • Perform peptide identification with the three search engines (SAGE, MSGF+, COMET)
  • We should make sure that this work for inmunopeptidomics datasets, when searching with no enzymatic restriction.
  • Percolator will be optional, this is needed because if you want to study pure search engine results and all the search engine ranks, you may need to skip Percolator who select the first rank for each search engine.
  • ConsesusID should be applied on Percolator results or other type of search engines that do not remove the ranks.
  • We should have an ID-filter that filter by PSMs FDR.
  • Export to quantms.io PSM file.

Feedback from @jpfeuffer @daichengxin @timosachsenberg would be great.

@ypriverol ypriverol added the enhancement New feature or request label Jan 21, 2024
@ypriverol ypriverol added this to the Release 1.3 milestone Jan 22, 2024
@ypriverol ypriverol linked a pull request Feb 1, 2024 that will close this issue
11 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants