Description
Description of the bug
@WangHong007 @zprobot @daichengxin:
I have been recently testing multiple USIs generated by quantms for DIA experiments. A little background about the problem, USIs are a way to reference directly to the scan and spectrum that was used to identify the spectrum, in someway is the fundamental evidence on DDA identification; here is an example.
All DDA search engines keep track of the scan that was used to identify the spectrum. However, in DIA experiments other features are also relevant and DIANN do not trace in the output files of the scan number that was used to identify the peptide. In quantms, we have implemented a logic to "find" for every peptide the scan number used to identify the peptide. @zprobot provided all the USIs for reanalysis PXD019909; however, when we were doing a visualization of multiple USIs they look like random USI meaning the spectrum looks like do not correspond to the given peptide, see example.
I propose the following follow up:
- @WangHong007 can you explain in a comment on this thread in details how the scan are retrieved. Probably @vdemichev can double-check that our logic is correct.
- @zprobot can you provide for other reanalyses the corresponding USIs and the Posterior error probabilities. The idea is that we can check for the best IDs (lower PEP) in their USIs. It is important to have the USIs and PEP for other projects (reanalyses) to see if the problems are related with the specific project.
- Double check that for this project PXD019909 the USIs are well annotated @WangHong007 @zprobot. If they are well annotated, and we check USIs for good IDs (lower PEP) we can try to discuss with @vdemichev what is the problem on this project.
Please let me know if you need more discussion.
Additional examples that looks wrong:
- https://www.ebi.ac.uk/pride/archive/usi?usi=mzspec:PXD019909:20180914_QE8_nLC0_BDA_SA_DIA_Keratinocytes_NN002:scan:7150:GKQEEEKPGEEK/2&resultType=FULL
- https://www.ebi.ac.uk/pride/archive/usi?usi=mzspec:PXD019909:20180914_QE8_nLC0_BDA_SA_DIA_Keratinocytes_NN002:scan:6918:HSQAGQGQSEGSR/2&resultType=FULL
Command used and terminal output
No response
Relevant files
No response
System information
No response