Stringency in the NanoMethPhase paper #17

weishwu · 2023-05-22T20:46:36Z

Hi @vahidAK. I followed the pipeline instructed in this git repo and got 30k DMRs (26 million bp in total) from my sample. The Nanopore sequencing depth for my sample is 27x. Read length N50 is 39kb and mean quality score is 14.7.

I noticed that in your paper (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-021-02283-5) you got ~2k DMRs. I understand that there are some parameters at the dma step that control stringency, especially the delta cutoff. But still it would be appreciated if you could let me know what parameters you were using in your paper.

I checked the CpG level differential methylation between the alleles in some known DMRs and imprinted genes and the signals in my data seems to make a lot of sense. It is that region calling thing that binarizes signals into segments that is always bothering me. I don't know how to set up a level of stringency that can achieve a sweet point between sensitivity and specificity. The sparsity nature of CpG methylation data makes this "segmentation" even harder than other types of data that has signal values in a basewise manner, like ChIP-Seq.

Thanks for any insights.

vahidAK · 2023-05-23T19:35:01Z

Hi @weishwu ,

I think this is because of the version of DSS you are using not the parameters, as the parameters in the paper were the defaults of dma module which should give a similar number of DMRs with the latest release with default options. Some versions of DSS tend to give much more DMRs compare to others (It seems it happens when smoothing is true). For example, v2.46.0 tends to give much more DMRs compare to v2.36.0, read issue #7 for more information. Moreover, some samples generally have more allelic DMRs, for example, tumour samples.
You can also refine your DMR list afterward based on the "diff.methy" column (which is the difference of average methylations at DMR from both comparisons) and/or areaStat column.

Best,
Vahid

weishwu · 2023-05-24T19:10:34Z

Thanks! Trying DSS 2.36.0 right now. Seems to be much much slower than 2.46.0. It has been staying at "Estimating dispersion for each CpG site, this will take a while ... 0%" for half a day. The 2.46.0 DSS finished within an hour.

Never mind. There was a glitch in my docker image. Fixed it and it worked fine. The DMRs were reduced by half using DSS 2.36.0.

weishwu closed this as completed May 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stringency in the NanoMethPhase paper #17

Stringency in the NanoMethPhase paper #17

weishwu commented May 22, 2023

vahidAK commented May 23, 2023

weishwu commented May 24, 2023 •

edited

Loading

Stringency in the NanoMethPhase paper #17

Stringency in the NanoMethPhase paper #17

Comments

weishwu commented May 22, 2023

vahidAK commented May 23, 2023

weishwu commented May 24, 2023 • edited Loading

weishwu commented May 24, 2023 •

edited

Loading