minor edits #15

tavareshugo · 2023-11-28T17:35:37Z

Collecting a few minor issues here, to avoid me pushing now and causing merge conflicts.
These are not urgent and don't need to be fixed imminently - I'm happy to push these changes once development is less active.

A few of these are perhaps higher priority, and I've marked them with 🔴 to highlight this.

setup

Missing comma here

01_intro.Rmd

Slightly rephrase this one, maybe: "After labelling each sample with a different TMT reagent, the same peptide will have an identical mass but be differentially labelled across samples.

02_import_and_infrastructure.Rmd

03_data_processing.Rmd

Again, I feel a little confused by the nomenclature "assays" used here. To avoid confusion with the assay slots, could we maybe use the term "experiment assay"?
Here - Again, since we're in tidyverse land, this could be simplified with count(Search.Engine.Rank). If you wanted to make it more visual, could even pipe to ggplot: ggplot(aes(Search.Engine.Rank, n)) + geom_col() + scale_y_log10()
Typo "give" should be "given"
Exercise maybe give a bit of a clue of which column we should be looking at. You could say something like: "the XX software flags potential contaminant features in the Contaminants column found in the rowData of the experiment assay. For example, we can count how many contaminants there are using cc_qf[["psms_filtered"]] |> rowData() |> as_tibble() |> count(Contaminant). Use the filterCounts() function... etc."

04_normalisation_aggregation.Rmd

outputDir = "." could perhaps be outputs for consistency to where they are saving analysis outputs.
Here - I wonder if having a snapshot of the report would be helpful. For example, the QQ plots or the boxplots. Also, I'm not sure how we infered center.median was the method? In the PDF report it's referred to as median.

05_protein_exploration.Rmd

Here I have max .n is 223 and median is 2.

06_statistical_analysis.Rmd

Revise some of the statistical concepts #16
Here maybe use "continuous variable" instead of covariate.
Could be worth adding somewhere what the interpretation of FDR is. Something like "The FDR defines the fraction of false discoveries that we are willing to tolerate in our list of differential proteins. For example, an FDR threshold of 0.05 means that around 5% of the differential proteins will be false positives. It is up to you to decide what this threshold should be, but conventionally people use 0.01 or 0.05."
Try to be consistent using either BH or FDR. Sometimes one or the other is used.

More general questions:

Here "The quantitation data is stored in columns 47 through to 56" --> how would we know this?
throughout replace cc_qf@ExperimentList with experiments(cc_qf)

The text was updated successfully, but these errors were encountered:

lmsimp · 2023-11-29T19:06:37Z

I've started to address these edits for lessons 4 and 5 in commit e868288 and commit 3278941. Thank you very @tavareshugo for going through these lessons. Please feel free to add more as you go to this issue.

tavareshugo · 2023-11-30T09:09:18Z

I've turned them into checkboxes, it might be easier to keep track of it

lmsimp added a commit that referenced this issue Nov 29, 2023

rephase tmt labelling as per #15

3278941

lmsimp added a commit that referenced this issue Nov 29, 2023

address comments in #15 for lesson 4 import

e868288

lmsimp added a commit that referenced this issue Nov 30, 2023

update all lessons with edits from Hugo in #15 and simplify lessons

9a13c9a

lmsimp added a commit that referenced this issue Dec 2, 2023

update lesson 3 addressing minor edits #15

63b05f0

lmsimp added a commit that referenced this issue Dec 2, 2023

update lessons addressing #15

3fd0f95

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

minor edits #15

minor edits #15

tavareshugo commented Nov 28, 2023 •

edited by lmsimp

Loading

lmsimp commented Nov 29, 2023

tavareshugo commented Nov 30, 2023

minor edits #15

minor edits #15

Comments

tavareshugo commented Nov 28, 2023 • edited by lmsimp Loading

lmsimp commented Nov 29, 2023

tavareshugo commented Nov 30, 2023

tavareshugo commented Nov 28, 2023 •

edited by lmsimp

Loading