You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: vignettes/introduction.Rmd
+19-12Lines changed: 19 additions & 12 deletions
Original file line number
Diff line number
Diff line change
@@ -21,9 +21,10 @@ knitr::opts_chunk$set(
21
21
22
22
## Acknowledgments
23
23
24
-
`hermes` is a successor of the Roche internal `rnaseqTools` R package, and therefore many code ideas have been borrowed from it. We therefore would like to thank the `rnaseqTools` authors for their work. In particular, we would like to acknowledge Chendi Liao and Joe Paulson for their guidance and explanations during the development of `hermes`. We also discussed the class design with Valerie Obenchain, and discussed RNAseq data standards with Armen Karapetyan. We borrowed some ideas from the Roche internal `biokitr` R package and discussed with its maintainer Daniel Marbach.
24
+
`hermes` is a successor of the Roche internal `rnaseqTools` R package, and therefore many code ideas have been borrowed from it. Therefore we would like to thank the `rnaseqTools` authors for their work.
25
+
In particular, we would like to acknowledge Chendi Liao and Joe Paulson for their guidance and explanations during the development of `hermes`. We also discussed the class design with Valerie Obenchain, and discussed RNAseq data standards with Armen Karapetyan. We borrowed some ideas from the Roche internal `biokitr` R package and discussed them with its maintainer Daniel Marbach.
25
26
26
-
Finally, as with any NEST product, `hermes` is only possible because of the whole NEST project team's work, and we are grateful for the larger team's support.
27
+
Finally, as with any NEST product, `hermes` is only possible because of the whole NEST project team's work, and we are grateful for the entire team's support.
Here we are using the `genes()` method to access the gene IDs (row names) of the `HermesData` object. Note that not all genes might be found in the data base and the corresponding rows would then be `NA` in the annotations.
167
+
Here we are using the `genes()` method to access the gene IDs (row names) of the `HermesData` object.
168
+
Note that not all genes might be found in the data base and the corresponding rows would then be `NA` in the annotations.
166
169
167
170
## Quality Control Flags
168
171
169
172
`hermes` provides automatic gene and sample flagging, as well as manual sample flagging functionality.
170
173
171
174
### Automatic Gene and Sample Flagging
172
175
173
-
For genes, it is counted how many samples don't pass a minimum expression CPM threshold. If too many, then this gene is flagged as a "low expression" gene.
176
+
For genes, it is counted how many samples don't pass a minimum expression CPM (counts per million reads mapped) threshold.
177
+
If too many, then this gene is flagged as a "low expression" gene.
174
178
175
-
For samples, two flags are provided. The "technical failure" flag is based on the average Pearson correlation with other samples. The "low depth" flag is based on the library size, i.e. the total sum of counts for a sample across all genes.
179
+
For samples, two flags are provided. The "technical failure" flag is based on the average Pearson correlation with other
180
+
samples. The "low depth" flag is based on the library size, i.e. the total sum of counts for a sample across all genes.
176
181
177
182
Thresholds for the above flags can be initialized with `control_quality()`, and the flags are added with `add_quality_flags()`.
178
183
@@ -271,7 +276,8 @@ A series of simple descriptive plots can be obtained by just calling `autoplot()
271
276
autoplot(object)
272
277
```
273
278
274
-
Note that individual plots from these can be produced with the series of `draw_*()` functions, see `?plot_all` for the detailed list. Then, these can be customized further.
279
+
Note that individual plots from these can be produced with the series of `draw_*()` functions, see `?plot_all` for the
280
+
detailed list. Then, these can be customized further.
275
281
For example, we can change the number and color of the bins in the library size histogram:
276
282
277
283
```{r}
@@ -327,7 +333,8 @@ autoplot(
327
333
328
334
### Correlation with Sample Variables
329
335
330
-
Afterwards it is easy to correlate the obtained principal components with the sample variables. We obtain a matrix of R-squared (R2) values for all combinations, which can again be visualized as a heatmap.
336
+
Subsequently it is easy to correlate the obtained principal components with the sample variables. We obtain a matrix of
337
+
R-squared (R2) values for all combinations, which can again be visualized as a heatmap.
0 commit comments