Skip to content

Commit fe865ca

Browse files
bienertsGitHub Enterprise
authored andcommitted
minor changes in wording and spelling (#177)
1 parent f2800ca commit fe865ca

File tree

7 files changed

+32
-25
lines changed

7 files changed

+32
-25
lines changed

R/connections.R

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
1-
#' Connection to Biomart
1+
#' Connection to BioMart
22
#'
33
#' @description `r lifecycle::badge("experimental")`
44
#'
55
#' `connect_biomart()` creates a connection object of class [`ConnectionBiomart`] which contains
66
#' the `biomaRt` object of class [`biomaRt::Mart`][biomaRt::Mart-class] and the prefix of the object
77
#' which is used downstream for the query.
88
#'
9-
#' @details This connects to the Ensembl data base of Biomart for human genes.
9+
#' @details This connects to the Ensembl data base of BioMart for human genes.
1010
#'
1111
#' @param prefix (`string`)\cr gene ID prefix.
1212
#'
@@ -37,15 +37,15 @@ connect_biomart <- function(prefix = c("ENSG", "GeneID")) {
3737
slots = c(prefix = "character")
3838
)
3939

40-
#' Get Annotations from Biomart
40+
#' Get Annotations from BioMart
4141
#'
4242
#' @description `r lifecycle::badge("experimental")`
4343
#'
4444
#' Helper function to query annotations from `biomaRt`, for cleaned up gene IDs of
4545
#' a specific ID variable and given [`biomaRt::Mart`][biomaRt::Mart-class].
4646
#'
4747
#' @param gene_ids (`character`)\cr gene IDs, e.g. `10329`.
48-
#' @param id_var (`string`)\cr corresponding gene ID variable name in Biomart,
48+
#' @param id_var (`string`)\cr corresponding gene ID variable name in BioMart,
4949
#' e.g. `entrezgene_id`.
5050
#' @param mart (`Mart`)\cr given [`biomaRt::Mart`][biomaRt::Mart-class] object.
5151
#'
@@ -119,7 +119,7 @@ h_get_annotation_biomart <- function(gene_ids,
119119
#' is extensible: It is simple to add new connections and corresponding query methods
120120
#' for other data bases, e.g. company internal data bases. Please make sure to
121121
#' follow the required format of the returned value.
122-
#' - The Biomart queries might not return information for all the genes. This can be
122+
#' - The BioMart queries might not return information for all the genes. This can be
123123
#' due to different versions being used in the gene IDs and the queried Ensembl data base.
124124
#'
125125
#' @param genes (`character`)\cr gene IDs.
@@ -160,7 +160,7 @@ setGeneric(
160160
#' @export
161161
#'
162162
#' @note This is currently used to strip away the `GeneID` prefix from Entrez gene IDs
163-
#' so that they can be queried from Biomart.
163+
#' so that they can be queried from BioMart
164164
#'
165165
#' @examples
166166
#' h_strip_prefix(c("GeneID:11185", "GeneID:10677"), prefix = "GeneID")

inst/WORDLIST

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
Armen
22
Benjamini
33
BioConductor
4-
Biomart
4+
BioMart
55
Chendi
66
CPM
77
DESeq

man/connect_biomart.Rd

Lines changed: 2 additions & 2 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

man/h_get_annotation_biomart.Rd

Lines changed: 2 additions & 2 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

man/h_strip_prefix.Rd

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

man/query.Rd

Lines changed: 1 addition & 1 deletion
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

vignettes/introduction.Rmd

Lines changed: 19 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -21,9 +21,10 @@ knitr::opts_chunk$set(
2121

2222
## Acknowledgments
2323

24-
`hermes` is a successor of the Roche internal `rnaseqTools` R package, and therefore many code ideas have been borrowed from it. We therefore would like to thank the `rnaseqTools` authors for their work. In particular, we would like to acknowledge Chendi Liao and Joe Paulson for their guidance and explanations during the development of `hermes`. We also discussed the class design with Valerie Obenchain, and discussed RNAseq data standards with Armen Karapetyan. We borrowed some ideas from the Roche internal `biokitr` R package and discussed with its maintainer Daniel Marbach.
24+
`hermes` is a successor of the Roche internal `rnaseqTools` R package, and therefore many code ideas have been borrowed from it. Therefore we would like to thank the `rnaseqTools` authors for their work.
25+
In particular, we would like to acknowledge Chendi Liao and Joe Paulson for their guidance and explanations during the development of `hermes`. We also discussed the class design with Valerie Obenchain, and discussed RNAseq data standards with Armen Karapetyan. We borrowed some ideas from the Roche internal `biokitr` R package and discussed them with its maintainer Daniel Marbach.
2526

26-
Finally, as with any NEST product, `hermes` is only possible because of the whole NEST project team's work, and we are grateful for the larger team's support.
27+
Finally, as with any NEST product, `hermes` is only possible because of the whole NEST project team's work, and we are grateful for the entire team's support.
2728

2829
Thanks a lot to everyone involved!
2930

@@ -39,7 +40,7 @@ vignette(topic = "introduction", package = "hermes")
3940
In this vignette you are going to learn how to:
4041

4142
* Import RNAseq count data into the `hermes` ready format.
42-
* Annotate gene information automatically from a central database (e.g. Biomart).
43+
* Annotate gene information automatically from a central database (e.g. BioMart).
4344
* Add quality control (QC) flags to genes and samples.
4445
* Filter the data set.
4546
* Normalize the counts.
@@ -66,7 +67,7 @@ The data for `hermes` needs to be imported into the `HermesData` or `RangedHerme
6667

6768
### Importing a `SummarizedExperiment`
6869

69-
The simplest import route is from a `SummarizedExperiment` (SE) object. This is because a `HermesData` object
70+
The simplest way to import data is from a `SummarizedExperiment` (SE) object. This is because a `HermesData` object
7071
is just a special SE, with few additional requirements and slots.
7172

7273
In a nutshell, the object needs to have a `counts` assay, have certain
@@ -92,11 +93,11 @@ For a bit more details we can also call `summary()` on the object.
9293
summary(object)
9394
```
9495

95-
Note that here the "additional" columns refer to everything not mandatorily included in a `HermesData` object.
96+
Note that here the "additional" columns refer to everything that is not mandatory to be included in a `HermesData` object.
9697

9798
### Importing an `ExpressionSet`
9899

99-
If we start from an `ExpressionSet`, we can first convert this to a `RangedSummarizedExperiment` and then import to `RangedHermesData`:
100+
If we start from an `ExpressionSet`, we can first convert it to a `RangedSummarizedExperiment` and then import it to `RangedHermesData`:
100101

101102
```{r}
102103
se <- makeSummarizedExperimentFromExpressionSet(expression_set)
@@ -134,7 +135,8 @@ allow for future extensions in this or other downstream packages.
134135

135136
### Connection to Database
136137

137-
The first step is to connect to a database. In `hermes` the only option is currently Biomart.
138+
The first step is to connect to a database. In `hermes` the only option is currently databases that utilize the
139+
BioMart software suite.
138140
However due to the generic function design, it is simple to extend `hermes` with other data base
139141
connections.
140142

@@ -162,17 +164,20 @@ Then the second step is to query the gene annotations and save them in the objec
162164
annotation(small_object) <- query(genes(small_object), connection)
163165
```
164166

165-
Here we are using the `genes()` method to access the gene IDs (row names) of the `HermesData` object. Note that not all genes might be found in the data base and the corresponding rows would then be `NA` in the annotations.
167+
Here we are using the `genes()` method to access the gene IDs (row names) of the `HermesData` object.
168+
Note that not all genes might be found in the data base and the corresponding rows would then be `NA` in the annotations.
166169

167170
## Quality Control Flags
168171

169172
`hermes` provides automatic gene and sample flagging, as well as manual sample flagging functionality.
170173

171174
### Automatic Gene and Sample Flagging
172175

173-
For genes, it is counted how many samples don't pass a minimum expression CPM threshold. If too many, then this gene is flagged as a "low expression" gene.
176+
For genes, it is counted how many samples don't pass a minimum expression CPM (counts per million reads mapped) threshold.
177+
If too many, then this gene is flagged as a "low expression" gene.
174178

175-
For samples, two flags are provided. The "technical failure" flag is based on the average Pearson correlation with other samples. The "low depth" flag is based on the library size, i.e. the total sum of counts for a sample across all genes.
179+
For samples, two flags are provided. The "technical failure" flag is based on the average Pearson correlation with other
180+
samples. The "low depth" flag is based on the library size, i.e. the total sum of counts for a sample across all genes.
176181

177182
Thresholds for the above flags can be initialized with `control_quality()`, and the flags are added with `add_quality_flags()`.
178183

@@ -271,7 +276,8 @@ A series of simple descriptive plots can be obtained by just calling `autoplot()
271276
autoplot(object)
272277
```
273278

274-
Note that individual plots from these can be produced with the series of `draw_*()` functions, see `?plot_all` for the detailed list. Then, these can be customized further.
279+
Note that individual plots from these can be produced with the series of `draw_*()` functions, see `?plot_all` for the
280+
detailed list. Then, these can be customized further.
275281
For example, we can change the number and color of the bins in the library size histogram:
276282

277283
```{r}
@@ -327,7 +333,8 @@ autoplot(
327333

328334
### Correlation with Sample Variables
329335

330-
Afterwards it is easy to correlate the obtained principal components with the sample variables. We obtain a matrix of R-squared (R2) values for all combinations, which can again be visualized as a heatmap.
336+
Subsequently it is easy to correlate the obtained principal components with the sample variables. We obtain a matrix of
337+
R-squared (R2) values for all combinations, which can again be visualized as a heatmap.
331338
See `?pca_cor_samplevar` for details.
332339

333340
```{r, fig.height=8}

0 commit comments

Comments
 (0)