You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: posts/2025-10-10-introducing-tidyomics-ecosystem/index.qmd
+30-32Lines changed: 30 additions & 32 deletions
Original file line number
Diff line number
Diff line change
@@ -24,15 +24,15 @@ execute:
24
24
25
25
# Introduction
26
26
27
-
The tidyomics ecosystem was born from a common challenge faced by life-scientists: every omics technology and framework in R seemed to require learning a new data structure and syntax. Switching from bulk RNA-seq to single-cell, or from expression data to genomic ranges, often felt climbing a different mountain. Tidyomics keeps the **underlying objects exactly the same** while giving them a single, tidyverse-flavoured grammar so that moving from bulk RNA-seq to single-cell or spatial data is no harder than shifting between two dplyr pipelines. Its design principles take inspiration from the tidyverse philosophy of clear, human-readable code as articulated by Wickham *et al.* (2019) ([JOSS 10.21105/joss.01686](https://joss.theoj.org/papers/10.21105/joss.01686)).
27
+
The tidyomics ecosystem was born from a common challenge faced by life-scientists: omics technologies and frameworks in R often require specialised data structures and syntax. Switching from bulk RNA-seq to single-cell, or from expression data to genomic ranges, often felt climbing a different mountain. Tidyomics keeps the **underlying objects exactly the same** while giving them a single, tidyverse-flavoured grammar and data display making moving from bulk RNA-seq to single-cell or spatial data seamless. Its design principles take inspiration from the tidyverse philosophy of clear, human-readable code as articulated by Wickham *et al.* (2019) ([JOSS 10.21105/joss.01686](https://joss.theoj.org/papers/10.21105/joss.01686)).
28
28
29
29
This initiative snowballed into an international collaboration—and ultimately into `tidyomics` ([Nat. Methods 2024](https://www.nature.com/articles/s41592-024-02299-2)). Thanks to support from the [Chan Zuckerberg Initiative's Essential Open Source Software for Science (EOSS) Cycle 6 program](https://chanzuckerberg.com/eoss/proposals/?cycle=6), we are actively improving tidyomics through performance optimization, enhanced documentation, and ecosystem expansion to better serve the biomedical research community.
30
30
31
31
# What is Tidyomics?
32
32
33
33
{width="120px"}
34
34
35
-
`tidyomics` is an open project to develop and integrate software and documentation to enable a tidy data analysis framework for omics data objects ([Hutchison *et al.* 2024](https://doi.org/10.1038/s41592-024-02299-2)). The development of packages and tutorials is organized around [tidyomics open challenges](https://github.com/tidyomics/). Tidyomics enables the use of familiar tidyverse verbs (`select`, `filter`, `mutate`, etc.) to manipulate rich data objects in the Bioconductor ecosystem. Importantly, the data objects are not modified, but tidyomics provides a tidy *interface* to work on the native objects, leveraging existing Bioconductor classes and algorithms.
35
+
`tidyomics` is an open project to develop and integrate software and documentation to enable a tidy data analysis framework for omics data objects ([Hutchison *et al.* 2024](https://doi.org/10.1038/s41592-024-02299-2)). The development of packages and tutorials is organized around [tidyomics open challenges](https://github.com/tidyomics/). Tidyomics enables the use of familiar tidyverse verbs (`select`, `filter`, `mutate`, etc.) to manipulate rich data objects in the Bioconductor ecosystem. Importantly, while the data objects are not modified, `tidyomics` provides a tidy *interface* to work on the native objects, leveraging existing Bioconductor classes and algorithms.
36
36
37
37
`tidyomics` is a set of R packages by an international group of developers. The ecosystem allows for code such as:
Loading `tidyprint` (available in the 3.22 Bioconductor release), the `SummarizedExperimenrt` is abstracted to a richer flat representation, without altering its internal properties or structure.
86
+
Loading `tidyprint` (available from the 3.22 Bioconductor release), the `SummarizedExperimenrt` is abstracted to a richer flat representation, without altering its internal properties or structure.
87
87
88
88
```r
89
89
library(tidyprint)
@@ -148,21 +148,21 @@ With a single call you have a tidy interface ready for spatial, single-cell, bul
148
148
## Utility packages
149
149
150
150
### tidyprint
151
-
`tidyprint` offers a consistent, user-friendly print method for Bioconductor objects such as `SummarizedExperiment`, `SingleCellExperiment`, and others. It flattens complex S4 objects into tidy tibbles for straightforward inspection, summarization, and reporting—without modifying the underlying data. This approach makes it easy to explore and understand your data at a glance using familiar tidyverse conventions.
151
+
`tidyprint`(available from the 3.22 Bioconductor release) offers a consistent, user-friendly print method for Bioconductor objects such as `SummarizedExperiment`. It flattens the display of complex S4 objects into tidy tibbles for straightforward inspection, summarization, and reporting—without modifying the underlying data. This approach makes it easy to explore and understand your data at a glance using familiar tidyverse conventions.
Each tidyomics package tackles a real-world analytical challenge. Bulk RNA-seq analyses, for example, are traditionally scattered across disjoint data frames, objects and helper lists. `tidySummarizedExperiment` re-imagines a `SummarizedExperiment`as a tibble-first citizen: you can `filter()`, `mutate()` and `group_by()` genes or samples exactly as you do with any tidyverse data frame. For single-cell data the same philosophy inspired `tidySingleCellExperiment`, while for users of the Seurat workflow we created `tidyseurat`, a drop-in tidy wrapper that never compromises the original Seurat object.
157
+
Bulk RNA-seq analyses, for example, are traditionally scattered across disjoint data frames, objects and helper lists. `tidySummarizedExperiment` re-imagines a `SummarizedExperiment`through a tibble-like interface: you can `filter()`, `mutate()` and `group_by()` genes or samples exactly as you do with any tidyverse data frame. For single-cell data the same philosophy inspired `tidySingleCellExperiment`, while for users of the Seurat workflow we created `tidyseurat`, a drop-in tidy wrapper that makes transitioning between Bioconductor and Seurat frameworks seamless.
158
158
159
159
### tidySummarizedExperiment
160
160
The tidy interface for `SummarizedExperiment` objects, enabling tidyverse operations on bulk RNA-seq data.
Single-cell experiments often contain millions of cells and dozens of matrices. `tidySingleCellExperiment` flattens this complexity so you can focus on the biology instead of the bookkeeping.
165
+
Single-cell experiments are highly dimensional.`tidySingleCellExperiment` flattens this complexity so you can focus on the biology instead of the bookkeeping.
Spatial transcriptomics combines gene expression with tissue geography. `tidySpatialExperiment` brings the tidy philosophy to `SpatialExperiment` objects so you can transform, visualise and model spatial spots with the same verbs you already use for bulk and single-cell data.
175
+
Spatial transcriptomics combines gene expression with tissue spatial coordinates. `tidySpatialExperiment` brings the tidy philosophy to `SpatialExperiment` objects so you can transform, visualise and gate spatial spots with the same verbs you already use for bulk and single-cell data.
0 commit comments