Skip to content

Commit

Permalink
0.1.0.001
Browse files Browse the repository at this point in the history
  • Loading branch information
antaldaniel committed Jun 23, 2022
1 parent dd3a944 commit 21bc43e
Show file tree
Hide file tree
Showing 3 changed files with 12 additions and 11 deletions.
4 changes: 3 additions & 1 deletion DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,12 +1,14 @@
Package: dataset
Title: Create interoperable and well-documented data frames
Version: 0.1.0
Version: 0.1.0.001
Authors@R:
person("Daniel", "Antal", , "[email protected]", role = c("aut", "cre"),
comment = c(ORCID = "0000-0001-7513-6760"))
Description: Create well-documented, well-described, interoperable datasets in R for
publication and use on the (semantic) web.
License: GPL (>= 3)
URL: https://github.com/dataobservatory-eu/dataset
BugReports: https://github.com/dataobservatory-eu/dataset/issues
Encoding: UTF-8
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.2.0
Expand Down
2 changes: 1 addition & 1 deletion README.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -62,7 +62,7 @@ print(iris_bibentry, sytle="html")

## faIR: Interoperable & Reusable Datasets

The interoperability and reusability of datasets is greatly enhanced if they follow a standardized and practical format. Our datasets follow the tidy data principles^[Wickham, H.(2014). Tidy Data. Journal of Statistical Software, 59(10), 1–23. <https://doi.org/10.18637/jss.v059.i10>] and are interoperable with the W3C [DF Data Cube Vocabulary]^[RDF Data Cube Vocabulary, W3C Recommendation 16 January 2014 <https://www.w3.org/TR/vocab-data-cube/#metadata>] (semantic web) and SDMX (statistical) dataset definitions.
The interoperability and reusability of datasets is greatly enhanced if they follow a standardized and practical format. Our datasets follow the tidy data principles^[Wickham, H.(2014). Tidy Data. Journal of Statistical Software, 59(10), 1–23. <https://doi.org/10.18637/jss.v059.i10>] and are interoperable with the W3C ^[DF Data Cube Vocabulary]^[RDF Data Cube Vocabulary, W3C Recommendation 16 January 2014 <https://www.w3.org/TR/vocab-data-cube/#metadata>] (semantic web) and SDMX (statistical) dataset definitions.

Both W3C and SDMX uses are more complex object, the Datacube in its description. The dataset is a redused datacube. To adhere to tidy data principles and easy use in reproducible resaerch workflows, we further reduced our subjective definition of the dataset.

Expand Down
17 changes: 8 additions & 9 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@ findable, accessible, interoperable and reusable.

## Installation

You can install the development version of dataset like so:
You can install the development version of dataset from Github:

``` r
devtools::install_package(dataobservatory-eu/dataset)
Expand Down Expand Up @@ -84,9 +84,11 @@ Edgar A (1935). “Iris Dataset.”

The interoperability and reusability of datasets is greatly enhanced if
they follow a standardized and practical format. Our datasets follow the
tidy data principles[^1] and are interoperable with the W3C \[DF Data
Cube Vocabulary\][^2] (semantic web) and SDMX (statistical) dataset
definitions.
tidy data principles[^1] and are interoperable with the W3C <sup>\[DF
Data Cube Vocabulary\]</sup>\[RDF Data Cube Vocabulary, W3C
Recommendation 16 January 2014
<https://www.w3.org/TR/vocab-data-cube/#metadata>\] (semantic web) and
SDMX (statistical) dataset definitions.

Both W3C and SDMX uses are more complex object, the Datacube in its
description. The dataset is a redused datacube. To adhere to tidy data
Expand Down Expand Up @@ -143,7 +145,7 @@ attributes(iris_ds)
#> [145] 145 146 147 148 149 150
#>
#> $Modified
#> [1] "2022-06-23 17:40:26 CEST"
#> [1] "2022-06-23 17:54:32 CEST"
#>
#> $dataset_id
#> [1] "iris_dataset"
Expand Down Expand Up @@ -173,7 +175,7 @@ If your application needs URIs, the first 5 elements of the iris dataset
can be referenced as `dataset_id#obs_id` will be used to create unique
resource identifiers (URIs). For the first five observations of the
`iris` dataset:
iris_dataset#1,iris_dataset#2,iris_dataset#3,iris_dataset#4,iris_dataset#5
iris_dataset#1’,‘iris_dataset#2’,‘iris_dataset#3’,‘iris_dataset#4’,’iris_dataset#5

## Code of Conduct

Expand All @@ -184,6 +186,3 @@ By contributing to this project, you agree to abide by its terms.

[^1]: Wickham, H.(2014). Tidy Data. Journal of Statistical Software,
59(10), 1–23. <https://doi.org/10.18637/jss.v059.i10>

[^2]: RDF Data Cube Vocabulary, W3C Recommendation 16 January 2014
<https://www.w3.org/TR/vocab-data-cube/#metadata>

0 comments on commit 21bc43e

Please sign in to comment.