Skip to content

ipums/hipread

Folders and files

NameName
Last commit message
Last commit date

Latest commit

b9a4578 · Oct 22, 2024
Sep 30, 2022
Apr 23, 2020
Jun 12, 2018
Sep 30, 2022
Oct 18, 2024
Oct 18, 2024
Sep 30, 2022
Apr 29, 2020
Jul 10, 2018
May 30, 2018
Sep 30, 2022
Nov 29, 2023
May 30, 2018
Dec 28, 2018
Nov 29, 2023
Sep 30, 2022
Sep 30, 2022
Jul 10, 2018
Nov 29, 2023
May 13, 2019

Repository files navigation

hipread

Project Status:Active CRAN status R-CMD-check AppVeyor build status

hipread (hierarchical IPUMS reader) is a fork from tidyverse readr that allows for reading hierarchical fixed width text files, like those created by the CSPro software and commonly used by census data providers.

Compared to readr it is:

  • Able to natively read the "hierarchical" fixed width file format that IPUMS and some other census data providers use. These files can have multiple types of observations in them, each with their own specification of variables.

  • Better at reading gzipped data. It does not require loading the full file into a raw vector, which takes a large amount of memory, and prevents reading gigantic files altogether (because R can only store raw vectors of a certain size).

  • Less flexible. It only works on fixed width files, only accepts data of types character, double and integer, and is less detailed about the information it gives about parsing failures. This makes it easier for me to maintain.

I do not expect that this will be directly useful for too many people, so the documentation is a little bit light. Instead I expect most users will use this package through the ipumsr package. But, if you are interested and find something confusing, please let me know!

Installation

Install the development version from GitHub with:

# install.packages("devtools")
devtools::install_github("ipumsr/hipread")