Skip to content

STARR OMOP size

Michael Wornow edited this page Dec 14, 2022 · 3 revisions

Size of full STARR OMOP dump: som-rit-phi-starr-prod.starr_omop_cdm5_deid_2022_12_03

Size (.csv.gz) Table
386G Total (all tables)
138G observation
136G note
79G measurement
14G drug_exposure
6.6G procedure_occurrence
5.6G condition_occurrence
3.6G visit_occurrence
1.1G condition_era
694M drug_era
594M concept_ancestor
430M concept_relationship
380M visit_detail
361M fact_relationship
240M concept
71M concept_synonym
58M observation_period
55M person
48M payer_plan_period
27M drug_strength
26M location
20M device_exposure
3.2M provider
240K death
56K care_site
20K relationship
12K vocabulary
12K concept_class
8.0K specimen
8.0K source_to_concept_map
8.0K note_nlp
8.0K metadata
8.0K dose_era
8.0K domain
8.0K cost
8.0K cohort_definition
8.0K cohort_attribute
8.0K cohort
8.0K cdm_source
8.0K attribute_definition

Size of 1% extract: som-rit-phi-starr-prod.starr_omop_cdm5_deid_1pcent_2022_12_03

Size (.csv.gz) Table
4.9G Total (all tables)
1.4G observation
1.3G note
754M measurement
594M concept_ancestor
430M concept_relationship
240M concept
103M drug_exposure
71M concept_synonym
49M procedure_occurrence
41M condition_occurrence
29M visit_occurrence
27M drug_strength
26M location
7.6M condition_era
5.0M drug_era
3.8M visit_detail
3.6M fact_relationship
3.2M provider
732K person
704K observation_period
528K payer_plan_period
216K device_exposure
56K care_site
20K relationship
12K vocabulary
12K concept_class
8.0K specimen
8.0K source_to_concept_map
8.0K note_nlp
8.0K metadata
8.0K dose_era
8.0K domain
8.0K death
8.0K cost
8.0K cohort_definition
8.0K cohort_attribute
8.0K cohort
8.0K cdm_source
8.0K attribute_definition

Note: To generate these tables, cd into the parent directory of your STARR OMOP dump and run du -h . | sort -rh