Skip to content

Data Profiling Tools

Eddie Lin edited this page Jan 18, 2017 · 3 revisions

Data Profile

We want to generate some data profile after the open schema created for better understanding of our data and evaluating the quality of data. We will output a csv file as a data profile report.

  • Descriptive Statistics
  • Unique job title/frequency
  • Unique occupation/frequency
  • Number of posting
  • Geo-related stats:
  • Time-related stats:
  • Word counts for each field
  • Missing value counts for each field
Clone this wiki locally