spark-python

Apache Spark examples in Python:

  • Read a local or S3 CSV file into a DataFrame
  • Filter using a UDF or a regex
  • Filter using a timestamp
  • Write to Postgres
  • Aggregate using SQL GROUP BY / PARTITION BY
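
The steps listed above could be combined roughly as in the sketch below. It is not the repository's actual code: the column names (`code`, `event_time`, `amount`), the regex, the table name `code_totals`, and the function names `is_valid_code` and `run_pipeline` are all assumptions made for illustration. Reading from S3 additionally assumes the `hadoop-aws` package is configured, and writing to Postgres assumes the PostgreSQL JDBC driver is on the Spark classpath.

```python
import re
from datetime import datetime

# Hypothetical pattern: keep rows whose `code` looks like "AB-1234".
CODE_PATTERN = re.compile(r"[A-Z]{2}-\d{4}")


def is_valid_code(value):
    """Plain-Python predicate; wrapped as a Spark UDF in run_pipeline."""
    return value is not None and CODE_PATTERN.fullmatch(value) is not None


def run_pipeline(csv_path, cutoff: datetime, jdbc_url=None, jdbc_props=None):
    # Imported lazily so the predicate above is usable without Spark installed.
    from pyspark.sql import SparkSession, functions as F
    from pyspark.sql.types import BooleanType

    spark = SparkSession.builder.appName("spark-python-sketch").getOrCreate()

    # csv_path may be a local path or an s3a:// URI.
    df = spark.read.csv(csv_path, header=True, inferSchema=True)

    # Filter using a UDF that applies the regex, then by timestamp.
    valid_code = F.udf(is_valid_code, BooleanType())
    filtered = (
        df.filter(valid_code(F.col("code")))
          .filter(F.to_timestamp(F.col("event_time")) >= F.lit(cutoff))
    )

    # Aggregate with SQL: GROUP BY for totals, PARTITION BY for a window.
    filtered.createOrReplaceTempView("events")
    totals = spark.sql(
        "SELECT code, SUM(amount) AS total FROM events GROUP BY code"
    )
    ranked = spark.sql(
        "SELECT *, ROW_NUMBER() OVER "
        "(PARTITION BY code ORDER BY event_time) AS rn FROM events"
    )

    # Write the aggregate to Postgres only when a JDBC URL is supplied.
    if jdbc_url:
        totals.write.jdbc(
            url=jdbc_url, table="code_totals",
            mode="append", properties=jdbc_props,
        )
    return totals, ranked
```

A call might look like `run_pipeline("data/events.csv", datetime(2024, 1, 1))`, with a hypothetical `jdbc_url` such as `jdbc:postgresql://localhost:5432/mydb` and `jdbc_props` carrying `user`, `password`, and `driver` (`org.postgresql.Driver`). For a simple pattern like this one, the built-in `Column.rlike` is usually cheaper than a Python UDF; the UDF form is shown because the list mentions it.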