ashenjy/spark-submit-airflow-aws

spark-submit-airflow-aws

Tech:

Python, PySpark, Airflow, AWS S3, AWS EMR

Movie review classifier

  1. Clean the input data.
  2. Use a pre-trained model to make predictions.
  3. Write the predictions to an HDFS output.
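
The three steps above would typically live in a single PySpark script that EMR runs through spark-submit. The following is a minimal sketch of how such a step could be described in Python, not the repository's actual code. The function name `build_spark_step`, the S3 paths, and the `--input`/`--output` script arguments are all hypothetical. The dictionary layout follows the step format that the EMR API accepts, with `command-runner.jar` invoking `spark-submit`.

```python
from typing import Any, Dict


def build_spark_step(
    script_uri: str,
    input_uri: str,
    output_uri: str,
    name: str = "Classify movie reviews",
) -> Dict[str, Any]:
    """Return one EMR step definition that runs a PySpark script via spark-submit.

    The --input and --output flags are assumptions: they only work if the
    PySpark script itself parses arguments with those names.
    """
    return {
        "Name": name,
        # Keep the cluster alive on failure so the logs can be inspected.
        "ActionOnFailure": "CANCEL_AND_WAIT",
        "HadoopJarStep": {
            # command-runner.jar lets an EMR step run a command such as spark-submit.
            "Jar": "command-runner.jar",
            "Args": [
                "spark-submit",
                "--deploy-mode", "cluster",
                script_uri,
                "--input", input_uri,
                "--output", output_uri,
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical locations; replace with real bucket names and paths.
    step = build_spark_step(
        script_uri="s3://example-bucket/scripts/classify_reviews.py",
        input_uri="s3://example-bucket/data/movie_reviews.csv",
        output_uri="/output",
    )
    print(step["HadoopJarStep"]["Args"])
```

A list of such dictionaries can be handed to whatever adds steps to the cluster, for example an Airflow task that submits EMR steps. Building the step as plain data keeps the paths in one place and makes the definition easy to check before anything is sent to AWS.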
