Skip to content

Latest commit

 

History

History
39 lines (30 loc) · 1.31 KB

README.md

File metadata and controls

39 lines (30 loc) · 1.31 KB

PROJECT MEMBERS:

GOALS: Provide Microsoft with actionable insights on how to start a movie studio.

  1. Provide analysis on most financially successful genres (profitable, high ROI)
  2. Provide insights on movie-goers most highly rated genres

FILE SUMMARY:

Deliverables

  1. index.ipynb
  2. mod1_movie_deck.pdf
  3. data/movie_data.csv.gz
  4. data/movie_data_genre_breakout.csv.gz

Python Module

  1. movie_data.py
  2. movie_data/chart.py
  3. movie_data/clean.py
  4. movie_data/split_fields.py

Data cleaning and EDA Notebooks

  1. notebooks/generate_clean_dataframe.ipynb
  2. notebooks/load_data_tables.ipynb
  3. notebooks/split_data_tables.ipynb
  4. notebooks/movie_eda.ipynb
  5. notebooks/more_movie_eda.ipynb

To load the datafis-mod1-project

FIS project one repo

import movie_data as md

rootdir = 'mydata/' # default is 'data/'
df = md.generate_movie_analysis_df(rootdir)