Project Description: This project is our final homework assignment for the Stat 159 Course taught at UC Berkeley. For this assignment, we analyze the publicly available Personal Key Indicators of Heart Disease Dataset. Here is a description of the files in the repo:
data
is a folder that contains the .csv file of the dataset used in our analysisfigures
is a folder that contains all of the .png files we generated from our analysistools
is a folder that contains the custom functions we used in our analysis, along with some simple tests for these functionscontributionstatement.md
is a Markdown file outlining each group member's contributions to this repoEDA.ipynb
is a Jupyter Notebook displaying the exploratory data analysis we ran on the datasetenvironment.yml
is a .yml file that can be used to reproduce the environment we ran our analysis onhw07-description.md
is a Markdown file containing the instructions for this homework assignmentLICENSE
is a license for our workmain.ipynb
is a Jupyter Notebook that contains a detailed description of our analysis and findingsMakefile
is a Makefile that allows the easy rerunning of our analysismodel.ipynb
is a Jupyter Notebook that displays the process by which we created our classifiers from the datasetpyproj.toml
is used to install our custom packagesetup.cfg
is used to install our custom packagesetup.py
is used to install our custom package_config.yml
is used to produce the jupyter book_toc.yml
is used to produce the jupyter book
Citations: The Dataset Used in Our Analysis: https://www.kaggle.com/datasets/kamilpytlak/personal-key-indicators-of-heart-disease
The Full CDC BRFSS Dataset: https://www.cdc.gov/brfss/annual_data/annual_2020.html