This project analyzes referral data from Southern High School in Louisville, KY. A referral is a disciplinary record that tracks student behavior. On each referral, teachers and administrators record the behavior event, the student's identification, and comments.
The analysis addresses the following questions:
- What is the most written type of referral?
- For a specific referral, is there bias towards a certain group?
- How has the number of referrals progressed over time?
- How are teachers documenting the situation that constituted the referral?
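As a rough illustration of how the first and third questions could be answered with pandas, the sketch below counts referrals by type and by month. The rows are made up for the example (the real behaviorEvents.csv is not in the repository), and the event type names are assumptions, not values taken from the data.

```python
import pandas as pd

# Made-up rows for illustration only; event type names are hypothetical.
events = pd.DataFrame({
    "Event Type": ["Tardy", "Disruption", "Tardy", "Dress Code", "Tardy"],
    "Date": ["2023-08-15", "2023-08-20", "2023-09-01", "2023-09-03", "2023-09-10"],
})
events["Date"] = pd.to_datetime(events["Date"])

# Most written type of referral: count rows per event type.
type_counts = events["Event Type"].value_counts()
most_common_type = type_counts.idxmax()

# Referrals over time: count rows per calendar month.
referrals_per_month = events.groupby(events["Date"].dt.to_period("M")).size()

print(most_common_type)
print(referrals_per_month)
```

With the real file, the hand-built frame would be replaced by `pd.read_csv("behaviorEvents.csv")`, assuming the column names match the list of variables in this README.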
Two data sets were used: behaviorEvents.csv and southernGrades.csv. Both were obtained from JCPS and are listed in the .gitignore file to comply with FERPA. behaviorEvents.csv has the following variables:
- School
- Group By
- Submitted By
- Event Type
- Date
- Student
- Student Number
- Grade
- Role
- Comments
To obtain each student's race, academy, and gender, southernGrades.csv was used. southernGrades.csv has the following variables:
- Grade
- Last
- First
- Student ID
- Academy
- Section
- Class Name
- Letter Grade
- Percentage Grade
- Teacher
- Race
- Gender
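A minimal sketch of how the two files might be joined is shown below. It assumes that Student Number in behaviorEvents.csv matches Student ID in southernGrades.csv, and that southernGrades.csv holds one row per student per class (suggested by the Section and Class Name columns), so the demographics are reduced to one row per student before merging. The sample values are made up.

```python
import pandas as pd

# Made-up rows standing in for behaviorEvents.csv.
events = pd.DataFrame({
    "Student Number": [101, 102, 101],
    "Event Type": ["Tardy", "Disruption", "Tardy"],
})

# Made-up rows standing in for southernGrades.csv (one row per class).
grades = pd.DataFrame({
    "Student ID": [101, 101, 102],
    "Class Name": ["Algebra", "English", "Algebra"],
    "Academy": ["A", "A", "B"],
    "Race": ["X", "X", "Y"],
    "Gender": ["F", "F", "M"],
})

# Keep one demographics row per student so the merge does not duplicate referrals.
demographics = grades[["Student ID", "Academy", "Race", "Gender"]].drop_duplicates(
    subset="Student ID"
)

# Assumption: "Student Number" and "Student ID" identify the same student.
merged = events.merge(
    demographics,
    how="left",
    left_on="Student Number",
    right_on="Student ID",
    validate="many_to_one",
)

# Referral counts by group for one (hypothetical) event type.
by_group = merged[merged["Event Type"] == "Tardy"].groupby("Gender").size()
print(by_group)
```

Raw counts like `by_group` only show how referrals are distributed. Judging bias would also require comparing them against each group's share of enrollment.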
A new combined data set was created with student and teacher identification removed to comply with FERPA. A sample data set for running the program is provided.
The sample data set is also used to create a Tableau dashboard to display the data.
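One way the identifying columns could be removed is sketched below. This is an illustration under assumptions, not necessarily the approach used in this project: names and ID numbers are replaced with arbitrary integer codes (the hypothetical columns `student_code` and `teacher_code`), and the original columns are dropped. The sample values are made up.

```python
import pandas as pd

# Made-up rows standing in for the combined data set.
combined = pd.DataFrame({
    "Student": ["Doe, Jane", "Roe, John", "Doe, Jane"],
    "Student Number": [101, 102, 101],
    "Submitted By": ["Teacher A", "Teacher B", "Teacher A"],
    "Event Type": ["Tardy", "Disruption", "Tardy"],
})

# factorize assigns integer codes 0, 1, 2, ... in order of first appearance,
# so repeated students or teachers keep the same code across rows.
combined["student_code"] = pd.factorize(combined["Student Number"])[0]
combined["teacher_code"] = pd.factorize(combined["Submitted By"])[0]

# Drop the columns that directly identify a student or teacher.
deidentified = combined.drop(columns=["Student", "Student Number", "Submitted By"])
print(deidentified)
```

Free-text fields such as Comments may still contain names, so they would need separate review before sharing.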
All required packages can be installed from the requirements.txt file. The following packages are included:
- pandas
- matplotlib
- seaborn
The features included for the Data Analysis 2 Project are:
- Read two data files in CSV format
- Clean your data and perform a pandas merge with your two data sets, then calculate some new values based on the new data set.
- Make 3 matplotlib or seaborn (or another plotting library) visualizations to display your data.
- Utilize a virtual environment and include instructions in your README on how the user should set one up
- Annotate your .py files with well-written comments and a clear README.md.
To run the project, clone the repository and follow the steps for your operating system:
- Clone the repo: git clone https://github.com/samaguiar/southern_hs_referrals.git
On macOS or Linux:
- Create a virtual environment: python3 -m venv env
- Activate the virtual environment: source env/bin/activate
- Install requirements: pip install -r requirements.txt
- Start the program: python3 main.py
On Windows:
- Create a virtual environment: py -m venv env
- Activate the virtual environment: .\env\Scripts\activate
- Install requirements: py -m pip install -r requirements.txt
- Start the program: py main.py
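To confirm that the virtual environment is active before installing requirements, a quick check like the following can be run in the Python interpreter. It is a small sketch using only the standard library; the function name `in_virtual_environment` is made up for this example and is not part of the project.

```python
import sys


def in_virtual_environment() -> bool:
    """Return True when Python is running inside a venv.

    Inside a venv, sys.prefix points at the environment directory,
    while sys.base_prefix still points at the base installation.
    """
    return sys.prefix != sys.base_prefix


print("Virtual environment active:", in_virtual_environment())
```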
- [ ] Create a Tableau dashboard
- [ ] Allow user input to individualize reports and graphs (e.g., specific day, teacher name, student name)