Skip to content

HodardCodeclub/Data_Science_Nanodegree

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

programming-for-data-science

Hodard Hazwinayo

This repository contains projects for Udacity Programming For Data Science.

Project 1

Overview

This project consisted in the exploration of a provided database movie rentals in which the student had to run SQL queries and build visualizations to showcase the output of the student's queries.

For the presentation component, the student had to create four slides, and each should have (a) a question of interest, (b) a supporting SQL query needed to answer the question, (c) a supporting visualization created using the final data of the SQL queries that answer the questions of interest, and (d) a small summary on each slide.

Project Submission

You can find the queries here.

Question Sets

A set of questions have also been provided by Udacity so that we were free to consider and include them in our Project Submission. These were solely provided for our convenience, and we were able to choose between any of the questions in these sets or none at all in your project submission.Despite having not used any of the questions provided by Udacity, I still chose to query each question.

Project 2

Overview

In this project, the student had to make use of Python to explore data related to bike share systems for three major cities in the United States — Chicago, New York City, and Washington. The student had to write code to (a) import the data and answer interesting questions about it by computing descriptive statistics, and (b) write a script that takes in raw input to create an interactive experience in the terminal to present these statistics.

Project Submission

The developed CLI program allows the user to explore an US bikeshare system database and retrieve statistics information from the database. The user is able filter the information by city, month and weekday, in order to visualize statistics information related to a specific subset of data. The user is also able to chose to view raw data and to sort this data by columns, in ascending or descending order.

Project Submission.

Requirements

This program was written in Python (version 3.6.8) and relies on the following libraries:

pandas==0.23.4 numpy==1.15.4 Click==7.0

Project 3

This project consisted in using Git and GitHub with a simulated workflow to refactor Project 2.

Project Submission

The Project Submission consisted in the filling of a template with all the Git commands used to perform the requested tasks.