Udacity Data Analyst Nanodegree Program - Project 06: Explore and Summarize Red Wine Dataset

Introduction

In this project, we investigate a dataset containing information on 1,599 red wine samples, described by a variety of attributes such as fixed acidity, sulfur dioxide, pH and alcohol. Each sample is assigned a quality rating from 0 to 10.

Part 1: Univariate Plots and Analysis

Create plots and calculate descriptive statistics for each individual attribute to learn about its distribution and other noteworthy characteristics.
Note the structure of the dataset and its main features of interest, along with the other attributes most likely to influence the quality rating of the red wine samples.
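
As a rough illustration of the descriptive statistics mentioned above, here is a minimal Python sketch that uses only the standard library. The analysis in this repository lives in an R Markdown file, so this is not the project's own code. The `describe` helper and the `toy_alcohol` values are hypothetical and made up for the example; they are not taken from the dataset.

```python
import statistics


def describe(values):
    """Return basic descriptive statistics for a list of numbers."""
    ordered = sorted(values)
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, median, q3 = statistics.quantiles(ordered, n=4)
    return {
        "n": len(ordered),
        "min": ordered[0],
        "q1": q1,
        "median": median,
        "mean": statistics.mean(ordered),
        "q3": q3,
        "max": ordered[-1],
        "stdev": statistics.stdev(ordered),  # sample standard deviation
    }


# Made-up values for illustration only; NOT taken from the wine dataset.
toy_alcohol = [9.4, 9.8, 9.8, 10.0, 10.5, 11.2, 12.8]

summary = describe(toy_alcohol)
for name, value in summary.items():
    print(f"{name:>6}: {value:.2f}")
```

Comparing the mean with the median in such a summary gives a quick hint about skew before any histogram is drawn.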

Part 2: Bivariate Plots and Analysis

Examine the relationship between each pair of attributes through a correlation matrix and a scatterplot matrix. Pairs of attributes that show at least moderate correlation are explored further using plots and other statistical tools such as the Pearson product-moment correlation coefficient.
Analyze and draw conclusions about the correlations between these attributes, highlighting the strongest or most interesting relationships and any less obvious insights we found.
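
For readers unfamiliar with the Pearson product-moment correlation coefficient, the sketch below computes it from its definition in plain Python. It is an illustration under stated assumptions, not the code used in the report: the function names, the toy numbers, and the 0.3 cutoff for "at least moderate" are all hypothetical, since this README does not state the threshold that was applied.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation: covariance divided by the product of spreads."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with >= 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sums of products of deviations from the means (unnormalised).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


def is_at_least_moderate(r, threshold=0.3):
    """Assumed cutoff for illustration; the README does not specify one."""
    return abs(r) >= threshold


# Made-up numbers for illustration only; NOT taken from the wine dataset.
toy_x = [1.0, 2.0, 3.0, 4.0, 5.0]
toy_y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(toy_x, toy_y)
print(f"r = {r:.3f}, at least moderate: {is_at_least_moderate(r)}")
```

The coefficient ranges from -1 to 1 and only captures linear association, which is one reason the scatterplots are inspected alongside it.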

Part 3: Multivariate Plots and Analysis

Dive deeper into the red wine dataset by examining the relationships among several features at once, using density plots and scatter plots.
Report on the significant relationships identified in this section, including a number of surprising interactions between features that went unnoticed in the earlier parts.

Part 4: Final Plots and Summary

The three most important findings of the project, together with their accompanying plots and details, are gathered and documented in this last section.
We close with some final reflections on our effort, most notably the challenges we faced and overcame in order to deliver a meaningful and comprehensive report on time for our readers.



drtuanhung/DA-Exploratory-Data-Analysis
