Dulani/beer-data-science

Overview

This is a preliminary, strictly for-fun foray into beer data. Pairs well with most session IPAs.

All beer data was grabbed from the BreweryDB API and dumped into a MySQL database. You can find the main report in compile.md.

The main question I went into the analysis with was: how well do beer styles actually describe the characteristics of beers within each style? In other words, do natural clusters in beer align well with style boundaries?

I set about answering this with a mix of clustering (k-means) and classification (multinomial neural net and random forest) methods.
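
To make the clustering half of that question concrete, here is a minimal sketch in Python (the repository's own scripts are in R). It clusters a handful of beers with k-means and checks how well the clusters line up with style labels. The ABV/IBU values, the function names `kmeans` and `purity`, and the choice of cluster purity as the agreement measure are all assumptions for illustration; they are not taken from compile.md.

```python
import random
from collections import Counter

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's k-means; returns a cluster index for each point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # start from k distinct data points
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                new_centers.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centers.append(centers[c])  # leave an empty cluster's center alone
        if new_centers == centers:
            break  # converged
        centers = new_centers
    return labels

def purity(labels, styles):
    """Fraction of beers whose style is the majority style of their cluster."""
    by_cluster = {}
    for l, s in zip(labels, styles):
        by_cluster.setdefault(l, Counter())[s] += 1
    return sum(max(c.values()) for c in by_cluster.values()) / len(styles)

# Made-up (ABV, IBU) pairs with made-up style labels, for illustration only.
beers = [
    ((4.5, 15.0), "Lager"), ((4.8, 18.0), "Lager"), ((5.0, 20.0), "Lager"),
    ((6.5, 60.0), "IPA"), ((7.0, 65.0), "IPA"), ((6.8, 70.0), "IPA"),
]
points = [p for p, _ in beers]
styles = [s for _, s in beers]

labels = kmeans(points, k=2)
print(purity(labels, styles))  # 1.0 for this toy data: clusters match styles exactly
```

A purity near 1 would mean the natural clusters follow style boundaries closely, while a lower value would mean styles blur together. On real data the features would likely need scaling first, since IBU spans a much wider numeric range than ABV.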

Reproduce it

To grab the data yourself, you can create an API key on BreweryDB and run the run_it.R script inside the run_it folder. For a quicker but less up-to-date solution (the BreweryDB database is updated pretty frequently), feel free to download beer_necessities.csv.

This analysis deals mainly with beer and its constituent components like ingredients (hops, malts) and other characteristics like bitterness and alcohol content. However, you can easily construct your own function for grabbing other things like breweries, glassware, locations, etc. by running the function generator in analyze/construct_funcs.R.

Any and all feedback is more than welcome. Cheers!
