Skip to content

SebastianVeuskens/Covid-19-Clustering-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Clustering Analysis of countries using the COVID-19 cases dataset

University-based extended paper reproduction and analysis

General information

Context

  • University course Machine Learning Systems for Data Science
  • Project timeframe from December 2022 until January 2023
  • Reproduction of Paper Analysis

Authors

  • Alberto Trashaj
  • Manuel Rech
  • Sebastian Benno Veuskens

Institution

University of Bologna

Data overview

All data are collected in accordance with the paper procedure from the John-Hopkins University website. The raw data is available here.

Datasets

Methods and Analysis

Selection criteria

30 countries are selected for each dataset, based on one of the two selection critera, respectively:

  • The countries with the highest number of cases on the 4th of April 2020
  • The countries where cases occured first

Agglomerative Clustering

Based on these datasets and selected countries, an agglomerative clustering algorithm is applied.

Extension

Based on the outcome of the clustering algorithm, the countries are embedded into a world map. Their cluster membership is made visually available within the context of all countries.

References

The referenced paper is available here. The description for the university course in Machine Learning can be found here.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published