ParisClustering

This is my first ever GitHub project. Any kind of review is much appreciated.

Clustering of different tourist attractions in Paris.

I once read an interesting article (https://towardsdatascience.com/using-unsupervised-learning-to-plan-a-paris-vacation-geo-location-clustering-d0337b4210de) about clustering landmarks in Paris based on their longitude and latitude. The point? Create n clusters, one per day of the trip, so that cluster m contains the landmarks to visit on day m. I found the article very interesting since I happened to be going to Paris soon and still had no plans in mind!

In the article, the author uses sklearn's k-means and HDBSCAN to cluster the landmarks into 10 clusters.

I had some questions about which algorithm I should use (k-means, HDBSCAN, mean-shift, agglomerative, etc.). I also wondered whether to normalize the longitude and latitude axes, since in Paris the longitudes are ≈ 2 and the latitudes are ≈ 48, and I didn't want the clustering to be influenced by one axis over the other because of its larger or smaller scale.

In this project, I use sklearn's agglomerative, k-means, mean-shift, DBSCAN, and spectral clustering, as well as HDBSCAN from the hdbscan package: 6 algorithms in total. For each algorithm, I run 2 versions, one without normalization and one with mean normalization, x = (x - mean(x)) / (max(x) - min(x)), and I compare the results. Some of these algorithms let me select the number of clusters that I want, but for others (like DBSCAN) I had to select the minimal number of points for a cluster to be kept.
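As a rough illustration of the comparison described above, here is a minimal sketch and not the notebook's actual code. The helper name mean_normalize, the made-up points, and the parameter values (n_clusters=3, eps=0.15, min_samples=3) are all assumptions chosen for the example.

```python
import numpy as np
from sklearn.cluster import DBSCAN, KMeans


def mean_normalize(points):
    """Apply x = (x - mean(x)) / (max(x) - min(x)) to each column."""
    points = np.asarray(points, dtype=float)
    span = points.max(axis=0) - points.min(axis=0)
    return (points - points.mean(axis=0)) / span


# Made-up (longitude, latitude) pairs roughly inside Paris, for illustration only.
rng = np.random.default_rng(0)
centers = np.array([[2.29, 48.86], [2.34, 48.86], [2.34, 48.89]])
coords = np.vstack([c + rng.normal(scale=0.003, size=(10, 2)) for c in centers])

scaled = mean_normalize(coords)

# k-means: the number of clusters is chosen up front.
kmeans_raw = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(coords)
kmeans_scaled = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(scaled)

# DBSCAN: no cluster count is given. Instead, eps sets the neighbourhood
# radius and min_samples the minimum number of points needed to form a
# cluster. Points that fit no cluster are labelled -1 (noise).
dbscan_scaled = DBSCAN(eps=0.15, min_samples=3).fit_predict(scaled)

print("k-means (raw):   ", kmeans_raw)
print("k-means (scaled):", kmeans_scaled)
print("DBSCAN (scaled): ", dbscan_scaled)
```

After mean normalization each axis has mean 0 and a range of exactly 1, so neither longitude nor latitude dominates the distance computation because of its scale.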

The file named doc.kml contains the tourist attractions in Paris together with their coordinates. The file named Paris-Clustering.ipynb contains the code for the 6 algorithms and the code that extracts the data from doc.kml.
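The extraction step could look something like the sketch below. It assumes that doc.kml follows the usual KML layout, with each attraction stored as a Placemark holding a name and a Point whose coordinates read "longitude,latitude[,altitude]". The actual structure of doc.kml is not shown here, so the sample document, the landmark names, and the function name parse_placemarks are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the standard KML layout (not the real doc.kml).
SAMPLE_KML = """<kml xmlns="http://www.opengis.net/kml/2.2">
  <Document>
    <Placemark>
      <name>Example landmark A</name>
      <Point><coordinates>2.2945,48.8584,0</coordinates></Point>
    </Placemark>
    <Placemark>
      <name>Example landmark B</name>
      <Point><coordinates>2.3376,48.8606,0</coordinates></Point>
    </Placemark>
  </Document>
</kml>"""


def parse_placemarks(kml_text):
    """Return a list of (name, longitude, latitude) tuples from KML text."""
    root = ET.fromstring(kml_text)
    rows = []
    # "{*}" matches any XML namespace (supported since Python 3.8).
    for placemark in root.findall(".//{*}Placemark"):
        name = placemark.findtext("{*}name", default="").strip()
        coord_text = placemark.findtext(".//{*}coordinates")
        if coord_text is None:
            continue  # skip placemarks that carry no coordinates
        # KML stores longitude first, then latitude, then optional altitude.
        lon, lat = (float(v) for v in coord_text.strip().split(",")[:2])
        rows.append((name, lon, lat))
    return rows


rows = parse_placemarks(SAMPLE_KML)
for name, lon, lat in rows:
    print(name, lon, lat)
```

To try this on the real file, something like parse_placemarks(open("doc.kml", encoding="utf-8").read()) should work if the file matches the assumed layout.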

