Skip to content

Preprocessing, Classification, Clustering and Association Rules

Notifications You must be signed in to change notification settings

aminhbl/Data-Mining-Assignments

Repository files navigation


Data Mining

Assignments of Data Mining course


Preprocessing

This assignment includes handling of missing data, conversion of categorical values, data normalization, dimension reduction and data visualization.

Classification

For this assignment we use data to try and reason the way our neural network model would perform classification with different hyperparameters. We compare the accuracy and loss of the model by changing activation and loss functions, layering and number of neurons. We find the optimum learning ratio and finally propose an accurate model.

Clustering

For this assignment we use blobs as our data and K-means algorithm to cluster them and visualize the result. We further explore the performance and limitations of K-means by trying to determine the right value of K, for which we use elbow method.
Then we use K-means for clustering digits and compare the performance of our model with or without dimension reduction and visualize the results.
Further we compress the image of a bird by performing K-means algorithm and clustering colors to 4 separate clusters and replacing each color with centroid of it's respective cluster.
We also use DBSCAN algorithm and find the best eps for it by performing KNN and KneeLocator where K is equal to minPts and compare the result with K-means by visualization.

Association Rules

For this assignment we use Hypermarket dataset and Apriori algorithm to find all frequent itemsets and generate high confidence rules from each frequent itemset for specific support, confidence and lift.

About

Preprocessing, Classification, Clustering and Association Rules

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published