GitHub - aminhbl/Data-Mining-Assignments: Preprocessing, Classification, Clustering and Association Rules

Data Mining

Assignments of Data Mining course

Preprocessing

This assignment includes handling of missing data, conversion of categorical values, data normalization, dimension reduction and data visualization.

Classification

For this assignment we use data to try and reason the way our neural network model would perform classification with different hyperparameters. We compare the accuracy and loss of the model by changing activation and loss functions, layering and number of neurons. We find the optimum learning ratio and finally propose an accurate model.

Clustering

For this assignment we use blobs as our data and K-means algorithm to cluster them and visualize the result. We further explore the performance and limitations of K-means by trying to determine the right value of K, for which we use elbow method.
Then we use K-means for clustering digits and compare the performance of our model with or without dimension reduction and visualize the results.
Further we compress the image of a bird by performing K-means algorithm and clustering colors to 4 separate clusters and replacing each color with centroid of it's respective cluster.
We also use DBSCAN algorithm and find the best eps for it by performing KNN and KneeLocator where K is equal to minPts and compare the result with K-means by visualization.

Association Rules

For this assignment we use Hypermarket dataset and Apriori algorithm to find all frequent itemsets and generate high confidence rules from each frequent itemset for specific support, confidence and lift.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
README.md		README.md
association_rules.ipynb		association_rules.ipynb
clustering.ipynb		clustering.ipynb
neural_network_classification.ipynb		neural_network_classification.ipynb
preprocessing.ipynb		preprocessing.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Mining

Preprocessing

Classification

Clustering

Association Rules

About

Releases

Packages

Languages

aminhbl/Data-Mining-Assignments

Folders and files

Latest commit

History

Repository files navigation

Data Mining

Preprocessing

Classification

Clustering

Association Rules

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages