Skip to content

TeamP3-2/ML-Recommendation-System

Repository files navigation

Course Recommendation System

This project was developed as part of a hackathon challenge to build a Course Recommendation System using various datasets from Coursera, Stanford, and Harvard. The final recommendation system is content-based and built using TF-IDF vectorization and cosine similarity. Below is a description of each file in the project.


Project Structure and File Descriptions

Data Files (Stored in the data folder)

  • Coursera.csv:
    The main dataset used for building the recommendation system, containing course descriptions and metadata.

  • coursera_processed_data.csv:
    A processed version of the Coursera dataset with clean and structured data for easier analysis.

  • coursera_processed_data_with_cluster.csv:

Clustering and Recommendation Files

  • clustering.ipynb:
    This notebook contains the code for applying K-means clustering to the Coursera dataset to generate course categories. Categories like "Art" and "Machine Learning" were created to help manage and filter the dataset.

  • recommendation_system.ipynb:
    The main notebook for building the recommendation system. It uses TF-IDF Vectorization to convert course descriptions into feature vectors and cosine similarity to recommend courses based on content.

Similarity Matrix

  • similarity_matrix.pkl:
    A serialized similarity matrix used to speed up the recommendation process by storing precomputed course similarities.
    u can download it from here

Conclusion

This project demonstrates the steps taken to create a content-based recommendation system by exploring, clustering, and analyzing multiple datasets. The system is designed for easy dataset management and includes a dashboard for viewing course categories.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published