Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

clustering movie data from imdb issue #153 #156

Merged
merged 2 commits into from
Jun 2, 2024
Merged

Conversation

Roverbk
Copy link
Contributor

@Roverbk Roverbk commented Jun 1, 2024

#153
[clustering movie data from imdb issue]

Description
1.We scrape IMDb data including movie titles, ratings, and summaries using BeautifulSoup
2.We use TF-IDF vectorization to convert the movie summaries into numerical feature vectors.
3.We apply K-means clustering to group the movies into clusters based on their summaries.
4.We use PCA for dimensionality reduction to visualize the clusters in a 2D space.

Type of PR

  • Bug fix
  • Feature enhancement
  • Documentation update
  • Other (specify): _______________

image
image
image

Checklist:

  • I have performed a self-review of my code
  • I have read and followed the Contribution Guidelines.
  • I have tested the changes thoroughly before submitting this pull request.
  • I have provided relevant issue numbers, screenshots, and videos after making the changes.
  • I have commented my code, particularly in hard-to-understand areas.

Co-authored-by: Mercia Jeno [email protected]
Co-authored-by: Bhakti Kalyankasture [email protected]

@Roverbk
Copy link
Contributor Author

Roverbk commented Jun 1, 2024

@sanjay-kv Can you merge it.
attaching link: #108

@sanjay-kv sanjay-kv merged commit 8a4bb9d into recodehive:main Jun 2, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants