Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Calculating TF-IDF #7

Open
sunhwap opened this issue Apr 6, 2014 · 3 comments
Open

Calculating TF-IDF #7

sunhwap opened this issue Apr 6, 2014 · 3 comments

Comments

@sunhwap
Copy link
Collaborator

sunhwap commented Apr 6, 2014

This issue deals with making user interact with school data via ESPN API.

[edit: changed title to reflect actual issue; ESPN API use/school data was covered in issue #5. -@britth]

@sunhwap
Copy link
Collaborator Author

sunhwap commented Apr 6, 2014

We can use an existing library that will calculate TF-IDF for obtaining most frequent terms or use Scikit-learn to simplify bag of words to TF-IDF.

@sunhwap
Copy link
Collaborator Author

sunhwap commented Apr 11, 2014

This link seems to be helpful for extracting terms using tf-idf.
http://stevenloria.com/finding-important-words-in-a-document-using-tf-idf/

@britth
Copy link
Owner

britth commented Apr 14, 2014

@sunhwap thanks for the link! definitely looks like some code we could modify for this

@britth britth changed the title Schools-ESPN API Calculating TF-IDF Apr 14, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants