Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List of Todo #1

Open
6 of 10 tasks
nileshprasad137 opened this issue Apr 10, 2018 · 0 comments
Open
6 of 10 tasks

List of Todo #1

nileshprasad137 opened this issue Apr 10, 2018 · 0 comments

Comments

@nileshprasad137
Copy link
Owner

nileshprasad137 commented Apr 10, 2018

  • Properly handle Named Entity Recognition
  • Handle Punctuations
  • Look for suitable Machine Algo to train -PCA, SVM, Artificial Neural Networks, Multiple Linear Regression
  • Split into training and test set and train on all 8 sets of essays and calculate accuracy. Plot graphs. (Kappa Values)
  • Improve Spelling Accuracy
  • Add more features in dataframe
    - No. of Stop Words in Sentence
    - Average length of Sentence in an essay
  • Improve code modularity (if possible, least preference)
  • Normalize wrong spellings, word count (on Scale of 1-100)
  • Optimize code (Time Complexity and Space complexity)
  • If time permits, Use Word2Vec, TextRank to understand sentence Structure. ( Less preference)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant