Skip to content

TheFebrin/Sentiment-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Feb 6, 2020
300f0b6 · Feb 6, 2020

History

47 Commits
Jan 20, 2020
Feb 6, 2020
Jan 29, 2020
Dec 29, 2019
Feb 6, 2020
Feb 6, 2020
Dec 30, 2019
Feb 1, 2020

Repository files navigation

Sentiment analysis

Sentiment analysis based on movies reviews.

Table of contents

General info

The main goal of this project is to predict the sentiments of sentences.
We work with IMDB movies reviews dataset. Movies are rated from 1 to 10. We consider only the reviews with rating 1 (the worst ones) and 10 (the best ones).
The assumption is that movies with rating 1 have negative sentiment and movies with rating 10 have a positive sentiment.

Technologies

  • Python - version 3.7.3

Libraries

  • numpy
  • pandas
  • matplotlib
  • seaborn
  • sklearn
  • nltk
  • scipy
  • glob
  • re
  • cvxopt
  • tqdm

Results

  • Accuracies below are measured on test data.
  • Baseline accuracy: 52%.
  • Naive Bayes: 89.95%.
  • Logistic Regression: 89.92%.
  • SVM: 92.16%.
  • Heuristic Naive Bayes: 90.32%.

Example screenshot

Status

Project is: finished,

Inspiration

Machine Learning class.

Authors

Created by @TheFebrin, @MatMarkiewicz and @gKlocek