Skip to content

Latest commit

 

History

History
13 lines (11 loc) · 1010 Bytes

README.md

File metadata and controls

13 lines (11 loc) · 1010 Bytes

Tweets Analysis

This project has been developed to address the task of retrieving and analyzing tweets from Twitter API to match the requests of the Big Data Analytics course from the Master Degree in Artificial Intelligence at UB/UPC/URV.

What is this project about?

It consists of different kind of analysis made on Tweets objects, such as:

  • the languages they're written in;
  • the sources(i.e. the devices/apps) they're written through;
  • their number of retweets/responses;
  • a deeper analysis on the text(s) features.

All the tasks have been done by using Python, with its useful pandas, numpy, scikit-learn libraries, and MongoDB for storing the retrieved tweets (both queried in real time or previously tweeted ones). Finally, to address the assignment of analyzing texts, it's been crucial the usage of nltk package.

For further explanations, please read the Wiki.