Many contemporary personality psychologists believe that there are five basic dimensions of personality, often referred to as the "Big 5" personality traits.
Using clustering techniques, we determined clusters of people using these dimensions of personalities based on a dataset from Kaggle (https://www.kaggle.com/tunguz/big-five-personality-test) collected through an interactive online personality test.An application relying on NoSQL databases that allows people with similar personalities to know each other has been created relying on the best clustering technique. Different online data streaming clustering techniques have also been compared using the MOA framework.
The project is organized as follows:
- analysis/ contains the Jupyter notebooks used to compare the different clustering techniques
- moaAnalysis/ contains the results of the different data streaming clustering techniques
- frontend/ contains the source code of the Java application working as a frontend
- backend/ contains the source code of the Java application working as a backend
More information on the analysis and the software architecture are contained in the documentation and presentation.