Multimodal Sentiment Analysis

Model

This repository explores multimodal sentiment analysis using a dual encoder for text and images, i.e. BERT + ResNet50. Combining both modalities provides a more holistic view of the sentiment expressed in text and images.

  • Text+Image Neural Network: BERT + CNN

(Figure: training of a multimodal CLIP-BERT model using MLM, where an image is represented by CLIP.)

The above image gives a rough idea of how multimodal neural networks work. The image is passed through the image encoder (in this case, ResNet50) and the text is tokenized and passed through the text encoder (i.e. BERT). The outputs of both networks are concatenated and passed through a fully connected layer whose output size matches the number of labels to predict. For our sentiment analysis case there are just two: 1 (positive) and 0 (negative).
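As a rough illustration, here is a minimal sketch of that dual-encoder fusion in PyTorch, assuming `bert-base-uncased` as the text encoder and torchvision's ResNet50 as the image encoder. The class name `MultimodalSentimentClassifier` and the exact layer sizes are illustrative, not taken from this repository's code.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet50, ResNet50_Weights
from transformers import BertModel

class MultimodalSentimentClassifier(nn.Module):
    def __init__(self, num_labels: int = 2):
        super().__init__()
        # Text encoder: pooled [CLS] representation, 768-d for bert-base.
        self.text_encoder = BertModel.from_pretrained("bert-base-uncased")
        # Image encoder: ResNet50 with its classification head removed, 2048-d features.
        backbone = resnet50(weights=ResNet50_Weights.DEFAULT)
        self.image_encoder = nn.Sequential(*list(backbone.children())[:-1])
        # Fully connected head over the concatenated text + image features.
        self.classifier = nn.Linear(768 + 2048, num_labels)

    def forward(self, input_ids, attention_mask, pixel_values):
        text_feat = self.text_encoder(
            input_ids=input_ids, attention_mask=attention_mask
        ).pooler_output                                          # (batch, 768)
        img_feat = self.image_encoder(pixel_values).flatten(1)   # (batch, 2048)
        fused = torch.cat([text_feat, img_feat], dim=1)          # (batch, 2816)
        return self.classifier(fused)                            # logits: 0 = negative, 1 = positive
```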

Result

While fine-tuning the model, we found that the text-only model reaches 99 percent accuracy on the SentiCap evaluation set, whereas the multimodal network comes close to 100 percent on the same set. This suggests the model captures the data's features quite accurately. Accuracy may not stay this high on other datasets, but feeding both text and image shows a clear improvement over the text-only model.
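For reference, accuracy on an evaluation split could be computed along the following lines. The `eval_loader`, yielding tokenized text, preprocessed images, and labels, is an assumed helper and not something defined in this repository.

```python
import torch

@torch.no_grad()
def evaluate(model, eval_loader, device="cuda"):
    # Computes plain classification accuracy over the evaluation set.
    model.eval()
    correct, total = 0, 0
    for batch in eval_loader:
        logits = model(
            batch["input_ids"].to(device),
            batch["attention_mask"].to(device),
            batch["pixel_values"].to(device),
        )
        preds = logits.argmax(dim=1)
        correct += (preds == batch["labels"].to(device)).sum().item()
        total += batch["labels"].size(0)
    return correct / total
```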

About

A multimodal neural network for sentiment analysis on the dual modality (text + image).
