Skip to content

Polarity classification for products and services reviews written in Darija

License

Notifications You must be signed in to change notification settings

ohidaoui/SentiDarija

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SentiDarija

Polarity classification for products and services reviews written in Darija.

The project aims to develop a sentiment analysis system specifically designed for analyzing product and services reviews written in Darija, the Moroccan dialect, supporting Arabic & Arabizi writing styles. The goal is to provide businesses operating in Morocco with valuable insights into customer sentiments and opinions.

The project involves the following key steps:

  1. Conducting a survey of existing techniques and datasets for sentiment analysis, with a specific focus on resources available for Darija.

  2. Data Collection: Gathering a dataset of Darija products and services reviews from relevalt Youtube videos and Facebook groups.

  3. Data Preprocessing: Cleaning and preparing the collected data for analysis, including text normalization and removal of noise, this step includes writing style specific processing.

  4. Exploring various DL techniques for polarity classification.

  5. Fine-tuning DarijaBERT-mix, a pre-trained transformer-based model specifically designed for darija.

  6. Application Development: Buiding a Demo Streamlit web application.