Accent Recognition with Sequential MFCC Features

This project addresses the challenge that accent diversity poses for English speech recognition systems, aiming to improve their accuracy with a supervised machine learning model. The model uses sequential Mel-frequency cepstral coefficient (MFCC) features to classify speakers as having either an Indian or an American English accent.

Methodology

Proposed Method: MFCC features are extracted frame by frame from each audio signal and kept in time order as the input for accent identification.
Dataset: The dataset comprises 3-5 second audio clips from the VCTK corpus, split into training (80%) and testing (20%) sets.
Preprocessing: To address class imbalance, oversampling is applied to the training set. Feature extraction computes 20 MFCCs for each audio file using the Librosa library.
Feature Extraction: MFCCs are calculated for each audio frame, and the per-frame coefficients are concatenated in sequence to form the input vector for accent classification.
Supervised Learning: Several classifiers are used for accent classification: K-Nearest Neighbors, Support Vector Machines, Gaussian Mixture Models, Neural Networks, and Logistic Regression.

Results and Discussion

Evaluation Metrics: Precision, recall, reject rate, and overall accuracy are used to evaluate classifier performance.
Top Performers: Neural Networks, K-Nearest Neighbors, and Logistic Regression perform best, with Neural Networks emerging as the top classifier.
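
The metrics listed above could be computed along these lines. This is a sketch under stated assumptions: the project does not define "reject rate", so here it is assumed to mean the fraction of clips whose highest class probability falls below a confidence threshold. The `evaluate` function, the `threshold` value, and the macro averaging are illustrative choices, not the project's.

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score


def evaluate(y_true, proba, classes, threshold=0.6):
    """Score predictions, rejecting clips whose top probability is too low.

    proba has one row per clip and one column per entry in classes.
    The threshold is an assumed value chosen only for illustration.
    """
    y_true = np.asarray(y_true)
    proba = np.asarray(proba)
    classes = np.asarray(classes)

    accepted = proba.max(axis=1) >= threshold
    reject_rate = float(1.0 - accepted.mean())
    if not accepted.any():
        # Nothing left to score once every clip has been rejected.
        return {"reject_rate": reject_rate, "accuracy": None,
                "precision": None, "recall": None}

    y_pred = classes[proba.argmax(axis=1)]
    yt, yp = y_true[accepted], y_pred[accepted]
    return {
        "reject_rate": reject_rate,
        "accuracy": accuracy_score(yt, yp),
        "precision": precision_score(yt, yp, average="macro", zero_division=0),
        "recall": recall_score(yt, yp, average="macro", zero_division=0),
    }
```

With a scikit-learn classifier, `proba` would typically come from `predict_proba` on the test set and `classes` from the fitted model's `classes_` attribute. Accuracy, precision, and recall are computed only on the clips that were not rejected.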

Conclusion and Future Work

Promising Results: Accent classification with sequential MFCC features shows promising results, reaching up to 95% accuracy.
Future Work: Areas for future development include integrating this classifier into speech recognition systems, extending the model to more accents and dialects, and exploring Hidden Markov Models for classification.

Note

This project was developed during the IIIT Hyderabad Hackathon as part of the Qualcomm problem statement on Accent Detection.
The evaluation criteria include accuracy scores, novelty in methodology or ideation, and the clarity of presentation.

syedamaann/accent-detection
