Voice-Based-Gender-Classification

Voice-based gender classification is a process of determining the gender of an individual based on their voice characteristics. It utilizes various techniques from the field of speech processing and machine learning to analyze and extract features from the voice signal that are indicative of the person's gender.

The process involves the following steps:

Data Collection: A dataset of voice recordings from individuals of different genders is collected. The dataset should be diverse and representative of the target population.
Feature Extraction: Various acoustic features are extracted from the voice recordings, such as fundamental frequency (pitch), spectral characteristics, and temporal patterns. These features capture the unique characteristics of the voice that are correlated with gender differences.
Model Training: Machine learning algorithms, such as logistic regression, decision tree classifier, or random forest classifier are trained using the extracted features and the corresponding gender labels from the dataset. The model learns to map the voice features to the corresponding gender categories.
Testing and Evaluation: The trained model is then tested on new voice samples to predict the gender of the speaker. The accuracy of the classification is evaluated by comparing the predicted gender with the known ground truth labels.

Dataset

The following acoustic properties of each voice are measured and included within the CSV:

meanfreq: mean frequency (in kHz)
sd: standard deviation of frequency
median: median frequency (in kHz)
Q25: first quantile (in kHz)
Q75: third quantile (in kHz)
IQR: interquantile range (in kHz)
skew: skewness (skewness is a measure of the asymmetry of a distribution)
kurt: kurtosis (kurtosis measures the peakedness or flatness of a distribution compared to a normal distribution)
sp.ent: spectral entropy
sfm: spectral flatness
mode: mode frequency
centroid: frequency centroid (see specprop)
meanfun: average of fundamental frequency measured across acoustic signal
minfun: minimum fundamental frequency measured across acoustic signal
maxfun: maximum fundamental frequency measured across acoustic signal
meandom: average of dominant frequency measured across acoustic signal
mindom: minimum of dominant frequency measured across acoustic signal
maxdom: maximum of dominant frequency measured across acoustic signal
dfrange: range of dominant frequency measured across acoustic signal
modindx: modulation index. Calculated as the accumulated absolute difference between adjacent measurements of fundamental frequencies divided by the frequency range
label: male or female

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
dataset		dataset
LICENSE		LICENSE
README.md		README.md
app.py		app.py
audio_features.py		audio_features.py
gender_classification_ml_v2.ipynb		gender_classification_ml_v2.ipynb
gender_classification_ml_v2.py		gender_classification_ml_v2.py
gender_identification.py		gender_identification.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Voice-Based-Gender-Classification

Dataset

About

Uh oh!

Releases

Packages

Languages

License

AkhilKas/Voice-Based-Gender-Classification

Folders and files

Latest commit

History

Repository files navigation

Voice-Based-Gender-Classification

Dataset

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages