A bachelor's final-year project combining various deep learning models to build a pipeline for diagnosing articulation and speech disorders.

Requirements:
- Python
- Conda
Setup:
- `git clone` this repository, then run `git submodule init` and `git submodule update` to initialize the submodules
- Create a conda environment `voca` and resolve its dependencies from the voca directory
- Create a conda environment `deca` and install its dependencies from the deca directory
- Create a conda environment `autoeditor` and run `pip install auto-editor`
- Create a conda environment `pyqt` and run `pip install pyqt5`
- Activate the `pyqt` environment and launch the speech app (a consolidated command sketch follows this list)
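For reference, the whole setup roughly reduces to the commands below. This is a sketch, not the project's exact procedure: the Python versions, the `requirements.txt` file names, and the `speech_app.py` entry point are assumptions, so check the voca and deca directories (and the app's source) for the exact names and versions they expect.

```bash
# Clone the repository and initialize the VOCA/DECA submodules
git clone <repository-url>
cd <repository-folder>
git submodule init
git submodule update

# One conda environment per component (Python versions below are assumptions)
conda create -n voca python=3.6 -y
conda activate voca
pip install -r voca/requirements.txt    # assumed dependency file name
conda deactivate

conda create -n deca python=3.7 -y
conda activate deca
pip install -r deca/requirements.txt    # assumed dependency file name
conda deactivate

conda create -n autoeditor python=3.9 -y
conda activate autoeditor
pip install auto-editor
conda deactivate

conda create -n pyqt python=3.9 -y
conda activate pyqt
pip install pyqt5

# Launch the speech app from the pyqt environment
python speech_app.py    # hypothetical entry point; use the app's actual main script
```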
Pipeline:
- The user provides input in the form of a video
- The frame rate of the input video is changed to 24 fps
- Silent portions are removed
- The duration is adjusted
- Audio is extracted from the video
- The video is converted to frames
- The frames are converted to 3D meshes
- The 3D meshes are compared with the standard (approximate commands for the preprocessing steps are sketched below)
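The speech app drives these preprocessing steps through its own scripts; as a rough manual equivalent they can be reproduced with ffmpeg and auto-editor, roughly as follows. File names, the WAV sample format, and auto-editor's default output name are assumptions; mesh generation and comparison are handled by the project's VOCA/DECA code and are not shown here.

```bash
# 1. Re-encode the input video at 24 fps
ffmpeg -i input.mp4 -filter:v fps=24 input_24fps.mp4

# 2. Remove silent portions (run from the autoeditor environment);
#    recent auto-editor versions write the result next to the input,
#    e.g. input_24fps_ALTERED.mp4
conda activate autoeditor
auto-editor input_24fps.mp4

# (the duration-adjustment step depends on the chosen standard and is not shown)

# 3. Extract the audio track as a mono WAV file
ffmpeg -i input_24fps_ALTERED.mp4 -vn -acodec pcm_s16le -ac 1 audio.wav

# 4. Dump the video frames as images for 3D reconstruction
mkdir -p frames
ffmpeg -i input_24fps_ALTERED.mp4 frames/frame_%04d.png
```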
By default, this repository contains only one standard stream. If you wish to add more standard words, perform the following steps:
- Make sure selenium is installed
- Place your desired words in the `words.txt` file to scrape them from the online dictionary
- Run `audio_scraper.py`
- Place the scraped mp3 files into the standard audios folder
- Activate the `voca` environment
- Run `preprocess_audios.py`
- Run `generate_vertices.py`
- You should now see your standard's 3D meshes generated by the VOCA model (a command sketch follows this list)
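Assuming the scraper reads `words.txt` from the repository root, saves its mp3 files to the current directory, and the standard audio folder is named `standard audios` (all assumptions about the repository layout), the workflow looks roughly like this:

```bash
# selenium is required by the scraper; install it in whichever environment runs it
pip install selenium

# 1. List the words to scrape, one per line
printf "apple\nbanana\n" >> words.txt

# 2. Scrape pronunciations from the online dictionary
python audio_scraper.py

# 3. Move the downloaded mp3 files into the standard audios folder
#    (source and destination paths are assumptions)
mv *.mp3 "standard audios/"

# 4. Generate the standard 3D meshes with the VOCA model
conda activate voca
python preprocess_audios.py
python generate_vertices.py
```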