Transfer Learning to Quickly Adapt a Text-to-Speech System for the Hearing Impaired

STATEMENT

This repository contains most of the codes I used for my master's thesis: Transfer Learning to Quickly Adapt a Text-to-Speech System for the Hearing Impaired

This repository is a collaborative work with my supervisor, Dr. Josef Schlittenlacher (https://www.schlittenlacher.com/)

BRANCH CONTENT TABLE

Inference
(1) Inference mult: Used for Inference. Users need to manually create a test list of audio and mel (.wav and .wav.pt) based on their own directory.
MOS
(1) Calculate MOS: Calculates the MOS based on the raw data from Prolific (https://www.prolific.co/).
(2) MOS evaluation: Conducts data wrangling, descriptives and ANOVA on the MOS.
STOI
(1) Combined HI: Does the same job as STOI HI, but with Experiment I, Experiment IIa and IIb (i.e. standard audiogram and patient audiogram vocoders).
(2) Compute STOI: Compute STOI from the test speech and original speech.
(3) STOI HI: Reads cleaned STOI data from the patient-audiogram condition. Performs basic data wrangling and visualization before ANOVA.
(4) STOI demo: Generates heatmaps and waveforms to demonstrate the rationale of STOI.
(5) STOI standard_audiograms: Does the same job as STOI HI, but with the standard-audiogram vocoders.
Training
(1) Train config: configurations for transfer learning (need a fully-trained WaveGLow (630k iterations) before adaptation).
(2) Train HI: Training the vocoders according to a certain config file. This is the main training file.

OTHERS

(1) Some other MATLAB/Python codes I used in this thesis, such as inverse amplification and other training-related files, were modified based on codes owned by Josef Schlittenlacher and were not open source, thus I didn't post them here. If required, go to https://arxiv.org/abs/2012.02174 or https://github.com/js2251.
(2) The codes were not well-structured because of the follow-up (Exp IIb) test we included during our experiment and because I made some other attempts or changes during processing the data. However, the files contain all the essential scripts. One may selectively refer to part of these codes.

ACKNOWLEDGEMENT

I would like to thank my supervisor, Dr. Josef Schlittenlacher, for all his help and suggestions during my master's program.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transfer Learning to Quickly Adapt a Text-to-Speech System for the Hearing Impaired

STATEMENT

BRANCH CONTENT TABLE

OTHERS

ACKNOWLEDGEMENT

About

Releases

Packages

T4phage76/TTS-for-the-hearing-impaired

Folders and files

Latest commit

History

Repository files navigation

Transfer Learning to Quickly Adapt a Text-to-Speech System for the Hearing Impaired

STATEMENT

BRANCH CONTENT TABLE

OTHERS

ACKNOWLEDGEMENT

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages