Preparing phoneme based dataset. #5

ahmedalbahnasawi · 2023-04-28T07:34:33Z

i'm dealing with Arabic text mapped to phoneme using my grapheme to phonemes model
eg: این مخزن شامل نمونه mapped to ' E N - M KH Z N - SH AE M L - N M W N HH '.
my phonemes list is the following: pho_ids = {'-':0, ' ZH':1, 'AE':2, 'SS':3, 'AE':4,'IY':5,.....,'eos': 55} where i have two letters representing one phoneme.

character_config=CharactersConfig(
  characters='ءابتثجحخدذرزسشصضطظعغفقلمنهويِپچژکگیآأؤإئ',
  punctuations='!(),-.:;? ̠،؛؟‌<>',
  phonemes='ˈˌːˑpbtdʈɖcɟkɡqɢʔɴŋɲɳnɱmʙrʀⱱɾɽɸβfvθðszʃʒʂʐçʝxɣχʁħʕhɦɬɮʋɹɻjɰlɭʎʟaegiouwy',
  pad="<PAD>",
  eos="<EOS>",
  bos="<BOS>",
  blank="<BLNK>",
  characters_class="TTS.tts.utils.text.characters.IPAPhonemes",
  )

I want to fix character_config to make it suits my experiment.
Many thanks

The text was updated successfully, but these errors were encountered:

karim23657 · 2023-05-02T10:08:14Z

I think , first you should phonemize all your dataset texts , then train model with a simple character config without phonemizer.
also edit L51 use_phonemes=False,
CharactersConfig without phonemes , and characters based on dataset characters

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preparing phoneme based dataset. #5

Preparing phoneme based dataset. #5

ahmedalbahnasawi commented Apr 28, 2023 •

edited

Loading

karim23657 commented May 2, 2023

Preparing phoneme based dataset. #5

Preparing phoneme based dataset. #5

Comments

ahmedalbahnasawi commented Apr 28, 2023 • edited Loading

karim23657 commented May 2, 2023

ahmedalbahnasawi commented Apr 28, 2023 •

edited

Loading