Skip to content

Openutau Yaml Dictionaries for Arpasing, En VCCV, Xsampa phonemizers, En Enunu, and En EnunuX/EnunuX to support Japanese, Korean, Chinese, Cantonese, French, and Spanish (more languages to support soon)

Notifications You must be signed in to change notification settings

Cadlaxa/Openutau-Yaml-Dictionaries

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 

Repository files navigation

Openutau Yaml Dictionaries

Openutau Yaml Dictionaries for Arpasing, En VCCV, Xsampa phonemizers, En Enunu, and En EnunuX/EnunuX to support Japanese, Korean, Chinese, and French (more languages to support soon)

📍 When using these dictionaries with Arpasing banks I recommend using my Syllable-Based ARPAsing Phonemizer because these dictionaries contains not so common [C V], [V C], and specially [C C] aliases, also it has a fallback feature for missing aliases and phonemes such as [ax], [dx], and [zh]

English to Japanese, English to Korean, English to Chinese, English to Cantonese,English to French, and English to Spanish (languages auxiliary support for now) for Arpasing, En VCCV, Xsampa banks, and Enunu/Enunux English banks for Openutau .yaml dictionaries, tweaked based on Slidingwall's synth dictionary.

📍 As of now, the bigger the .yaml file, the slower Openutau changes the phonemes or any changes in general, as of Ou doesn't supports yet the dictionary picking like on Synthv, for now, you can choose and download the single language dictionary or the compiled one. You can customize the dictionary as you please.

Credits

credits for mmemim for the Synthv French .json Dictionary. Credits to Halo/BagelHero & Mim for the Ou En VCCV Custom Dictionary and credits to Lotte's Spanish cmudict for the en2sp base.


  • if there's any issue on the dictionary, and language request to support Openutau's dictionary, you can contact me through my Twitter. Let me know if there's a problem tehee.

📍 There may be wrong entries and mispronunciation in the dictionary, please report them if so. Making your english voicebanks sings in different languages will still require phoneme editing as this lexicon is by no means a plug-and-play solution especially Korean, Chinese, French and the Spanish.

⭐ The dictionaries are updated regularly so feel free to download the latest one here or in this mirror link. ⭐

Table of contents


How to download and install dictionaries for Openutau

  • To download any .yaml dictionary, click on the filename and then click on the download icon in the upper left corner of the file browser, as shown in the image below.

dl

  • If it doesn't show up like that, click Raw...

dl2

  • Then right click, and click save as...

dl3

  • Rename the extension to .yaml not .txt then save

dl4

  • Open your Openutau folder and go to Plug-ins folder. Drag and drop the yaml there and you are ready to go

yamls

  • Reload the english voicebank by choosing any phonemizers then back to the english one or changing voicebanks then going back again to the desired voicebank and you should now be able utilize the dictionary. If it doesn't load up, close and reopen OpenUtau

How to use the dictionary + the suffix support

Japanese Dictionary Usage

  • You can input Hiragana, Katakana and romaji c v in the notes to utilize the japanese dictionary. It also supports the usage of eg: っば, っいぇ, っしょ and glottal stops on vowels eg: あ2, い2, う2, え2, お2, ん2

Japanese Dictionary

Korean Dictionary Usage

  • You can input Hanguul and Romaja with . after the romaja word in the notes to utilize the Korean dictionary. Some have alternate pronunciations by inputing 1, 2, and 3 after the word. Note that korean consonant assimilation is not supported because of the complex character-to-phoneme system with the Hangul input, so using Romaja to place consonants such as ss., p., bb. and so on the note.

Korean Dictionary

  • consonant assimilation kr consonant assimilation

Chinese Dictionary Usage

  • Only Pinyin characters with / after the word to utilize the Chinese dictionary. Multiple pronunciations now supported. eg: ra/1 as [zh, ah], chun/ as [ch, w, ax, n], chun/1 as [ch, uw, n]

Chinese Dictionary

  • Multiple pronunciation variations

ch new

Cantonese Dictionary Usage

  • Only Jyutping characters with \ after the word to utilize the Cantonese dictionary. Multiple pronunciations aresupported. eg: yuk\ as [y, ih, k], yuk\1 as [y, k], byui\1 as [b, y, ih], byui\1 as [b, y, iy].

canto

French Dictionary Usage

  • You can input French, with * after the word in the notes to utilize the French dictionary. Based with mmemim's Synthv French .json Dictionary, If you want to say words like l'année or j'en you'll have to split them in two notes l' + année and j' + en as shown below.

French Dictionary

fr1

  • In French, you also have liaisons, and some words might not always be pronounced the same way. Because of that, you may sometimes need to put word(2), word(3) or even word(4). For example, absentes has 4 entries.

WordEntries

Spanish Dictionary Usage

  • You can input español with ~ after the spanish word in the notes to utilize the Spanish dictionary. Note that rolled r's or rr doesn't support by default on the dictionary itself, if your voibecank have a rolled r extras, you can add them to the dictionary by specifying the phoneme to the symbol list and replacing the dx, dx phoneme with the rolled r phoneme. If your vb doesn't have rolled r extras, you can find a way to do them, mine I just add as many r q r q or r/dx then glottals to the rolled r part of the word spa 2

  • Rolled r's with r's and glottal magic ✨✨ spa rr

  • also words without accent doesn't work eg: útiles~ and you typed utiles~, ou will output the english pronunciation eg: utiles~ to [y, uw, t, ah/ax, l, z] so it is important to add the symbol accents. If this feature is annoys you (also myself as well lmao) just let me know, I'll make a variant without the symbol accents


How to edit .yaml dictionary to fit a specific voicebank

  • Some voicebanks come with extra phonemes especially with En VCCV and Xsampa voicebanks, test out first the dictionary so you will know if its there are errors you will encounter like on this picture below: example

  • Editing the .yaml file via notepad or any other editing software you have to edit your desired phoneme per voicebank. I recommended to copy first one language then edit with the instructions inside the .yaml file

  • If it doesn't work, batch rename your vb's oto or wait for the dev team to make an phoneme fallback to fix the issue.

vccv yaml

About

Openutau Yaml Dictionaries for Arpasing, En VCCV, Xsampa phonemizers, En Enunu, and En EnunuX/EnunuX to support Japanese, Korean, Chinese, Cantonese, French, and Spanish (more languages to support soon)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published