From ed696a983e7a078f8aeaddb07642c3b5184d38c3 Mon Sep 17 00:00:00 2001 From: Jim Schwoebel <30424731+jim-schwoebel@users.noreply.github.com> Date: Wed, 11 Nov 2020 10:50:18 -0500 Subject: [PATCH] Update README.md Added Thorsten dataset, per https://github.com/jim-schwoebel/voice_datasets/issues/6 --- README.md | 1 + 1 file changed, 1 insertion(+) diff --git a/README.md b/README.md index c53e274..0a71ea8 100644 --- a/README.md +++ b/README.md @@ -42,6 +42,7 @@ There are two main types of audio datasets: speech datasets and audio event/musi * [Spoken Wikipeida Corpora](https://nats.gitlab.io/swc/) - 38 GB in size available in both audio and without audio format. * [Tatoeba](https://tatoeba.org/eng/downloads) - Tatoeba is a large database of sentences, translations, and spoken audio for use in language learning. This download contains spoken English recorded by their community. * [Ted-LIUM](https://www.openslr.org/51/) - The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website (noncommercial). +* [Thorston dataset](https://github.com/thorstenMueller/deep-learning-german-tts/) - German language dataset, 22,668 recorded phrases, 23 hours of audio, phrase length 52 characters on average. * [TIMIT dataset](https://catalog.ldc.upenn.edu/LDC93S1) - TIMIT contains broadband recordings of 630 speakers of eight major dialects of American English, each reading ten phonetically rich sentences. It includes time-aligned orthographic, phonetic and word transcriptions as well as a 16-bit, 16 kHz speech waveform file for each utterance (have to pay). * [VCTK dataset](https://datashare.is.ed.ac.uk/handle/10283/3443) - 110 English speakers with various accents; each speaker reads out about 400 sentences. Samples are mostly 2–6 s long, at 48 kHz 16 bits, for a total dataset size of ~10 GiB. * [VCTK-2Mix](https://github.com/JorisCos/VCTK-2Mix) - VCTK-2Mix is an open source dataset for source separation in noisy environments. It is derived from VCTK signals and WHAM noise. It is meant as a test set. It will also enable cross-dataset experiments.