audio-datasets

Here are 18 public repositories matching this topic...

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

data voice voice-commands dataset voice-recognition noise voice-chat datasets voice-control voice-conversion voice-assistant voice-activity-detection voice-synthesis audio-datasets voice-computing voice-dataset voice-datasets audio-dataset

Updated Jun 6, 2024

SuperKogito / SER-datasets

Sponsor

Star

A collection of datasets for the purpose of emotion recognition/detection in speech.

audio speech datasets emotions emotions-recognition speech-emotion-recognition audio-datasets multimodal-emotion-recognition

Updated Sep 30, 2024
HTML

DagsHub / audio-datasets

Star

open-source audio datasets

audio open-source hacktoberfest audio-datasets hacktoberfest2022 codepeak hacktoberfest-2022 hacktoberfest-2023 hacktoberfest22 hacktoberfest-22 codepeak2022

Updated Sep 7, 2023

ynop / audiomate

Star

Python library for handling audio datasets.

audio music speech speech-recognition dataset-filtering noise dataset-creation dataset-manager corpus-tools data-loader audio-datasets

Updated Jul 6, 2023
Python

sovaai / sova-dataset

Star

audio open-source data opensource opendata corpus open-data dataset audio-data datasets russian-datasets audio-datasets chinese-dataset voice-dataset voice-datasets audio-dataset voice-data sova-dataset english-datasets

Updated Nov 8, 2022

Audio-WestlakeU / audiossl

Star

A library built for easier audio self-supervised training, downstream tasks evaluation

pytorch audio-classification audioset nsynth speech-commands audio-datasets self-supervised-learning voxceleb1 urbansound8k pytorch-lightning audio-representation audio-self-supervised-learning audio-pretraining

Updated Aug 27, 2024
Python

Audio-WestlakeU / RealMAN

Star

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NIPS 2024]

multi-channel speech-enhancement microphone-array-processing doa-estimation audio-datasets sound-source-localization microphone-audio-capture real-world-datasets

Updated Dec 11, 2024
Python

silenterus / deepspeech-cleaner

Star

Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework

machine-learning mozilla speech-recognition dataset-filtering dataset-creation dataset-manager multilanguage corpus-tools deepspeech audio-datasets

Updated May 22, 2023
Python

MorenoLaQuatra / audioset-download

Star

This package aims at simplifying the download of the AudioSet dataset.

downloader audioset audio-datasets audioset-download

Updated Sep 28, 2023
Python

A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code

python data-science data machine-learning deep-learning data-collection dataset-generation text-datasets audio-datasets scarper image-data-generator

Updated Nov 19, 2023
Python

freds0 / katube

Star

KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a list of YouTube playlists or YouTube channels, KATube will generate dataset with audios and texts.

audio-datasets

Updated Jul 27, 2024
Python

Rumeysakeskin / Speech-Datasets-for-ASR

Sponsor

Star

Download speech datasets (English and non-English) for Automatic Speech Recognition

speech-synthesis speech-recognition speech-to-text speech-processing asr speech-dataset audio-datasets voice-datasets common-voice-dataset voxforge-dataset

Updated Jan 22, 2023
Jupyter Notebook

hugolpz / LanguagesGallery

Star

[v.1.0] Lingualibre Languages Gallery in VueJS.

lingualibre audio-datasets languages-spoken

Updated Aug 5, 2024
CSS

Metiu-Metiu / Neural-Texture-Sound-synthesis---data-sets

Star

Synthetic sounds datasets and real sounds datasets of waterflow sounds for the repo 'Neural-Texture-Sound-Synthesis-with-physically-driven-continuous-controls'.

data-augmentation audio-segmentation synthetic-dataset-generation audio-datasets synthetic-dataset real-dataset audio-dataset-for-machine-learning