Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

dataset #7

Open
mshbaita-jo opened this issue Jan 29, 2024 · 2 comments
Open

dataset #7

mshbaita-jo opened this issue Jan 29, 2024 · 2 comments

Comments

@mshbaita-jo
Copy link

I need to understand the format of the dataset I should use ... and what is it contain exactly ?
I mean these files:
"dataset_name": "ylacombe/english_dialects",
"dataset_config_name": "welsh_female",
"override_speaker_embeddings": true,
"filter_on_speaker_id": 5223,

@ylacombe
Copy link
Owner

Hello,
dataset_name corresponds to the dataset name in the HuggingFace hub, here it is: https://huggingface.co/datasets/ylacombe/english_dialects
dataset_config_name is the name of the "subdataset" you're using, here the Welsh female configuration.

In this dataset, there is a column with speaker ids, so I used filter_on_speaker_id=5223 to filter on the speaker id 5223 and throw out the rest.

override_speaker_embeddings=True is because you want your speaker embeddings to start from scratch instead of continuing training on the previous speaker embeddings of the model

@mshbaita-jo
Copy link
Author

mshbaita-jo commented Jan 31, 2024

The dataset you are using contains the audios and text in one data frame. But for me I have a folder that contains the audios with mp3 format and another TSV file that contains the names of the audios, the texts, and the speaker_id. How can I handle this dataset? and integrate it with the json configuration file?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants