Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset not from datasets lib #4

Open
netagl opened this issue Apr 15, 2024 · 5 comments
Open

Dataset not from datasets lib #4

netagl opened this issue Apr 15, 2024 · 5 comments

Comments

@netagl
Copy link

netagl commented Apr 15, 2024

Hi,
Is it possible to run this pipline for dataset that is not in dataset library?
tnx!

@ylacombe
Copy link
Collaborator

Hey @netagl, not currently, what dataset and what dataset format do you have in mind ?
Note that it's quite easy to add a dataset to the library (and you can keep it private if you want of course)!

@netagl
Copy link
Author

netagl commented Apr 15, 2024

How can I add dataset to the library?
I do not have a specific dataset yet. right now Im working on getting relevant audios & transcription for my task, and would like to add them as a dataset to the dataset lib in order to run your pipeline (and eventually, run parler TTS) @ylacombe

@netagl
Copy link
Author

netagl commented Apr 16, 2024

How can I add dataset to the library?
I do not have a specific dataset yet. right now Im working on getting relevant audios & transcription for my task, and would like to add them as a dataset to the dataset lib in order to run your pipeline (and eventually, run parler TTS) @ylacombe

Hey @netagl, not currently, what dataset and what dataset format do you have in mind ? Note that it's quite easy to add a dataset to the library (and you can keep it private if you want of course)!

How can I add dataset to the library?
I do not have a specific dataset yet. right now Im working on getting relevant audios & transcription for my task, and would like to add them as a dataset to the dataset lib in order to run your pipeline (and eventually, run parler TTS) @ylacombe

@ylacombe
Copy link
Collaborator

You should be able to do it following instructions you can find on the datasets docs here, let me know if that helps

@ylacombe
Copy link
Collaborator

Additionally, I've added a FAQ with your question answered here:
https://github.com/huggingface/dataspeech?tab=readme-ov-file#how-do-i-use-datasets-that-i-have-with-this-repository

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants