Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About num_workers > 1 #48

Open
PSacfc opened this issue Aug 9, 2024 · 1 comment
Open

About num_workers > 1 #48

PSacfc opened this issue Aug 9, 2024 · 1 comment

Comments

@PSacfc
Copy link

PSacfc commented Aug 9, 2024

Thanks for your excellent work.
I got a question about why performance will decrease when num_workers > 1. What if only one single dataset is used

@tung-nd
Copy link
Collaborator

tung-nd commented Aug 13, 2024

We implemented a highly customized IterableDataset for dealing with the large dataset size, and num_workers works differently compared to a standard pytorch dataset. We did not find data loading to be the bottleneck so we always set num_workers=1.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants