You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
from datasets import load_dataset
dataset = load_dataset("PixArt-alpha/SAM-LLaVA-Captions10M")
Expected behavior
The dataset should load immediately as it does when loaded through a normal indexed WebDataset loader. Generating splits should be optional and there should be a message showing how to disable it.
Describe the bug
Loading a simple webdataset takes ~45 minutes.
Steps to reproduce the bug
Expected behavior
The dataset should load immediately as it does when loaded through a normal indexed WebDataset loader. Generating splits should be optional and there should be a message showing how to disable it.
Environment info
datasets
version: 2.20.0huggingface_hub
version: 0.24.1fsspec
version: 2024.5.0The text was updated successfully, but these errors were encountered: