Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

failure to unzip the dataset #2

Open
verbose-void opened this issue Jul 31, 2022 · 2 comments
Open

failure to unzip the dataset #2

verbose-void opened this issue Jul 31, 2022 · 2 comments

Comments

@verbose-void
Copy link

on ubuntu 20.04 unzipping the dataset (i selected all files and clicked download) results in a 6.4gb zip file that appears corrupted:

$ unzip -t data/OneDrive_1_7-31-2022.zip 
Archive:  data/OneDrive_1_7-31-2022.zip
warning [data/OneDrive_1_7-31-2022.zip]:  2068632925 extra bytes at beginning or within zipfile
  (attempting to process anyway)
error [data/OneDrive_1_7-31-2022.zip]:  start of central directory not found;
  zipfile corrupt.
  (please check that you have transferred or created the zipfile in the
  appropriate BINARY mode and that you have compiled UnZip properly)
@verbose-void
Copy link
Author

quick fix: download 7zip and use the 7zz x files.zip command

@Miffyli
Copy link
Collaborator

Miffyli commented Jul 31, 2022

Cheers for the info! Sadly I have no control over how onedrive zips the videos... I would have thought it works <.<. Maybe it works on Windows only.

Anywho, the full dataset will use the format of the OpenAI VPT videos where we will provide a Python script that downloads a desired number of videos for you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants