Not able to download the dataset #5

Open
sparsh999gupta opened this issue Sep 27, 2022 · 1 comment
Comments

sparsh999gupta commented Sep 27, 2022

Hi @Edresson ,

Thanks for open-sourcing this data.

CORAA dataset: [Link]
Unfortunately, I am not able to download the training set (train.zip) from the given [Google Drive link].

The gdown command also fails after downloading around 55-60% of the 59 GB of data.
These commands did not work for the train set:

gdown --id 1deCciFD35EA_OEUl0MrEDa7u5O2KgVJM -O train.zip
unzip train.zip

I was able to download the dev set successfully.

If I download directly from the Google Drive link, it also fails at around the same percentage.
I believe this is due to a quota limit set by Google Drive.

Please let me know if there is some other way to obtain this data.
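
Since the download stops partway through, it may help to confirm whether a local train.zip is complete before running unzip on it. The following is a minimal sketch using only Python's standard zipfile module; the helper name archive_is_complete is hypothetical and is not part of gdown or the CORAA repository. A truncated zip is missing the central directory stored at the end of the file, so opening it fails.

```python
import zipfile


def archive_is_complete(path):
    """Return True if `path` opens as a zip and every member passes its CRC check.

    Hypothetical helper for illustration; not part of gdown or CORAA.
    """
    try:
        with zipfile.ZipFile(path) as zf:
            # testzip() reads every member and returns the first corrupt
            # name, or None if all CRCs match.
            return zf.testzip() is None
    except (zipfile.BadZipFile, OSError):
        # A truncated or missing file ends up here.
        return False


if __name__ == "__main__":
    # "train.zip" is the output name used in the gdown command above.
    print(archive_is_complete("train.zip"))
```

Note that testzip() reads the whole archive, so on a 59 GB file this check will take a while.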

arnaldocan commented Dec 16, 2022

Hello @sparsh999gupta

Sorry for the delay in responding. We were finally able to set up a backup server while we analyse the Google Drive issues. If you still need to download the dataset, please follow the new link on this page:

https://github.com/nilc-nlp/CORAA
