improve download experience #28

Open
Fadelis98 opened this issue May 27, 2024 · 3 comments

Comments

@Fadelis98

The full .db files are too large to download. Would it be possible to provide compressed versions split into smaller sections?

@KuzmaKhrabrov
Contributor

KuzmaKhrabrov commented May 28, 2024

Hello! For these purposes there are train2k/test2k versions. Are they still too large?

@Fadelis98
Author

> Hello! For these purposes there are train2k/test2k versions. Are they still too large?

Sorry for not making my point clear. I know there are small subsets that are easier to access, but my concern is not that the dataset contains too much data. I do want to use the full dataset. The problem is that downloading it (e.g., 7 TB for the hamiltonian data) takes several weeks because of limited international bandwidth, and the connection is sometimes lost partway through. If the files were split into chunks, they would be much easier to download.
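To make the chunking request concrete, here is a minimal Python sketch of splitting a large file into fixed-size parts and rebuilding it, with a SHA-256 check that the rebuilt file matches the original. This is only an illustration of the idea, not something the project provides: the function names, the `.partNNNNN` naming scheme, and the 64 MiB chunk size are all hypothetical.

```python
import hashlib
import shutil
from pathlib import Path

# Hypothetical chunk size (64 MiB); a real value would depend on link quality.
CHUNK_SIZE = 64 * 1024 * 1024


def sha256_of(path, block_size=1024 * 1024):
    """Return the hex SHA-256 digest of a file, reading it block by block."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()


def split_file(src, out_dir, chunk_size=CHUNK_SIZE):
    """Split src into numbered parts inside out_dir; return the part paths in order."""
    src, out_dir = Path(src), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    parts = []
    with open(src, "rb") as f:
        index = 0
        while True:
            data = f.read(chunk_size)
            if not data:
                break
            # Hypothetical naming scheme: <original name>.part00000, .part00001, ...
            part = out_dir / f"{src.name}.part{index:05d}"
            part.write_bytes(data)
            parts.append(part)
            index += 1
    return parts


def join_files(parts, dst):
    """Concatenate the parts, in the given order, into dst."""
    with open(dst, "wb") as out:
        for part in parts:
            with open(part, "rb") as f:
                shutil.copyfileobj(f, out)
```

With files published this way, a dropped connection would cost at most one part rather than the whole download, and comparing `sha256_of(rebuilt)` against a published digest would confirm that the reassembled file is intact.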

@KuzmaKhrabrov
Contributor

I see! Thank you for pointing this out. We will try to find a solution. For now, you can work with the wavefunction archives and reconstruct the corresponding datasets from them.
