What should the output of this repository be? #1

gwaybio · 2018-01-23T20:25:19Z

In my vision, the purpose of the repository is to store the code we used for data downloading and processing, but we don't actually require a user to run it all.

We could provide a link to a versioned figshare url storing the processed datasets. Or maybe we could even build a python package to quickly fetch the data in some sort of matrix form already. We can build upon these ideas here.

Either way, I do not think its necessary for intermediate or final output files to live here.

jaclyn-taroni · 2018-01-24T13:13:44Z

I'm assuming that your concern is file size -- likely too big for git lfs?

We could provide a link to a versioned figshare url storing the processed datasets. Or maybe we could even build a python package to quickly fetch the data in some sort of matrix form already.

I'm down with both of these options. I think ideally we would do both rather than one or the other.

gwaybio mentioned this issue Jan 23, 2018

Download Initial Data #2

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What should the output of this repository be? #1

What should the output of this repository be? #1

gwaybio commented Jan 23, 2018

jaclyn-taroni commented Jan 24, 2018

What should the output of this repository be? #1

What should the output of this repository be? #1

Comments

gwaybio commented Jan 23, 2018

jaclyn-taroni commented Jan 24, 2018