Dataset structure #21

Open
MalcolmMielle opened this issue Apr 6, 2020 · 3 comments

@MalcolmMielle
Contributor

Hi all,

Since we should soon get the dataset up and running, I'd like to talk about how we plan to provide it to users, especially since students are going to be working on it.

building the dataset

I'm talking with Dave Hagman about separating the data we will get from the MD from a dataset of community samples. That way we have the original dataset from the doctor, which would be a medical dataset, and we can distribute the collection app to other users and build a larger (but less accurate) community dataset. I think the method we provide to the MD has to score high on the medical dataset, but the community dataset could be used for training.

Thoughts?

providing the dataset

What do you guys think about making only one half of the dataset public? The non-public part of the dataset could be used as the test subset. This way users would have access only to the training/validation set, not to the final test set.
This is only an idea I wanted to pitch, but we could work with the back-end people to create an architecture in which students can only upload their results (or method) and never see the test dataset. (I know some datasets have been set up this way by some universities.)
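
To make the idea concrete, here is a minimal sketch of a public/hidden split with server-side scoring. Everything in it is an assumption for illustration: the function names (`is_private`, `split_dataset`, `score_submission`), the salted-hash split rule, and accuracy as the metric are all hypothetical and not something the project has decided on. The hash-based rule gives roughly, not exactly, half of the samples to the hidden set.

```python
import hashlib


def is_private(sample_id, salt="demo-salt"):
    """Hypothetical rule: put a sample in the hidden test set based on a
    salted hash of its ID, so the assignment is stable across runs."""
    digest = hashlib.sha256((salt + sample_id).encode("utf-8")).digest()
    return digest[0] % 2 == 0


def split_dataset(labels, salt="demo-salt"):
    """Split {sample_id: label} into a public part and a hidden part."""
    public = {k: v for k, v in labels.items() if not is_private(k, salt)}
    private = {k: v for k, v in labels.items() if is_private(k, salt)}
    return public, private


def score_submission(predictions, private_labels):
    """Runs on the server only. Takes uploaded {sample_id: predicted_label}
    and returns a single accuracy value; the hidden labels are never returned.
    Samples missing from the upload count as wrong."""
    if not private_labels:
        raise ValueError("hidden test set is empty")
    correct = sum(
        1 for sid, label in private_labels.items()
        if predictions.get(sid) == label
    )
    return correct / len(private_labels)
```

In this sketch only the `public` half would be released for training/validation, while students upload predictions for the hidden sample IDs and get back just the score. The salt would have to stay secret on the server, otherwise anyone could recompute which samples are hidden.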

It's definitely low priority but I thought it would be interesting to raise this point.

@MohammedSoliman11

OK, as a student, how can I get this dataset so I can run the full model and test it?

@MalcolmMielle
Contributor Author

I don't think you can (or at least it isn't easy to get this data) at the moment. Sadly, the data wasn't collected as extensively as we had wished in the end. @YoniSchirris, do you still have access to it?

@YoniSchirris
Contributor

I'm afraid our collected dataset was never finished in this sense. However, you can play around with the public datasets that were previously collected.
