You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Since we should soon get the dataset up and running I'd like to talk about how we plan to provide it to users especially since students are going to be working on it.
building the dataset
Im talking with Dave Hagman about separating the data we will get from the MD from a dataset of community sample. That way we have the original dataset from the doctor which would be a medical dataset, and then we can distribute the app collection to other users and build a larger (but less accurate) dataset. I think the method we will provide to MD has to score high on the medical dataset but a community dataset could be used for training.
Thoughts?
providing the dataset
What do you guys think about making only one half of the dataset public? The non-public part of the datast could be used as testing sub-dataset. This way the user would have only access to the training/validation set but not the final dataset.
It's only an idea I wanted to pitch but we could work with the back end people to create an architecture so that students wiłl only be able to upload the result (or method) and would never be able to see the test dataset (I know some dataset have been set up this way by some uni).
It's definitely low priority but I thought it would be interesting to raise this point.
The text was updated successfully, but these errors were encountered:
I don't think you can (or that it is easy to get this data) atm. Sadly, the data wasn't collected as extensively as we wished in the end. @YoniSchirris do you still have access to it ?
I'm afraid our collected dataset was never finished in this sense. You can play around with the public datasets that were previously collected, however
Hi all,
Since we should soon get the dataset up and running I'd like to talk about how we plan to provide it to users especially since students are going to be working on it.
building the dataset
Im talking with Dave Hagman about separating the data we will get from the MD from a dataset of community sample. That way we have the original dataset from the doctor which would be a medical dataset, and then we can distribute the app collection to other users and build a larger (but less accurate) dataset. I think the method we will provide to MD has to score high on the medical dataset but a community dataset could be used for training.
Thoughts?
providing the dataset
What do you guys think about making only one half of the dataset public? The non-public part of the datast could be used as testing sub-dataset. This way the user would have only access to the training/validation set but not the final dataset.
It's only an idea I wanted to pitch but we could work with the back end people to create an architecture so that students wiłl only be able to upload the result (or method) and would never be able to see the test dataset (I know some dataset have been set up this way by some uni).
It's definitely low priority but I thought it would be interesting to raise this point.
The text was updated successfully, but these errors were encountered: