You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For large datasets, improved collaboration, and link sharing, data science should be done ✨ In The Cloud ✨
Since most of our data science is done through the Python ecosystem, Jupyter Notebooks is the most obvious technology choice. R and RStudio comes in as a close second.
Ideally, we self-host this so we can take advantage of lower-latency dc locality and such, compared to open tools where data would have to be transferred over arbitrary distance and unknown network conditions.
My only concern with SageMaker is that it seems to be geared towards a machine learning workflow / mindset. I think that's a great strategic goal - TensorFlow is eating the world - but I'm not sure how well that fits the tactical situation. It's definitely accessible from R / RStudio. so it wouldn't lock R programmers out.
@danieldn@DingoEatingFuzz did we decide to use sagemaker or not for people needing cloud access?
The other type of cloud resources that we recommended was Google Colaboratory notebooks which are free.
For large datasets, improved collaboration, and link sharing, data science should be done ✨ In The Cloud ✨
Since most of our data science is done through the Python ecosystem, Jupyter Notebooks is the most obvious technology choice. R and RStudio comes in as a close second.
Ideally, we self-host this so we can take advantage of lower-latency dc locality and such, compared to open tools where data would have to be transferred over arbitrary distance and unknown network conditions.
Ideal solution
Other tools to look at
TODO
The text was updated successfully, but these errors were encountered: