A proposal for INFRAEOSC-02-2019 started by participants in OpenDreamKit.
The draft is built automatically on CircleCI. To see a rendered draft, click on a successful build and then the 'Artifacts' tab, where there will be a link to the resulting PDF.
Or follow the URL on this page to view the latest build.
To build the draft locally, first fetch the submodules:
git submodule init; git submodule update
Then run
make draft
which produces draft.pdf if all goes well.
Most writing should happen in task files: create a new task in tasks/ based on tasks/template.tex, then add it to the task list of the relevant workpackage file.
Outline of workpackages and tasks:
~11 FTEs
FTEs: 1
FTEs: 2
- Collaborating on notebooks
- Real-time collaboration development
- moving state of notebook into the server
- Exploring novel interaction environments (UPSud)
- (and more?)
- Accessibility of Jupyter
- Link to EINFRA-12-2017
- Maintenance of core JupyterHub infrastructure
- XEUS C++ kernels and infrastructure
FTEs: 2
- Extension of binder service:
- entry portal where binder service can be selected
- mybinder.org
- EOSC hub
- Amazon S3 (against payment)
- local machine?
- more improvements
- infrastructure for archive of containers for reproducibility (EuXFEL use case)
- Exploring sustainability of the service
- For reproducibility
- Repo2docker development (more data sources, supported platforms)
- integration of nbval (or similar) and nbdime into the notebook, with results displayed in the notebook (maybe a “validate” button)
- display state of cells (up-to-date, out-of-date, maybe up-to-date)? // depends on server-side state
- Widgets - important to be able to host GUI-like interfaces
- In general, improving interactive experience
- standard scientific visualisations in notebook (including vectors)
- controllable through widgets
- state saveable within the notebook
- also in 3d (-> k3d?)
- Jupyter based workflow for computational and data science
- middle way between interactive and automatic task execution (nbparameterise)
- use notebooks as scripts (papermill)
- coordinate execution of notebooks
- connect to running processes (possible when document state is in kernel?) to see live updates
- computational environment (links to binderhub improvements) (repo2docker)
- Like repo2docker, but not images (build conda environments for hpc, for instance)
- given some specification, build me the right environment
- preservation of this environment
- will it still work in 10 years' time?
- [Following notes can be re-used in proposal to explain motivation]
- Min: longevity of conda environments is short
- most python packages are packaged per python version
- pip does source install fallback
- conda(-forge) relies on the right Python version
- not building for Python 3.5 anymore.
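
To make the "notebooks as scripts" idea from the workflow task above more concrete, here is a minimal sketch of parameter injection, assuming a notebook stored in the nbformat 4 JSON layout. It loosely follows the convention of marking a cell with a `parameters` tag and adding an override cell after it. The function name `inject_parameters` and the example notebook are hypothetical; this illustrates the concept only and is not how papermill or nbparameterise are implemented.

```python
import copy
import json


def inject_parameters(notebook, params):
    """Return a copy of `notebook` with a cell assigning `params` added.

    The new cell goes directly after the first cell tagged "parameters",
    or at the top of the notebook if no cell carries that tag.
    """
    nb = copy.deepcopy(notebook)  # leave the caller's notebook untouched
    cells = nb["cells"]
    # Index of the first cell tagged "parameters", or -1 if there is none.
    index = next(
        (i for i, cell in enumerate(cells)
         if "parameters" in cell.get("metadata", {}).get("tags", [])),
        -1,
    )
    # One assignment per parameter; repr() keeps strings quoted.
    source = "\n".join(f"{name} = {value!r}" for name, value in params.items())
    injected = {
        "cell_type": "code",
        "execution_count": None,
        "metadata": {"tags": ["injected-parameters"]},
        "outputs": [],
        "source": source,
    }
    cells.insert(index + 1, injected)
    return nb


# A tiny in-memory notebook (hypothetical content) used for illustration.
example_notebook = {
    "nbformat": 4,
    "nbformat_minor": 4,
    "metadata": {},
    "cells": [
        {"cell_type": "code", "execution_count": None,
         "metadata": {"tags": ["parameters"]}, "outputs": [],
         "source": "n_steps = 10"},
        {"cell_type": "code", "execution_count": None,
         "metadata": {}, "outputs": [],
         "source": "print(n_steps)"},
    ],
}

parameterised = inject_parameters(example_notebook, {"n_steps": 500})
# Show the injected cell, which now overrides the default above it.
print(json.dumps(parameterised["cells"][1], indent=2))
```

Because the defaults stay in the tagged cell, the same notebook remains usable interactively, while an automated run only needs to supply the overrides.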
FTEs: 3
- XFEL Open Science (LIGO OSC for XFEL?)
- Personal data (Paul)
- Maths demonstrator
at least one or two more
FTEs: 2
- Prototype/operate European Binder
- Migrate service into EOSC
- Cooperation with EOSC
FTEs: 2
- Training workshops
- N / year
- Some dedicated to diversity (WISE, etc.)
- Workshop surveys
- Online training resources (competition issue?)
- Be more specific to our tooling (conda/python/jupyter/etc.)
- Generic resources like OpenAIRE and the Carpentries exist already
- Focus on gaps filled by our expertise; complementary, not competitive
- Presentations at conferences to meet researchers where they are (maybe not a task in its own right)
- help desk (Nicolas)