Cube Query Deployment

THIS PROJECT IS NO LONGER REQUIRED OR MAINTAINED. DO NOT USE OR EXPECT IT TO WORK.

It is only left here as a historical artifact. Go and see the cubequery project to find out how things now work.

This is both an example project to show how to deploy the cubequery system and a real deployment.

This shows pulling in two library modules (sac_utils and data_cube_utilities) and adding some processors.

Building and Running

Clone this repo.
Clone the cubequery repo.
Edit the docker-compose.yml file to include the paths to your repos and set environment vars as you require.
Edit the build.bat file to include the paths to your repos.
Create the containers by running build.bat. If you are working on another platform the commands can be borrowed from that script.
Run docker-compose up to start the environment.

The server should be able to live reload when changes are made, however the worker is not able to do so. That will need manually restarting.

For a production environment check out our helm charts

Converting a notebook to a cubequery process

While a note book is great for developing a process. It is not so great when running the process in production. See processes/ndvi_anomaly.py for an example

Extract everything that can be a parameter to somewhere near the top of the notebook
Remove any and all debug and investigation plotting code. It should not be nessary to import matplotlib at any point in the note book
Create a new python file
Create a class that extends CubeQueryTask
The class needs a member called display_name with what this process should be called when shown to a user.
The class needs a member called description with a description for a user of what this process does.
Create a parameter list which contains details of each parameter.
1. Name
2. Display Name
3. Data type
4. Description
Call the parameter setup function. CubeQueryTask.cal_significant_kwargs(parameters) This is required to setup the parameters in the processing engine
Create a method called generate_product that takes self, a datacube instance, and the path prefix to be used for any outputs. Along with each of the defined parameters from the earlier list as arguments.
In this method implement the process
The method needs to return a list of output files that should be kept when the processing is done. These should include the path prefix passed in to the function. It is important to use the path prefix when generating output data so that each task run gets a unique output folder. Otherwise your process may end up overwriting some of the work of another processor if two are running at the same time.
Rebuild and re-deploy all the containers.

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
processes		processes
.dockerignore		.dockerignore
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE.md		LICENSE.md
README.md		README.md
build.bat		build.bat
build.sh		build.sh
datacube.conf		datacube.conf
docker-compose.yml		docker-compose.yml
init.sql		init.sql
publish.bat		publish.bat
server.Dockerfile		server.Dockerfile
worker.Dockerfile		worker.Dockerfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Cube Query Deployment

Building and Running

Converting a notebook to a cubequery process

About

Uh oh!

Releases

Packages

Contributors 5

Uh oh!

Languages

License

SatelliteApplicationsCatapult/cubequery-deployment

Folders and files

Latest commit

History

Repository files navigation

Cube Query Deployment

Building and Running

Converting a notebook to a cubequery process

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Uh oh!

Languages

Packages