Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalisation for streaming from tf dataset #9

Open
meiertgrootes opened this issue Feb 22, 2021 · 1 comment
Open

Normalisation for streaming from tf dataset #9

meiertgrootes opened this issue Feb 22, 2021 · 1 comment

Comments

@meiertgrootes
Copy link
Contributor

As we are using TF dataset objects to stream data, we need to reconsider how to implement normalisation of input to the VAE

@fnattino
Copy link
Contributor

fnattino commented Feb 22, 2021

Is the maximum intensity of a dataset sufficient (the minimum should be zero) for normalizing it? We could loop over all tiles to determine the overall max value and ultimately store this as a property in the tile catalog, to be used in later runs for all datasets. This would imply all datasets being normalized with the same bounds, regardless of whether a mask for data balancing is applied or not - do you think this might be an issue?

Not sure how different bands have been dealt with so far - should they be normalized independently or as a whole? Maybe due to heterogeneity between visible an NIR bands (and potentially radar data to be added as an additional band in future?), one should have normalise each band independently? If this is the case we could store in the catalog band-specific max values..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants