🚸 ICESat-2 ATL11 pre-processing to analysis ready format #10
base: main
Conversation
Initial draft tutorial on preparing ICESat-2 ATL11 land ice height time-series data into an analysis ready format. Currently this is just a minimal script that imports a few Python libraries and prints out their versions.
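A minimal sketch of what that version-printing step could look like (the library list here is illustrative, not necessarily the exact set imported in the notebook):

# Print the version of each library used, skipping any that are missing.
import importlib

for name in ["numpy", "pandas", "xarray", "datatree", "icepyx", "s3fs"]:
    try:
        module = importlib.import_module(name)
        print(f"{name}={module.__version__}")
    except ImportError:
        print(f"{name} is not installed")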
Using a personal fork of datatree that patches an issue with file pointers. As of https://github.com/weiji14/datatree/commit/d18961b2d1efdf6cfbea3038d6d2f8761edceba4 it's still not perfect, though, as there is a ValueError about malformed variables.
Reading the ICESat-2 ATL11 HDF5 files directly from the AWS S3 bucket into xarray! Based on the many tutorials available at https://nasa-openscapes.github.io/earthdata-cloud-cookbook. Some bugfixes in xarray and/or xarray-datatree are still needed for this to work fully.
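A rough sketch of that S3 read, assuming NASA Earthdata S3 credentials have already been set up for s3fs (the granule path is the example quoted later in this thread; this is not the exact notebook code):

# Open one ATL11 granule straight from the NSIDC cloud bucket as a DataTree.
import datatree
import s3fs

fs = s3fs.S3FileSystem(anon=False)  # assumes temporary AWS credentials from an Earthdata login
url = "s3://nsidc-cumulus-prod-protected/ATLAS/ATL11/005/2019/09/30/ATL11_005411_0315_005_03.h5"

with fs.open(url, mode="rb") as f:
    dt = datatree.open_datatree(f, engine="h5netcdf")
    print(dt)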
Not the most elegant solution, but manually reading the ATL11 pair track groups one by one in the for loop works. If the file pointer bug is fixed, a dictionary comprehension could be used to remove the for loop. Even better if the ValueError about malformed variables disappears, which would make this a one-liner!
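The per-group workaround looks roughly like this (a sketch reusing the fs and url objects from the snippet above; pt1/pt2/pt3 are the standard ATL11 pair track group names):

# Read each ATL11 pair track group separately to work around the file pointer bug.
import xarray as xr

pair_tracks = {}
for pt in ["pt1", "pt2", "pt3"]:
    with fs.open(url, mode="rb") as f:
        # .load() pulls the data into memory before the file handle is closed.
        pair_tracks[pt] = xr.open_dataset(f, group=pt, engine="h5netcdf").load()

# Once the file pointer bug is fixed, the loop collapses into a dictionary comprehension:
# pair_tracks = {pt: xr.open_dataset(fs.open(url), group=pt, engine="h5netcdf") for pt in ["pt1", "pt2", "pt3"]}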
Adopt geosmart Jupyter Book template and other content.
No longer using personal fork, use the official version instead!
Include placeholder DBSCAN subglacial lake finder demo.
Just for the sake of a quick demo.
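The placeholder is essentially scikit-learn's DBSCAN run on point coordinates. A toy sketch with synthetic data (the eps and min_samples values are purely illustrative, not tuned for real ATL11 data):

# Toy DBSCAN demo: cluster scattered points into candidate "lake" groups.
import numpy as np
from sklearn.cluster import DBSCAN

rng = np.random.default_rng(seed=42)
# Three synthetic "lakes" (tight clusters of points, in metres) plus background noise.
centres = np.array([[0, 0], [50_000, 20_000], [-30_000, 40_000]])
blobs = np.concatenate([rng.normal(loc=c, scale=1_000, size=(50, 2)) for c in centres])
noise = rng.uniform(-100_000, 100_000, size=(200, 2))
points = np.concatenate([blobs, noise])

clusterer = DBSCAN(eps=3_000, min_samples=10)  # 3 km neighbourhood radius
labels = clusterer.fit_predict(points)
print(f"{labels.max() + 1} clusters found, {np.sum(labels == -1)} noise points")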
Bug was fixed in xarray=2022.12.0. Also linted the code a bit.
gran_ids = region.avail_granules(ids=True, cloud=True)
print(len(gran_ids[0]))
print(gran_ids[0][:10])
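For context, the region object in that snippet would be an icepyx Query constructed along these lines (the bounding box and date range below are placeholder assumptions, not the values used in the PR):

# Hypothetical icepyx query; the spatial extent and dates are illustrative only.
import icepyx as ipx

region = ipx.Query(
    product="ATL11",
    spatial_extent=[-180, -90, 180, -60],  # rough Antarctic bounding box (lon/lat)
    date_range=["2019-03-29", "2020-12-24"],
)
# region.avail_granules(ids=True, cloud=True) then returns the granule ID list printed above.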
@JessicaS11, I'm seeing this warning when getting the granule IDs
/srv/conda/envs/notebook/lib/python3.10/site-packages/icepyx/core/granules.py:86: UserWarning: We are still working in implementing ID generation for this data product.
warnings.warn("We are still working in implementing ID generation for this data product.", UserWarning)
Assuming that the PR at icesat2py/icepyx#426 fixes this somewhat? Will the full s3 urls (e.g. s3://nsidc-cumulus-prod-protected/ATLAS/ATL11/005/2019/09/30/ATL11_005411_0315_005_03.h5) be returned in the list, or just the HDF5 filename (e.g. ATL11_005411_0315_005_03.h5)? I'm hoping for the full s3 urls (because it's been a pain working out how the ATL11 files are organized in the S3 bucket) 🙏
I'm seeing this warning when getting the granule IDs
Are you using the latest dev version? I think I only noted it in Slack, but:
Assuming that the PR at icesat2py/icepyx#426 fixes this somewhat?
Is correct and should fix it entirely (but hasn't been pushed through to a new release yet; I thought I took the warning off but perhaps not?). It returns the full s3 urls (agree on the pain point!), and the rest of this workflow (including reading in with datatree) worked successfully for me. If you're working in CryoCloud, you may need to jump through some extra hoops to use the dev install (so I've made a habit of checking my version at every import).
Ah cool, let me try things out on the dev branch, and yes I was working on the CryoCloud 😃
Yep, using pip install git+https://github.com/icesat2py/icepyx.git@development to get icepyx=0.7.1.dev9+g66e2863 returns the full s3 urls! I can't tell you how happy I am not having to manually generate a list of 4000+ urls in a file like this ATL11_to_download.txt anymore 😃
Gonna play with those ATL11 files and xarray-datatree a bit more now, hoping that this map_over_subtree method can save me from writing some nasty for-loops.
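For reference, map_over_subtree applies an ordinary Dataset-in, Dataset-out function to every group of the tree, so a per-pair-track operation only needs to be written once. A small sketch (the filtering function and the ATL11 variable names h_corr/quality_summary are assumptions for illustration):

# Sketch: run one function across all pt1/pt2/pt3 groups of an ATL11 DataTree.
from datatree import map_over_subtree

@map_over_subtree
def keep_good_heights(ds):
    # Hypothetical filter: keep only land ice heights flagged as good quality.
    if "h_corr" in ds and "quality_summary" in ds:
        return ds.where(ds["quality_summary"] == 0)
    return ds

# With dt being a DataTree opened from an ATL11 granule (as earlier in this thread):
# filtered = keep_good_heights(dt)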
Initial draft tutorial on preparing ICESat-2 ATL11 land ice height time-series data into an analysis ready format.
Preview at https://deploy-preview-10--precious-dragon-494161.netlify.app/chapters/03_prep_land_ice_height_time-series.html
Test on Pangeo Binder (note, may be broken...):
TODO:
Part of #3.