Support selecting a percent of total flux given a clean mask file #54

sjperkins · 2023-04-05T13:37:23Z

Given a clean mask file produced by Sofia 2, [registers] and groups (https://en.wikipedia.org/wiki/Image_registration) a wsclean component list by source ID's in a SoFiA2 Clean Mask file. Components are lexicographically sorted by (total source flux, component flux) and a percentage of components contributing to total flux are selected.

Tests added / passed
```
$ py.test --flake8 -v -s .
```
If the flake8 tests fail, the quickest way to correct
this is to run autopep8 and then flake8
to fix the remaining issues.
```
$ pip install -U autopep8 flake8
$ autopep8 -r -i .
$ flake8 .
```

sjperkins · 2023-04-24T15:07:27Z

crystalball/crystalball.py

@@ -66,6 +66,8 @@ def create_parser():
                   help="Fraction of system RAM that can be used. "
                        "Used when setting automatically the "
                        "chunk size. Default in 0.1.")
+    p.add_argument("--clean-mask-file", required=False, default="",
+                   help="Clean Mask File. If supplied")


I should update the help description

sjperkins · 2023-04-24T15:08:02Z

crystalball/wsclean.py

+        if m := re.match("^SoFiA (?P<version>\\d+\\.\\d+\\.\\d+)$", origin):
+            major, _, _ = map(int, m.group("version").split("."))
+            if major < 2:
+                raise ValueError(f"SoFiA major version is less than 2: {origin}")


Check that the Clean Mask file is produce by SoFiA 2.

sjperkins · 2023-04-24T15:08:57Z

crystalball/wsclean.py

+    try:
+        assert header["CTYPE1"].strip() == "RA---SIN"
+        assert header["CTYPE2"].strip() == "DEC--SIN"
+        assert header["CTYPE3"].strip() == "VRAD"


Assume radio velocity quantity, but potentially other types could be supported.

sjperkins · 2023-04-24T15:14:24Z

crystalball/wsclean.py

+
+        freq = SpectralQuantity(wsclean_comps["ReferenceFrequency"] * u.Hz,
+                                doppler_rest=wcs.wcs.restfrq * u.Hz,
+                                doppler_convention="radio")


It would be nice to find some better way to do this conversion as a Radio doppler convention os assumed here. The restfreq is on the wcs attribute for example. I intuit that the wcs.spectral atttribute (which is a wcs object for the Spectral coordinate axis) should be used, but a simple wcs.spectral.world_to_pixel didn't work for me..

sjperkins · 2023-04-24T15:15:44Z

crystalball/wsclean.py

+        integrated_fluxes = np.array([flux[sid == source_ids].sum() for sid in source_id_range])
+        broadcast_fluxes = integrated_fluxes[source_ids - 1]
+
+        comp_sort_idx = np.lexsort((flux, broadcast_fluxes))[::-1]


Broadcast the total flux for each source over the number of a components and lexically sort. This allows us to partially select components of a source.

sjperkins · 2023-04-24T15:16:17Z

tests/test_clean_mask.py

+        (2, 4, 5, 100),
+        (2, 3, 4, -100),
+        (2, 3, 2, -100),
+        (3, 0, 1, -25)]


Model 3 sources

3 components

2 components

1 component

sjperkins · 2023-04-24T15:17:20Z

tests/test_clean_mask.py

+
+
+def test_clean_mask(wsclean_model_and_clean_mask):
+    # 3 sources


Test that the appropriate components are chosen given the percentage flux. I should change this to check that the appropriate sources are selected.

sjperkins · 2023-04-24T15:19:41Z

crystalball/wsclean.py

+
+        source_ids = clean_mask[z, y, x]
+
+        if(np.any(source_ids == 0)):


It's possible that the wsclean components could register to the edge of a clean mask pixel, in which case coordinates could map to the wrong source_id.

I suspect this won't be a problem in practice, but I guess the only way to find out is to try this PR with real data.

sjperkins marked this pull request as draft April 7, 2023 09:05

sjperkins added 10 commits April 11, 2023 17:17

Remove travis testing

6cccddc

Remove pipenv dependency

b637ce0

Merge branch 'master' into github-actions

c2ab3b8

Newer regions fail on circles with zero radius

9423764

Merge branch 'master' into github-actions

55db9df

Include long_description

016911f

Update build and source distribution check actions

8adcee8

Initial commit

a331db5

WIP

4e4b3ae

Fixes and test case updates

4bb2803

sjperkins force-pushed the clean-mask-file branch from 4a8d79e to 4bb2803 Compare April 21, 2023 10:38

sjperkins added 5 commits April 21, 2023 12:58

Check for SoFiA headers

3224bb1

Allow partial selection of source components

9a62a9b

Merge branch 'master' into clean-mask-file

215e256

Merge branch 'master' into clean-mask-file

a8cdea6

Also register the frequency dimension

6533b4b

sjperkins marked this pull request as ready for review April 24, 2023 15:01

sjperkins assigned paoloserra Apr 24, 2023

sjperkins requested a review from paoloserra April 24, 2023 15:17

sjperkins self-assigned this Apr 24, 2023

sjperkins commented Apr 24, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support selecting a percent of total flux given a clean mask file #54

Support selecting a percent of total flux given a clean mask file #54

sjperkins commented Apr 5, 2023 •

edited

Loading

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023

sjperkins Apr 24, 2023



		def test_clean_mask(wsclean_model_and_clean_mask):
		# 3 sources


		source_ids = clean_mask[z, y, x]

		if(np.any(source_ids == 0)):

Support selecting a percent of total flux given a clean mask file #54

Are you sure you want to change the base?

Support selecting a percent of total flux given a clean mask file #54

Conversation

sjperkins commented Apr 5, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sjperkins commented Apr 5, 2023 •

edited

Loading