JP-3281: Change resampling defaults to NaN padding instead of INDEF #8488

Merged: 6 commits into spacetelescope:master on May 28, 2024

Conversation

@drlaw1558 (Collaborator) commented May 15, 2024

Resolves JP-3281 by changing padding around science data in resampled arrays to NaN instead of INDEF.

Closes #7664
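
A minimal sketch of the behavior change at the step level (the input filename is a hypothetical placeholder; fillval is the existing resample step parameter named in the linked issue, and it accepts a numeric string as in the underlying drizzle code):

    from jwst.resample.resample_step import ResampleStep

    # With this change, output pixels that receive no input contribution are
    # padded with NaN by default, rather than with INDEF (which produced
    # zero-filled padding). The old zero padding can still be requested:
    result_nan = ResampleStep.call("example_cal.fits")                # NaN padding (new default)
    result_zero = ResampleStep.call("example_cal.fits", fillval="0")  # zero padding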

Checklist for PR authors (skip items if you don't have permissions or they are not applicable)

  • added entry in CHANGES.rst within the relevant release section
  • updated or added relevant tests
  • updated relevant documentation
  • added relevant milestone
  • added relevant label(s)
  • ran regression tests, posted a link to the Jenkins job below.
    How to run regression tests on a PR
  • All comments are resolved
  • Make sure the JIRA ticket is resolved properly

codecov bot commented May 15, 2024

Codecov Report

Attention: Patch coverage is 0%, with 3 lines in your changes missing coverage. Please review.

Project coverage is 57.97%. Comparing base (4179c09) to head (f0ad63b).
The report is 1 commit behind head on master.

Current head f0ad63b differs from the pull request's most recent head 98b7f47.

Please upload reports for commit 98b7f47 to get more accurate results.

Files                           Patch %   Lines
jwst/resample/gwcs_drizzle.py   0.00%     1 Missing ⚠️
jwst/resample/resample.py       0.00%     1 Missing ⚠️
jwst/resample/resample_step.py  0.00%     1 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##           master    #8488   +/-   ##
=======================================
  Coverage   57.97%   57.97%           
=======================================
  Files         387      387           
  Lines       38830    38830           
=======================================
  Hits        22513    22513           
  Misses      16317    16317           


@drlaw1558 (Collaborator, Author) commented:

Probably failing a bunch of reg tests as outputs are now NaN-padded instead of 0-padded.

@hbushouse added this to the Build 11.0 milestone on May 20, 2024
@hbushouse (Collaborator) commented:

Unit tests contained in the jwst/resample/tests/test_resample_step.py module are causing all the CI failures, due to the change from zero to NaN in output arrays. These tests will need updating.

@drlaw1558 (Collaborator, Author) commented:

> Unit tests contained in the jwst/resample/tests/test_resample_step.py module are causing all the CI failures, due to the change from zero to NaN in output arrays. These tests will need updating.

This seems a little tricky. The failing test test_custom_refwcs_resample_imaging creates an array of random numbers, passes it through resample in two slightly different ways, and checks that the results are allclose. However, for whatever reason the values coming out of resample before this PR are all zeros everywhere in both resample calls, and all zeroes match all zeroes. After this change the results of both calls are two arrays of all NaNs, and an all-NaN array is apparently not considered a match for another all-NaN array, since NaN never compares equal to NaN.

The easiest thing to do is delete the assert np.allclose(data1, data2) as it doesn't seem to be testing anything meaningful anyway. However, what was it meant to test? Calling resample on the random number array ordinarily gives meaningful results, but the test as set up is trimming to an entirely empty part of the array.
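
For reference, a minimal illustration of the comparison behavior at issue: np.allclose treats NaN as unequal to NaN unless equal_nan=True is passed.

    import numpy as np

    a = np.full((4, 4), np.nan)
    b = np.full((4, 4), np.nan)

    print(np.allclose(a, b))                  # False: NaN != NaN by default
    print(np.allclose(a, b, equal_nan=True))  # True: NaNs compared as equal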

@hbushouse (Collaborator) commented:

> > Unit tests contained in the jwst/resample/tests/test_resample_step.py module are causing all the CI failures, due to the change from zero to NaN in output arrays. These tests will need updating.
>
> This seems a little tricky. The failing test test_custom_refwcs_resample_imaging creates an array of random numbers, passes it through resample in two slightly different ways, and checks that the results are allclose. However, for whatever reason the values coming out of resample before this PR are all zeros everywhere in both resample calls, and all zeroes match all zeroes. After this change the results of both calls are two arrays of all NaNs, and an all-NaN array is apparently not considered a match for another all-NaN array, since NaN never compares equal to NaN.
>
> The easiest thing to do is delete the assert np.allclose(data1, data2) as it doesn't seem to be testing anything meaningful anyway. However, what was it meant to test? Calling resample on the random number array ordinarily gives meaningful results, but the test as set up is trimming to an entirely empty part of the array.

You're right - that test is just bizarre. The nircam_rate image is defined to have a size of 204x204, and then the test is resampling to an output size of more than 1000x1000? And setting the WCS reference point at around 600? Guessing that the 204x204 size might've been a mistake and that they really meant 2048x2048, I tried running it with that image size, and the result (using the current master branch) is still zero-filled in all the SCI, WHT, and CON arrays, which means no input pixels contribute to any of the output pixels anywhere in the entire image. That whole test is just screwy. I suggest adding an xfail to it, in order to allow it to fail for now, and we'll try to get someone to fix/rewrite it in the future.
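
A sketch of that suggestion (the reason string is illustrative; the fixture name follows the nircam_rate image mentioned above):

    import pytest

    # Mark the test as an expected failure instead of deleting its assertion,
    # so it stays visible until someone rewrites it.
    @pytest.mark.xfail(reason="output grid has no overlap with the input data; "
                              "test needs to be rewritten")
    def test_custom_refwcs_resample_imaging(nircam_rate):
        ...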

@drlaw1558 (Collaborator, Author) commented:

> I suggest adding an xfail to it, in order to allow it to fail for now, and we'll try to get someone to fix/rewrite it in the future.

Done.

@hbushouse (Collaborator) commented:

The latest regtest results seem pretty reasonable. Lots of i2d and s2d products that now have zero values changed to NaN, some of which then propagate into NaN values of x1d products (if they're spectra). In general it's 5-20% of the pixels that see such a change. There are a few cases, however, where as many as 80-90% of the pixels in an i2d or s2d have been changed from zero to NaN. This suggests we might have some fairly ratty test data in some tests, if that many pixels of the resample products have no contribution from their inputs. But mechanically it all looks reasonable.
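
One quick way to quantify that kind of change between a regtest truth file and a new product (the file names here are hypothetical placeholders):

    import numpy as np
    from astropy.io import fits

    old_sci = fits.getdata("truth_i2d.fits", "SCI")
    new_sci = fits.getdata("result_i2d.fits", "SCI")

    # Pixels that were zero-padded before and are NaN-padded now.
    changed = (old_sci == 0) & np.isnan(new_sci)
    print(f"{100 * changed.mean():.1f}% of pixels changed from 0 to NaN")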

@drlaw1558 (Collaborator, Author) commented:

> The latest regtest results seem pretty reasonable. Lots of i2d and s2d products that now have zero values changed to NaN, some of which then propagate into NaN values of x1d products (if they're spectra). In general it's 5-20% of the pixels that see such a change. There are a few cases, however, where as many as 80-90% of the pixels in an i2d or s2d have been changed from zero to NaN. This suggests we might have some fairly ratty test data in some tests, if that many pixels of the resample products have no contribution from their inputs. But mechanically it all looks reasonable.

@hbushouse Can you point me at a couple of examples with 80-90% NaN-valued i2d and s2d data?

@hbushouse (Collaborator) commented:

> > The latest regtest results seem pretty reasonable. Lots of i2d and s2d products that now have zero values changed to NaN, some of which then propagate into NaN values of x1d products (if they're spectra). In general it's 5-20% of the pixels that see such a change. There are a few cases, however, where as many as 80-90% of the pixels in an i2d or s2d have been changed from zero to NaN. This suggests we might have some fairly ratty test data in some tests, if that many pixels of the resample products have no contribution from their inputs. But mechanically it all looks reasonable.
>
> @hbushouse Can you point me at a couple of examples with 80-90% NaN-valued i2d and s2d data?

The two that seem to stand out are the i2d files created by the test_miri_image_stages and test_nircam_image_stages regtests. But it may be a red herring. In addition to different values in the SCI extensions, they also show lots of differences in the various WHT, CON, ERR, etc. extensions, and not all of them are simple zero-to-NaN conversions. Many of the other extensions just show differences in numerical values, and I'm not sure why. So the very large fraction of differing pixels in the SCI extensions is probably a combination of multiple effects (not just the zero->NaN change happening here).

@drlaw1558 (Collaborator, Author) commented:

@hbushouse Interesting; I took a look at the MIRI imaging case (PID 1024 Obs 1) and all of the fluxes in JFrog are about 10% fainter than in the same image in MAST. If I reprocess this data from uncal locally, though (with the relevant branch for this ticket), then things largely match the MAST result, apart from 0s changing to NaNs in the expected places. So I agree that this PR seems OK for that data, though I'm puzzled why the regtest values are all off by 10%. Does the regtest start from intermediate files that might be out of date, or does it run from uncal?
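
For reference, the local reprocessing described above can be run end to end from the uncal files roughly like this (the file and association names are hypothetical placeholders):

    from jwst.pipeline import Detector1Pipeline, Image2Pipeline, Image3Pipeline

    # Regenerate the rate files from raw data so no stale intermediates are used.
    Detector1Pipeline.call("example_uncal.fits", save_results=True)
    Image2Pipeline.call("example_rate.fits", save_results=True)
    # Image3 consumes an association of the cal files and produces the i2d mosaic.
    Image3Pipeline.call("example_image3_asn.json", save_results=True)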

@hbushouse (Collaborator) commented:

> @hbushouse Interesting; I took a look at the MIRI imaging case (PID 1024 Obs 1) and all of the fluxes in JFrog are about 10% fainter than in the same image in MAST. If I reprocess this data from uncal locally, though (with the relevant branch for this ticket), then things largely match the MAST result, apart from 0s changing to NaNs in the expected places. So I agree that this PR seems OK for that data, though I'm puzzled why the regtest values are all off by 10%. Does the regtest start from intermediate files that might be out of date, or does it run from uncal?

I'm sure that's the issue. The test setup is using cached/archived rate files as input to the image2 pipeline, and then uses the cal outputs of that to feed the image3 pipeline, which is where the differences show up. So the input rate files are probably "stale" relative to updated calibrations or detector1 code.

The fact that you've independently verified that the zero->NaN conversion looks OK gives me the confidence to approve this as is.

@hbushouse merged commit f866557 into spacetelescope:master on May 28, 2024
24 checks passed
Merging this pull request closed: Is the fillval = 'INDEF' parameter in resample_spec working as intended? (#7664)