-
Notifications
You must be signed in to change notification settings - Fork 300
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make caching warn if some of the args are unhashable #2585
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Works for me. I'm not sure why I did it this way originally. Maybe I had hoped that we could still cache based on all the other inputs...but that doesn't really make sense.
Feel free to merge once tests pass and we'll see what breaks.
Ah the tests have revealed it. We still want to be able to run the function even if it can't be cached. Maybe a warning? But the warning would have to go after the check to see if caching is requested (enabled) and after we know that there is an uncacheable type (ex. SwathDefinition). |
Codecov Report
@@ Coverage Diff @@
## main #2585 +/- ##
==========================================
+ Coverage 94.91% 95.18% +0.27%
==========================================
Files 351 354 +3
Lines 51215 51270 +55
==========================================
+ Hits 48611 48802 +191
+ Misses 2604 2468 -136
Flags with carried forward coverage won't be shown. Click here to find out more.
|
satpy/modifiers/angles.py
Outdated
@@ -250,6 +248,7 @@ def _hash_args(*args, unhashable_types=DEFAULT_UNCACHE_TYPES): | |||
hashable_args = [] | |||
for arg in args: | |||
if isinstance(arg, unhashable_types): | |||
warnings.warn(f"Unhashable type in function signature ({type(arg)}), cannot be cached.", stacklevel=2) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm unsure about this. For one the stacklevel should be at least 3 in my opinion as 2 only gets us to __call__
of this helper class.
As for warning versus something else, I think it is confusing. It is perfectly valid to call these functions with an uncacheable/unhashable type like a SwathDefinition or a StackedAreaDefinition. There shouldn't be a warning for these cases. In my previous comment I pointed out that this warning needs to go after the should_cache
check which checks if the user has caching enabled. Otherwise users could be getting this when they know they aren't going to cache things (uncacheable types) and they haven't even requested that it be attempted (the config key is disabled/off).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point! Let me fix that.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is now fixed. Thanks for pointing this out.
satpy/modifiers/angles.py
Outdated
self._cache_results(res, zarr_format) | ||
self._warn_if_irregular_input_chunks(args, args_to_use) | ||
self._cache_results(res, zarr_format) | ||
|
||
# if we did any caching, let's load from the zarr files |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@djhoese while refactoring this function a bit, I got confused by this: Why do we load from the zarr files just after we cached it to disk?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- So that we're caching immediately and using those cached values.
- So that future calls to this function that result in the same cached results will use the same cached values and when
.compute()
is finally called there is no redundant computation. By that I mean if we generate these angles once but then continue to use the dask array and then a different product generates these angles and detects the cached value and uses the cached arrays, then when we finally dodask.array.compute(composite1, composite2)
we would get one set of computations based on the original dask array and one set of computations based on the cached array even though they should be the same numbers.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah, ok, good. I might improve the comment to explain point 2, for future us.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this all looks good. I walked through the logic and it seems consistent with what was there before. I guess my only request is wondering if it can be split into helper functions/methods to make it any more clear? If that seems too hard I'm fine with this being merged.
Indeed I unfolded the logic a bit to make it clearer but at the expense of the size of the function. I'll try refactoring now that you also shed light on the details I was missing. |
I'm ready to merge this, @djhoese any more comments? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One small naming question that shouldn't hold up merging, but otherwise I remember agreeing with the logic before the refactor and this still looks understandable so 👍
I think this has been good for a while, we just forgot to merge it. Merging now... |
Oh great, CI on |
Wtf? |
|
Might be a hiccup or race condition. |
Or could the caching behaviour have changed and change the call count |
Restarting just to see... |
Restarting and it seems to pass |
This PR make the caching decorator fail if one of the arguments in unhashable where it was silently ignoring it previously and not caching.