Confusion matrix in terms of map area proportions instead of sample size #381

Merged (32 commits) Feb 26, 2024

Conversation

hannah-rae (Contributor)

@hannah-rae hannah-rae commented Jan 26, 2024

Currently, our intercomparison and other metrics are computed from the confusion matrix expressed in terms of sample size. However, the recommendation in Stehman & Foody 2019 is to compute metrics from the confusion matrix expressed in terms of the map area proportion of each class (which is what we already do for area estimation in src/area_utils.py; see lines 563-567 in that file).
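For context, the estimator referred to here converts each confusion-matrix cell from a sample count n_ij to an area proportion via p_ij = w_i * n_ij / n_i., where w_i is the mapped area proportion of class i and n_i. is the row total. A minimal NumPy sketch with made-up numbers (the helper name and all values are illustrative, not from this PR):

```python
import numpy as np

# Illustrative sketch: convert a sample-count confusion matrix (rows = map
# class, columns = reference class) into area proportions using
# p_ij = w_i * n_ij / n_i. as in Stehman & Foody 2019.
def to_area_proportions(cm: np.ndarray, w_j: np.ndarray) -> np.ndarray:
    row_totals = cm.sum(axis=1, keepdims=True)  # n_i. (samples per map class)
    return w_j[:, np.newaxis] * cm / row_totals  # p_ij (area proportions)

cm = np.array([[90, 10],    # sampled noncrop pixels
               [20, 80]])   # sampled crop pixels
w_j = np.array([0.7, 0.3])  # mapped area proportion of each class (assumed)

p = to_area_proportions(cm, w_j)
assert np.isclose(p.sum(), 1.0)  # proportions sum to 1 by construction
```

The resulting matrix p, rather than the raw counts cm, is what the accuracy metrics would be computed from.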

This PR adds a new function that computes the mapped area for each class and uses these totals to compute the "area matrix". A few issues still need to be addressed:

  • I count the number of pixels in each class by summing the binary values (e.g., if cropland = 1 and noncrop = 0, the sum of the map equals the number of cropland pixels). However, the sum returned by the ee Reducer is a float, where I would expect an integer. [SOLVED] This is because the reduction is weighted by default, so pixels cut by, e.g., the clip() operation contribute fractional weights. I changed this to an unweighted reduction because that is how our area estimation pipeline does it, and integer counts seem more transparent to me. The difference between the two is negligible anyway.
  • Getting the pixel counts for the ensemble in a way that remains flexible about which map(s) we specify for the ensemble is tricky. I am still thinking about the best way to do this. [SOLVED]
  • The F1 scores for the Rwanda test case do not look right. I am still investigating this. [SOLVED] This was because the F1 score (and some of the standard errors) were computed using the sample metrics from the sklearn classification report, while others were computed using the population error matrix.
  • Update all of the notebooks under maps
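The weighted-vs-unweighted distinction in the first bullet can be illustrated without Earth Engine. A plain NumPy stand-in, where pixels cut by clip() carry fractional weights (all values made up for illustration):

```python
import numpy as np

# Illustrative sketch (NumPy, not Earth Engine) of why the default weighted
# reduction returns a float: pixels clipped at the AOI boundary carry
# fractional weights, while an unweighted reduction counts whole pixels.
values = np.array([1, 1, 1, 0, 1])             # binary crop/noncrop pixels
weights = np.array([1.0, 1.0, 0.4, 1.0, 0.7])  # edge pixels are partial

weighted_sum = float((values * weights).sum())  # default reduceRegion behavior
unweighted_sum = int(values.sum())              # unweighted: whole pixel count

# weighted_sum is ~3.1 (a float); unweighted_sum is 4 (an integer)
```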


@hannah-rae hannah-rae self-assigned this Jan 26, 2024
@hannah-rae hannah-rae added the enhancement New feature or request label Jan 26, 2024
.get("crop")
.getInfo()
)

@ivanzvonkov ivanzvonkov (Collaborator) Jan 30, 2024

Here's an alternative for getting the crop/non-crop proportion without needing two reduceRegion calls.
It may also be more "accurate", since the proportions are computed from pixel area rather than pixel count.

geom = aoi.geometry()
area = geom.area()
crop_proportion = (
    binary_image
    .multiply(ee.Image.pixelArea())  # Multiplies each pixel by its area
    .divide(area)                    # Divides each pixel by the total area
    .reduceRegion(
        reducer=ee.Reducer.sum(),
        geometry=geom,
        scale=self.resolution,
        maxPixels=1e12,
        bestEffort=True,  # Should reduce scale if computation times out
    )
    .get("crop")
    .getInfo()
)

w_j = np.array([1 - crop_proportion, crop_proportion])
return w_j

Collaborator
Can be multiplied by area to get a_j
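A minimal sketch of that suggestion, with assumed numbers (w_j and the total area are illustrative; in the snippet above, area would come from geom.area().getInfo()):

```python
import numpy as np

# Illustrative sketch: scaling the class proportions w_j by the total AOI
# area gives the mapped area of each class, a_j. Values are assumptions.
w_j = np.array([0.7, 0.3])  # noncrop/crop proportions (illustrative)
area = 1_000_000.0          # total AOI area in m^2 (illustrative)

a_j = w_j * area            # mapped area per class
```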

Collaborator
Never mind, I see you need the exact pixel counts for the producer's variance estimator 🤦

if precision + recall == 0:
    return 0
f1_score = 2 * (precision * recall) / (precision + recall)
return f1_score
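Connecting this to the fix described in the PR body (F1 computed from the population error matrix rather than sklearn's sample metrics), a hedged sketch with an illustrative area-proportion matrix:

```python
import numpy as np

# Illustrative sketch: precision (user's accuracy) and recall (producer's
# accuracy) for the crop class, computed directly from a population
# (area-proportion) error matrix. Rows = map class, columns = reference
# class; the values are made up, not from the PR.
p = np.array([[0.63, 0.07],
              [0.06, 0.24]])

precision = p[1, 1] / p[1, :].sum()  # user's accuracy of the crop class
recall = p[1, 1] / p[:, 1].sum()     # producer's accuracy of the crop class

if precision + recall == 0:
    f1_score = 0.0
else:
    f1_score = 2 * (precision * recall) / (precision + recall)
```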
Collaborator

Thanks for adding this!

@ivanzvonkov ivanzvonkov (Collaborator) left a comment

This looks excellent! Looks like worldcover got a bit of a boost from these area weights. Thank you for updating the template notebook and the descriptions in each notebook.

@hannah-rae hannah-rae merged commit 47cb0e5 into master Feb 26, 2024
8 checks passed
@hannah-rae hannah-rae deleted the stderr-fix branch February 26, 2024 17:16