Use active learning based on uncertainty to augment dataset #35

jcohenadad · 2023-05-13T21:26:34Z

This PR introduces the following algorithm:

Run predictions using models published in https://github.com/ivadomed/model_seg_mouse-sc_wm-gm_t1/releases/tag/v0.2
Compute STD across predictions
Average STD per slice (ignore zeros and nans)
If STD is above a threshold, remove prediction for that slice.
Save predictions under derivatives/labels_active_learning
Train new model with uncertainty-filtered predicted labels

Fixes #31

jcohenadad · 2023-05-13T21:53:44Z

Output prediction_std on sub-mouse1/anat/sub-mouse1_chunk-1_T1w with version 534f16a

prediction_std.csv

Based on this plot, slices 416 --> 440 have the highest STD. Let's look at prediction STD from these slices:

Yup! that looks about right:

jcohenadad · 2023-05-13T22:06:41Z

Few things to consider:

Would an STD threshold be generalizable across the whole dataset?
How reliable is STD computed on very few pixels?

jcohenadad · 2023-05-13T22:10:02Z

How reliable is STD computed on very few pixels?

In this example, STD is 0.000305 (relatively low):

But there is no pixel picked up by majority voting!

In this other example, STD is high (0.03803):

But the segmentation is pretty good!

So, I need to find another way to filter based on STD (or another metric...)

jcohenadad · 2023-05-13T22:16:27Z

Few ideas to address #35 (comment)

A "good looking" segmentation has high uncertainty only at its edges. So maybe I could ignore STD computed at the edge of the segmentation?
Instead of computing STD, compute the total overlap (a bit like a Dice, but across more than 2 labels).

jcohenadad added 6 commits May 12, 2023 16:02

Compute STD across model predictions

eaf324f

Added flag to optionally output uncertainty

a6bcbf6

Fixed typo

5aaf555

Fixed logic for testing uncertainty flag

3231f91

Compute average STD per slice

f58aa99

Output prediction_std as CSV file

534f16a

Merge branch 'main' into jca/uncertainty

121e53d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use active learning based on uncertainty to augment dataset #35

Use active learning based on uncertainty to augment dataset #35

jcohenadad commented May 13, 2023

jcohenadad commented May 13, 2023 •

edited

Loading

jcohenadad commented May 13, 2023 •

edited

Loading

jcohenadad commented May 13, 2023 •

edited

Loading

jcohenadad commented May 13, 2023 •

edited

Loading

Use active learning based on uncertainty to augment dataset #35

Are you sure you want to change the base?

Use active learning based on uncertainty to augment dataset #35

Conversation

jcohenadad commented May 13, 2023

jcohenadad commented May 13, 2023 • edited Loading

jcohenadad commented May 13, 2023 • edited Loading

jcohenadad commented May 13, 2023 • edited Loading

How reliable is STD computed on very few pixels?

jcohenadad commented May 13, 2023 • edited Loading

jcohenadad commented May 13, 2023 •

edited

Loading

jcohenadad commented May 13, 2023 •

edited

Loading

jcohenadad commented May 13, 2023 •

edited

Loading

jcohenadad commented May 13, 2023 •

edited

Loading