
Fixes #726: Implemented two accuracy functions #842

Open · wants to merge 3 commits into main

Conversation

@1160300918 commented Sep 3, 2024

Pull Request

What problem does this PR solve?

Issue Number: Fixed #726

Implemented two accuracy functions for binary classification and multi-class classification.


github-actions bot commented Sep 3, 2024

All contributors have signed the CLA ✍️ ✅
Posted by the CLA Assistant Lite bot.

sml/metrics/classification/classification.py: 3 review threads (outdated, resolved)
@1160300918 (Author) commented Sep 3, 2024

Can anyone tell me how I can sign the CLA, please? I commented "I have read the CLA Document and I hereby sign the CLA", but the CLA Assistant Lite bot won't pass.

@anakinxc (Contributor) commented Sep 3, 2024

Hi @1160300918

The commits and this pull request were created by different accounts; you have to sign the CLA with both accounts.

@hesy7 commented Sep 3, 2024

I have read the CLA Document and I hereby sign the CLA

1 similar comment
@1160300918 (Author) commented

I have read the CLA Document and I hereby sign the CLA

@1160300918 (Author) commented

> The commits and this pull request were created by different accounts; you have to sign the CLA with both accounts.

Thanks. The bot passed, but the CLA Assistant in the workflows still failed; should I commit again to trigger it?

@anakinxc (Contributor) commented Sep 3, 2024

> Thanks. The bot passed, but the CLA Assistant in the workflows still failed; should I commit again to trigger it?

ignore that.

@1160300918 (Author) commented

> ignore that.

OK, is there anything else I should do besides waiting for the code review results?

@anakinxc (Contributor) commented Sep 4, 2024

> OK, is there anything else I should do besides waiting for the code review results?

just be patient

```python
def balanced_accuracy_score(y_true, y_pred, labels, sample_weight=None, adjusted=False):
    """Calculate balanced accuracy score."""
    C = confusion_matrix(y_true, y_pred, labels, sample_weight=sample_weight)
    with np.errstate(divide="ignore", invalid="ignore"):
```

Contributor: The context manager cannot work in SPU; you can just delete this.
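The reviewer's point is that `np.errstate` only changes NumPy's floating-point error handling on plaintext values; under SPU's fixed-point tracing it has no effect, so the division can simply be written directly. A minimal sketch of the metric without the guard, using plain numpy and a hypothetical helper name (not the PR's code):

```python
import numpy as np

def balanced_accuracy_from_cm(cm, adjusted=False):
    # Per-class recall: diagonal over row sums; no errstate guard needed,
    # since the context manager is a no-op under SPU anyway.
    per_class = np.diag(cm) / cm.sum(axis=1)
    score = per_class.mean()
    if adjusted:
        # Rescale so that random guessing scores 0 and perfect prediction 1.
        chance = 1.0 / cm.shape[0]
        score = (score - chance) / (1.0 - chance)
    return score

cm = np.array([[2, 0], [1, 3]])
score = balanced_accuracy_from_cm(cm)  # mean of recalls [1.0, 0.75]
```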


from sml.preprocessing.preprocessing import label_binarize
from spu.ops.groupby import groupby, groupby_sum

from .auc import binary_clf_curve, binary_roc_auc


def confusion_matrix(y_true, y_pred, labels, sample_weight=None, normalize=None):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not implement the logic about sample_weight and normalize
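One way the missing parameters could behave, sketched with numpy and a hypothetical function name (this is an illustration of the expected semantics, not the PR's implementation): each sample contributes its weight instead of 1, and `normalize` divides by row sums, column sums, or the grand total.

```python
import numpy as np

def confusion_matrix_sketch(y_true, y_pred, labels, sample_weight=None, normalize=None):
    if sample_weight is None:
        sample_weight = np.ones(y_true.shape[0])
    # One-hot membership matrices of shape (n_samples, n_labels).
    t = (y_true[:, None] == labels[None, :]).astype(np.float64)
    p = (y_pred[:, None] == labels[None, :]).astype(np.float64)
    # Each sample contributes its weight to cm[true_label, pred_label].
    cm = t.T @ (p * sample_weight[:, None])
    if normalize == "true":       # rows sum to 1 (per-class recall view)
        cm = cm / cm.sum(axis=1, keepdims=True)
    elif normalize == "pred":     # columns sum to 1 (per-class precision view)
        cm = cm / cm.sum(axis=0, keepdims=True)
    elif normalize == "all":      # entries sum to 1
        cm = cm / cm.sum()
    return cm
```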

```python
    cm = jnp.zeros((num_labels, num_labels), dtype=jnp.int32)

    # Calculate the confusion matrix
    for i, label in enumerate(labels):
```

Contributor: Vectorized operations can replace these two for-loops, so the round complexity can be reduced.

E.g. `y_true == labels` gives an n×c matrix, where n is the number of samples and c is the number of labels. The same holds for `y_pred == labels`; then `cm` is just the inner product of all column pairs of the two matrices.
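The suggestion above can be sketched as follows, using numpy in place of jax.numpy (the broadcasting and matmul pattern is identical) and a hypothetical function name:

```python
import numpy as np  # jnp behaves the same for this pattern under SPU

def confusion_matrix_vectorized(y_true, y_pred, labels):
    # One-hot membership matrices of shape (n, c): row i marks where
    # sample i's label sits within `labels`.
    t = (y_true[:, None] == labels[None, :]).astype(np.int32)
    p = (y_pred[:, None] == labels[None, :]).astype(np.int32)
    # cm[i, j] = number of samples with true label i predicted as j,
    # i.e. the inner product of column i of t with column j of p.
    return t.T @ p

cm = confusion_matrix_vectorized(
    np.array([0, 1, 1, 0, 1, 1]),
    np.array([0, 0, 1, 0, 1, 1]),
    np.array([0, 1]),
)
```

No data-dependent control flow remains, which is what reduces the round complexity when traced by SPU.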


from sml.preprocessing.preprocessing import label_binarize
from spu.ops.groupby import groupby, groupby_sum

from .auc import binary_clf_curve, binary_roc_auc


def confusion_matrix(y_true, y_pred, labels, sample_weight=None, normalize=None):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add some docs for the functionality of this function, and the means of all params.
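A docstring in the NumPy style would address this; a hedged sketch of what it might look like (wording is illustrative, not from the PR):

```python
def confusion_matrix(y_true, y_pred, labels, sample_weight=None, normalize=None):
    """Compute a confusion matrix to evaluate classification accuracy.

    Parameters
    ----------
    y_true : array of shape (n_samples,)
        Ground-truth labels.
    y_pred : array of shape (n_samples,)
        Predicted labels.
    labels : array of shape (n_labels,)
        Label values indexing the rows and columns of the matrix.
    sample_weight : array of shape (n_samples,), optional
        Per-sample weights; defaults to uniform weights.
    normalize : {'true', 'pred', 'all'}, optional
        Normalize over true rows, predicted columns, or all entries.

    Returns
    -------
    cm : array of shape (n_labels, n_labels)
        cm[i, j] counts samples with true label labels[i] predicted
        as labels[j].
    """
```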

```python
    return cm


def balanced_accuracy_score(y_true, y_pred, labels, sample_weight=None, adjusted=False):
```

Contributor: Please add some docs describing what this function does and the meaning of all its parameters.

```python
    return top_k_score


def check(spu_result, sk_result):
    np.testing.assert_allclose(spu_result, sk_result, rtol=1, atol=1e-5)
```

Contributor: rtol and atol can be set to 1e-3.

```python
    return balanced_score


def check(spu_result, sk_result):
    np.testing.assert_allclose(spu_result, sk_result, rtol=1, atol=1e-5)
```

Contributor: rtol and atol can be set to 1e-3.


def test_balanced_accuracy(self):
sim = spsim.Simulator.simple(
3, spu_pb2.ProtocolKind.ABY3, spu_pb2.FieldType.FM128
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FM64 is enough


def test_top_k_accuracy(self):
sim = spsim.Simulator.simple(
3, spu_pb2.ProtocolKind.ABY3, spu_pb2.FieldType.FM128
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FM64 is enough

```python
y_true = jnp.array([0, 1, 1, 0, 1, 1])
y_pred = jnp.array([0, 0, 1, 0, 1, 1])
labels = jnp.array([0, 1])
spu_result = spsim.sim_jax(sim, proc)(y_true, y_pred, labels)
```

Contributor: Test the params sample_weight and adjusted; test larger datasets, please (maybe ~1000 samples are enough).


github-actions bot commented Oct 5, 2024

Stale pull request message. Please comment to remove stale tag. Otherwise this pr will be closed soon.


Successfully merging this pull request may close these issues.

Implement two accuracy functions with SPU (使用SPU实现两个accuracy函数)
6 participants