FedPer Implementation #72

Merged: 5 commits merged into main, Nov 29, 2023

Conversation

emersodb (Author):

PR Type

Feature

Short Description

ClickUp Ticket: https://app.clickup.com/t/8686ckn31

This PR adds the FedPer method to the repository. The addition is fairly straightforward within our infrastructure: it's essentially a globally trained feature extractor paired with a locally trained classification head on each client. I added an example using this infrastructure that also trains with MOON. So the example client inherits from the MOON client to apply the auxiliary losses to the model alongside local personalization, which I thought was nice.
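
For intuition, here is a minimal sketch of the FedPer structure described above (a hypothetical `SimpleFedPerModel`, not the repository's actual class):

```python
import torch
import torch.nn as nn


class SimpleFedPerModel(nn.Module):
    """Hypothetical minimal FedPer-style model: a globally aggregated feature
    extractor feeding a classification head that never leaves the client."""

    def __init__(self, global_feature_extractor: nn.Module, local_prediction_head: nn.Module) -> None:
        super().__init__()
        # The feature extractor's weights are exchanged with the server;
        # the prediction head is trained and kept locally on each client.
        self.global_feature_extractor = global_feature_extractor
        self.local_prediction_head = local_prediction_head

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        features = self.global_feature_extractor(x)
        return self.local_prediction_head(features)
```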

In the course of adding the method, I noticed a few small bugs in MOON and in the auxiliary losses (like PerFCL) in the FENDA approaches. These have been fixed, and tests have been added to ensure they are indeed fixed.

Tests Added

Added tests covering MOON's contrastive loss calculations and the loss computations of MOON, FENDA, FedProx, and FedPer.

…calculated when using auxiliary losses in FENDA. Added tests for FedProx and FENDA to ensure the losses are being formed correctly. Also made a small change to the FENDA example to add the PerFCL loss for testing.
from fl4health.utils.sampler import MinorityLabelBasedSampler


class MnistFedPerClient(MoonClient):
emersodb (Author):

Note: We inherit from a MOON client here intentionally to be able to use auxiliary losses associated with the global module's feature space in addition to the personalized architecture.

@@ -26,7 +26,7 @@ def __init__(
         device: torch.device,
         minority_numbers: Set[int],
     ) -> None:
-        super().__init__(data_path=data_path, metrics=metrics, device=device)
+        super().__init__(data_path=data_path, metrics=metrics, device=device, perfcl_loss_weights=(1.0, 1.0))
emersodb (Author):

Adding the PerFCL loss to the FENDA example. This is just for testing when running the example, to make sure nothing is broken there.

@@ -207,7 +205,7 @@ def compute_loss(
         """

         loss = self.criterion(preds["prediction"], target)
-        total_loss = loss
+        total_loss = loss.clone()
emersodb (Author), Nov 22, 2023:

Without clone, total_loss and loss share memory, so anything added to total_loss below is also added to loss. As a result, the checkpoint and backward losses in the loss object end up identical, which we don't want. I added a unit test to make sure the clone here fixes the issue.
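
A minimal reproduction of the aliasing behavior (illustrative sketch, not the repository's code):

```python
import torch

# Without clone, both names point at the same tensor storage.
loss = torch.tensor(2.0)
total_loss = loss
total_loss += 1.0   # in-place add mutates loss too
print(loss.item())  # 3.0 -- loss was silently modified

# With clone, total_loss gets its own storage.
loss = torch.tensor(2.0)
total_loss = loss.clone()
total_loss += 1.0
print(loss.item())  # 2.0 -- loss is untouched
```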

_, old_model_features = old_model(input)
old_features[i] = old_model_features["features"]
_, global_model_features = self.global_model(input)
features.update({"global_features": global_model_features["features"], "old_features": old_features})
emersodb (Author):

With the refactor to use a features dictionary, this predict function ended up being broken. I believe these changes fix the issue, but correct me if I'm wrong.

Collaborator:

That is true; I fixed it in the contrastive losses PR, and it seems correct on the version of main that I have. However, it might change in further merges. Thanks for pointing it out.

@@ -89,8 +89,6 @@ def get_contrastive_loss(
         return self.ce_criterion(logits, labels)

     def set_parameters(self, parameters: NDArrays, config: Config) -> None:
-        assert isinstance(self.model, MoonModel)
emersodb (Author):

This is unnecessary, unless I'm mistaken. Removing it also allows FedPer models to be used with MOON clients, which is nice...

Collaborator:

Yes, it seems mypy doesn't have a problem with omitting it (that was originally why it was there). However, if we want to build FedPer on top of MOON, why don't we inherit from the MOON model? This would make it easier for users to understand that FedPer models can be used with MOON clients.

emersodb (Author):

That's a fair question. The main reason I didn't do that is that FedPer models exchange a partial subset of weights and don't, at least by default, admit projection modules for their features. They are very related, though, so it's possible that unifying them is a good idea. What do you think?

Collaborator:

projection_module is also optional in moon_base, so you can easily pass None for it in the inheritance, and everything should work well. I'd prefer fedper_base to inherit from both moon and partial_layer_exchange_model so users can see the relationship between all of them and the functionality fed_per builds on from both.

emersodb (Author):

Yeah I think that makes sense. I'll do that right now.
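
For illustration, the resulting inheritance might look roughly like the sketch below; the class and attribute names are assumptions based on this thread, not the repository's exact API:

```python
from abc import ABC, abstractmethod
from typing import List, Optional

import torch.nn as nn


class PartialLayerExchangeModel(nn.Module, ABC):
    """Models that exchange only a subset of their layers with the server."""

    @abstractmethod
    def layers_to_exchange(self) -> List[str]:
        raise NotImplementedError


class MoonModel(nn.Module):
    # Simplified stand-in: MOON pairs a feature extractor with a head and an
    # optional projection module for the contrastive feature space.
    def __init__(self, base_module: nn.Module, head_module: nn.Module,
                 projection_module: Optional[nn.Module] = None) -> None:
        super().__init__()
        self.base_module = base_module
        self.projection_module = projection_module
        self.head_module = head_module


class FedPerModel(MoonModel, PartialLayerExchangeModel):
    # FedPer reuses the MOON structure with projection_module=None and
    # exchanges only the global feature extractor's layers.
    def __init__(self, global_feature_extractor: nn.Module, local_prediction_head: nn.Module) -> None:
        super().__init__(base_module=global_feature_extractor,
                         head_module=local_prediction_head,
                         projection_module=None)

    def layers_to_exchange(self) -> List[str]:
        return [name for name in self.state_dict().keys() if name.startswith("base_module.")]
```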

emersodb (Author):

I added the assert back in now that FedPer inherits from MOON.


-class ApflModule(nn.Module):
+class ApflModule(PartialLayerExchangeModel):
emersodb (Author), Nov 22, 2023:

Models that exchange a subset of their weights now inherit from an abstract base class that forces the implementation of the layers_to_exchange function. This is just a formalism for now, but it should be useful in future iterations of the parameter exchanger mechanisms.

@@ -69,5 +71,7 @@ def update_alpha(self) -> None:
         self.alpha = alpha

     def layers_to_exchange(self) -> List[str]:
-        layers_to_exchange: List[str] = [layer for layer in self.state_dict().keys() if "global_model" in layer]
+        layers_to_exchange: List[str] = [
+            layer for layer in self.state_dict().keys() if layer.startswith("global_model.")
emersodb (Author):

Changing this function to mirror FENDA and FedPer. In particular, we only want layer names that start with global_model. rather than names that merely contain it somewhere.
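
A small illustration of why prefix matching is safer than substring matching (the layer names here are made up):

```python
# Hypothetical state_dict keys; only the second would be wrongly picked up by
# a substring check.
state_dict_keys = [
    "global_model.conv1.weight",           # belongs to the global model
    "local_model.to_global_model.weight",  # merely mentions "global_model"
]

substring_match = [k for k in state_dict_keys if "global_model" in k]
prefix_match = [k for k in state_dict_keys if k.startswith("global_model.")]

print(substring_match)  # both keys -- over-matches
print(prefix_match)     # ['global_model.conv1.weight']
```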

super().__init__()
self.global_feature_extractor = global_feature_extractor
self.local_prediction_head = local_prediction_head
self.flatten_features = flatten_features
emersodb (Author):

Flatten features is used to make the model compatible with MOON, which requires the intermediate feature representations to be flattened for similarity calculations.

Collaborator:

Why don't you flatten by default? Is there a specific use case in FedPer for the features compared to MOON?

emersodb (Author):

The reason I don't flatten by default is that it changes the shape of the features tensor. If a user were going to do something downstream with the features (other than the MOON calculations), I think they would be surprised if they weren't in the expected shape.
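
A concrete illustration of the shape change involved (assumed tensor shapes, not the example's actual model):

```python
import torch
import torch.nn.functional as F

# Convolutional features come out as (batch, channels, height, width).
features = torch.randn(8, 16, 4, 4)

# MOON's contrastive similarity compares per-sample feature vectors, so the
# feature maps must be flattened to (batch, features) first.
flat = features.reshape(len(features), -1)          # shape: (8, 256)
old_flat = torch.randn(8, 16, 4, 4).reshape(8, -1)  # e.g. features from a previous model
similarity = F.cosine_similarity(flat, old_flat, dim=1)  # shape: (8,)
```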

model: nn.Module = FedPerModel(
global_feature_extractor=FedPerGloalFeatureExtractor(),
local_prediction_head=FedPerLocalPredictionHead(),
flatten_features=True,
emersodb (Author):

Flatten features is used to make the model compatible with MOON, which requires the intermediate feature representations to be flattened for similarity calculations.

Collaborator:

Maybe you should write a comment about that, either here or in fedper_base.py?

emersodb (Author):

Yes, definitely. Thanks for pointing that out.

-        # Return preds and features as seperate dictionairy as in fenda base
-        return {"prediction": preds}, {"features": features.view(len(features), -1)}
+        # Return preds and features as separate dictionary as in fenda base
+        return {"prediction": preds}, {"features": features.reshape(len(features), -1)}
emersodb (Author):

reshape is slightly more general than view.
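
A quick illustration of the difference: view requires a compatible memory layout, while reshape falls back to a copy when a zero-copy view is impossible.

```python
import torch

# Transposing produces a non-contiguous tensor, like permuted conv features.
features = torch.randn(2, 3, 4).transpose(1, 2)  # shape (2, 4, 3)

try:
    features.view(len(features), -1)  # view cannot flatten non-contiguous memory
except RuntimeError as e:
    print(f"view failed: {e}")

flat = features.reshape(len(features), -1)  # reshape copies when needed
print(flat.shape)                           # torch.Size([2, 12])
```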

emersodb marked this pull request as ready for review on November 22, 2023 at 15:21.

@@ -0,0 +1,48 @@
# FedPer Federated Learning Example
This example demonstrates training a FedPer-type model on a non-IID subset of the MNIST data. The FL server
expects three clients to be spun up (i.e., it will wait until three clients report in before starting training). Each client
Collaborator:

The number of clients is two in the config; we probably need to update the config or the README.

emersodb (Author):

Maybe I'm looking in the wrong place, but the config at examples/fedper_example/config.yaml has n_clients: 3, so I think the README is accurate?

Collaborator:

Oh, true. I somehow got confused with the feature alignment example and thought this was the README added for it.

training and validation sets of MNIST in order to synthetically induce local variations in the statistical properties
of the clients' training/validation data. In theory, the models should be able to perform well on their local data
while learning from other clients' data with different statistical properties. The subsampling is specified by
sending a list of integers between 0 and 9 to the clients when they are run with the argument `--minority_numbers`.
Collaborator:

The client in this example does not have a --minority_numbers argument, and it does not use MinorityLabelBasedSampler. I think it assigns a random train/validation split.

emersodb (Author), Nov 28, 2023:

The client here, examples/fedper_example/client.py, does take --minority_numbers and uses MinorityLabelBasedSampler in its get_data_loaders function. Please correct me if I'm wrong, though.
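
For reference, a rough sketch of the argument wiring being described (illustrative only; the actual examples/fedper_example/client.py may differ in its details):

```python
import argparse

parser = argparse.ArgumentParser(description="FedPer example client (sketch)")
parser.add_argument("--dataset_path", type=str, required=True)
# A space-separated list of digits, e.g. --minority_numbers 1 2 3 4 5
parser.add_argument("--minority_numbers", nargs="*", type=int, default=[])
args = parser.parse_args()

# The parsed digits would be handed to MinorityLabelBasedSampler inside
# get_data_loaders to downsample those labels on this client.
minority_numbers = set(args.minority_numbers)
```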

* `local_epochs`: number of epochs each client will train for locally
* `batch_size`: size of the batches each client will train on
* `n_server_rounds`: number of rounds to run FL
* `downsampling_ratio`: amount of downsampling to perform for minority digits
Collaborator:

Again, the config does not have a downsampling_ratio variable, but it has an extra source_specified variable.

emersodb (Author):

So the config does have this argument. Based on the source_specified comment, I think you might be looking at the wrong config, as that sounds like the config for the feature_alignment_example?

Once the server has started and logged "FL starting," the next step, in separate terminals, is to start the three
clients. This is done by simply running (remembering to activate your environment)
```
python -m examples.fedper_example.client --dataset_path /path/to/data --minority_numbers <sequence of numbers>
```
Collaborator:

I believe that, because of the same issue, running this command would also produce an error.

emersodb (Author):

See the comments above; I think we're okay here. I ran the example with these args and it worked appropriately.


The argument `minority_numbers` specifies which digits (0-9) in the MNIST dataset the client will subsample to
simulate non-IID data between clients. For example, `--minority_numbers 1 2 3 4 5` will ensure that the client
downsamples these digits (using the `downsampling_ratio` specified in the config).
Collaborator:

This section should also be omitted.

emersodb (Author):

Given the above, I think we're good to leave this in?


sanaAyrml (Collaborator):

Everything seems good to me now. Thanks for the PR.

fatemetkl (Collaborator) left a comment:

Everything looks good to me as well! No other comments, thanks for adding the tests!

emersodb merged commit 7305bec into main on Nov 29, 2023 (2 checks passed).
emersodb deleted the dbe/fedper_implementation branch on November 29, 2023 at 16:08.