Allow feature_shape to be passed to fx plan initialize #983
Conversation
…ation Signed-off-by: Patrick Foley <[email protected]>
Signed-off-by: Patrick Foley <[email protected]>
While this method works for single-input models with static shapes, it may not work for multi-tensor-input models, or for single-input models with custom input dtypes.
A general approach could be for the aggregator to take InputSpec as an argument. InputSpec could be a dictionary of lists of TensorSpec[TensorShape, DType].
Example multi-input-output model:
# Can generalize if `ast` parses it as a dict instead
--input_shape {'input_0': [1, 240, 240, 4], 'output_1': [1, 240, 240, 1]}
# Assumes first input to take the given shape, if `ast` parses as list.
--input_shape [1, 240, 240, 1]
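The dict-vs-list parsing described above can be sketched with the standard library's `ast` module. This is a hypothetical helper, not existing OpenFL code; the `input_0` naming convention is an assumption:

```python
import ast

def parse_input_shape(arg: str) -> dict:
    """Parse an --input_shape CLI string into a dict of named shapes.

    Hypothetical helper: a bare list such as "[1, 240, 240, 4]" is
    assumed to describe the first (single) input, while a dict literal
    maps input names to shapes explicitly.
    """
    parsed = ast.literal_eval(arg)
    if isinstance(parsed, dict):
        return parsed
    if isinstance(parsed, list):
        # Single-input convention: assign the shape to the first input.
        return {'input_0': parsed}
    raise ValueError(f'Unsupported input_shape value: {arg!r}')
```

Because `ast.literal_eval` only evaluates Python literals, arbitrary expressions in the CLI string are rejected rather than executed.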
Signed-off-by: Patrick Foley <[email protected]>
Signed-off-by: Patrick Foley <[email protected]>
This is a good point. I tested that the existing code (that makes use of …
This looks good to me, overall. It successfully avoids the need to load the data at the aggregator to initialize the model, which is a great addition. One comment: this won't always guarantee that the model is initialized using the specified input values, or that the dataloader will load data with the specified shape at the collaborators. For example, the …

In my opinion, this is not a breaking issue and this PR is good to go. However, going forward we should consider ways to reduce any potential confusion this could cause from a user's perspective, such as having the existing workspace templates error out if an invalid … is passed.
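The fail-fast behavior suggested here could look like the following sketch (a hypothetical `check_feature_shape` guard, not part of this PR), which errors out when a collaborator's actual data shape differs from the shape declared at plan initialization:

```python
def check_feature_shape(declared, loader):
    """Raise if the data loader's real feature shape differs from the
    shape that was declared at plan-initialization time.

    Hypothetical guard: `loader` is any object exposing a
    get_feature_shape() method, as OpenFL data loaders do.
    """
    actual = list(loader.get_feature_shape())
    if actual != list(declared):
        raise ValueError(
            f'Declared feature shape {declared} does not match '
            f'data loader feature shape {actual}'
        )
```

Running such a check when the collaborator starts would surface a shape mismatch immediately, instead of letting training fail later with a less obvious error.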
Looks great, thanks @psfoley !
"""Mock objects to eliminate extraneous dependencies"""

class MockDataLoader:
I suppose you don't extend DataLoader, because only the get_feature_shape() method is required during plan initialization? In this case, would it make sense to call this class FeatureShapeLoader?
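A minimal sketch of such a mock, assuming only `get_feature_shape()` is needed during plan initialization (the constructor signature here is illustrative, not necessarily the PR's actual code):

```python
class MockDataLoader:
    """Mock data loader exposing only the feature shape.

    Lets the aggregator initialize model weights without loading any
    real data; only get_feature_shape() is called during plan init.
    """

    def __init__(self, feature_shape):
        self.feature_shape = feature_shape

    def get_feature_shape(self):
        return self.feature_shape
```

Not inheriting from the real DataLoader keeps the mock free of dataset dependencies, which is the whole point of avoiding data access at the aggregator.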
* Remove need for aggregator to have dataset for model weight initialization
* Remove extra print arguments
* Address review comments
* Remove extraneous print statement

Signed-off-by: Patrick Foley <[email protected]>
This PR adds a feature_shape argument that can be passed to fx plan initialize. By passing this argument (which accepts list-type values, such as [1,28,28]), loading the data loader for the model owner / aggregator can be avoided. This is important because the aggregator may not have access to local data with which to generate the initial model.