[Bug] Make figure models subclassable #606

antonymilne · 2024-07-30T16:34:17Z

Description

Solving -> https://github.com/McK-Internal/vizro-internal/issues/1166

Screenshot

Notice

I acknowledge and agree that, by checking this box and clicking "Submit Pull Request":
- I submit this contribution under the Apache 2.0 license and represent that I am entitled to do so on behalf of myself, my employer, or relevant third parties, as applicable.
- I certify that (a) this contribution is my original creation and / or (b) to the extent it is not my original creation, I am authorized to submit this contribution on behalf of the original creator(s) or their licensees.
- I certify that the use of this contribution as authorized by the Apache 2.0 license does not violate the intellectual property rights of anyone else.
- I have not referenced individuals, products or companies in any commits, directly or indirectly.
- I have not added data or restricted code in any commits, directly or indirectly.

…e-figures-subclassable

for more information, see https://pre-commit.ci

…kinsey/vizro into fix/make-figures-subclassable

…e-figures-subclassable

huong-li-nguyen

LGTM 👍

vizro-core/src/vizro/models/_components/ag_grid.py

vizro-core/src/vizro/models/_components/graph.py

vizro-core/src/vizro/models/_components/table.py

maxschulz-COL

LGTM, couple of suggestions

vizro-core/changelog.d/20240801_113636_petar_pejovic_make_figures_subclassable.md

maxschulz-COL · 2024-08-01T13:19:02Z

vizro-core/src/vizro/models/_action/_action.py

@@ -32,7 +31,7 @@ class Action(VizroBaseModel):

    """

-    function: CapturedCallable = Field(..., import_path=vizro.actions, mode="action", description="Action function.")
+    function: CapturedCallable = Field(..., import_path="vizro.actions", mode="action", description="Action function.")


Maybe just one question to double check - could this now allow people to circumvent security? I remember that part of this was also to forbid people injecting arbitrary code - now that this is a string, is this maybe somehow easier now?

FYI: @antonymilne, @huong-li-nguyen

Thank you @maxschulz-COL for the great question that prompted me to do more research on this topic!

The main purpose of the import_path is not to forbid you to use arbitrary code, but to enable you to use predefined figure functions (e.g. scatter, dash_ag_grid, kpi_card) when you're developing a dashboard using the yaml configuration. So, the import_path represents a path from where the _target_ (string) figure function (scatter, dash_ag_grid, kpi_card) will be imported.

Okay, but is it possible to inject a custom figure function (e.g. my_advanced_scatter, my_custom_dash_ag_grid) into your app? Yes, it's possible to do it with the Python Vizro configuration.
So what about the yaml config? -> That was not possible before this PR, but now it is.

Changing the import_path to the relative path string (changes made in this PR) only enables you to in a really hacky way inject a custom figure function into your yaml dashboard. Here are the steps to do it:

(rename vizro lib) mv <path_to_site_packages>/vizro <path_to_site_packages>/vizro_original

create a vizro folder near your app.py and add a table.py module file inside:

example/ ├── vizro/ │ └── table.py └── app.py

define the custom table figure function inside the table.py: -> @capture("table")(def my_custom_table(data_frame)...)

set the following yaml configuration

- figure: _target_: my_custom_table data_frame: df type: table

That was pretty hacky (and probably possible even in the current vizro version) 😄
What is more important to acknowledge is that with "making figure models to be inheritable" we actually enable our users to in a pretty much native way inject custom figure function into the yaml Vizro config 🎉 . Here are the steps to do it:

# app.py class MyTable(vm.Table): type: Literal["my_table"] = "my_table" figure: CapturedCallable = Field( ..., import_path="my_table_file", mode="table", description="Function that returns a `Dash DataTable`." ) vm.Page.add_type("components", MyTable)

# my_table_file.py from vizro.models.types import capture from vizro.tables import dash_data_table @capture("table") def my_custom_table_figure(data_frame): return dash_data_table(data_frame)()

# dashboard.yaml - figure: _target_: my_custom_table_figure data_frame: df_kpi type: my_table

Okay the question is: Should we announce it?
There are many examples in our docs where it stands: "# Custom X are currently only possible via python configuration".

Amazing research @petar-qb ! I think this is a great addition then! And I think we should consider changing it in the docs/adding it to the docs. What do you think @huong-li-nguyen @antonymilne ?

@antonymilne I am still not 100% sure about the fact that this is not security related. I understand now that we allow our custom figures to be used like that in a yaml config, and now even user created custom figures could be used in a yaml config (by subclassing the figure models), but there is also a conversation I remember we had with Ismail that was dedicatedly around this topic. Can you recall?

Great question @maxschulz-COL and great information indeed @petar-qb.

@maxschulz-COL I do remember the conversation with Ismail, and you're absolutely right to question the security of this 👍 I spent a while thinking about this while writing the solution and decided that nothing has changed here at all.

previously we imported e.g. vizro.tables and then users could specify a target function in that namespace; now exactly the same thing happens, it's just that the place that import happens has changed (it's in CapturedCallable rather than the model that uses CapturedCallable). Either way we are doing exactly the same getattr(imported_module, target_function) so security here is the same

so long as you have access to Python, it's always possible to "circumvent" security of this to inject an arbitrary function from yaml. e.g. before this change it would already be possible to make your own model (just not by subclassing) and specify an arbitrary import path in there. Or actually you could even just do vizro.tables.my_custom_function = custom_function without needing a custom model at all (we never advertised this because it feels hacky). So nothing has changed here also. The significance of the security is that someone who has access only to yaml should not be able to inject an arbitrary Python function, and that is still the case now as it was before. (if you have access to Python then you can easily do damage through millions of ways anyway, so doing it through custom graphs is pretty much irrelevant)

So the only difference this change really makes is that now that subclassing is possible, there's a much more obvious way to inject a custom function than there was before. From a security perspective this is fine, because it still requires access to Python. The interesting thing here is that, as @petar-qb pointed out (but I had completely forgotten about so thank you very much for saying it!!), from a usability perspective there's now a much cleaner route for a user to inject a custom function than before 💯

So should we announce this new route? Not sure. Let me just look back on my notes to see if there's any other routes we should prefer instead. e.g. imagine the capture decorator could register the function as usable from YAML configuration like this:

@capture("graph") # or maybe you need to explicitly say register=True to register it as available from YAML def f(): ... # in capture function somewhere setattr(vizro.tables, captured_function) # or maybe we make a new namespace for it setattr(my_custom_stuff, captured_function) # vizro.tables.captured_function exists and hence can be used from # YAML without any subclassing or any new models needed

The advantage of the subclassing approach is that it's already possible (and will always remain possible, even if we enable other routes in future) and requires no effort from us.

So what I think we should do is a make a ticket for this to consider possible solutions and whether we want to advertise this one or if we think actually there are better approaches that we would prefer. While it would be really nice to have complete parity between YAML and Python, I don't think it's super urgent tbh. But if we like we can already advertise this new subclassing approach as one way of doing it (since it will always remain possible) and then in the future we might develop an even better solution. The YAML code should stay the same whatever the solution so it's just a question of writing the subclassing part up in docs.

@antonymilne really thanks for your opinion. 🥇

Here is the ticket that includes the considering of the final solution and the announcement.

Yes, read through it as well in detail, great summary indeed! 💪

vizro-core/examples/scratch_dev/app.py

…e-figures-subclassable

Co-authored-by: Li Nguyen <[email protected]>

…res_subclassable.md Co-authored-by: Maximilian Schulz <[email protected]>

…kinsey/vizro into fix/make-figures-subclassable

antonymilne

Thank you very much for finishing this off! I can't approve my own PR but take this as an approval 😁 Subject to one comment I had about tests.

vizro-core/src/vizro/models/types.py

antonymilne · 2024-08-02T10:04:53Z

vizro-core/src/vizro/models/_action/_action.py

@@ -32,7 +31,7 @@ class Action(VizroBaseModel):

    """

-    function: CapturedCallable = Field(..., import_path=vizro.actions, mode="action", description="Action function.")
+    function: CapturedCallable = Field(..., import_path="vizro.actions", mode="action", description="Action function.")


Great question @maxschulz-COL and great information indeed @petar-qb.

@maxschulz-COL I do remember the conversation with Ismail, and you're absolutely right to question the security of this 👍 I spent a while thinking about this while writing the solution and decided that nothing has changed here at all.

previously we imported e.g. vizro.tables and then users could specify a target function in that namespace; now exactly the same thing happens, it's just that the place that import happens has changed (it's in CapturedCallable rather than the model that uses CapturedCallable). Either way we are doing exactly the same getattr(imported_module, target_function) so security here is the same

so long as you have access to Python, it's always possible to "circumvent" security of this to inject an arbitrary function from yaml. e.g. before this change it would already be possible to make your own model (just not by subclassing) and specify an arbitrary import path in there. Or actually you could even just do vizro.tables.my_custom_function = custom_function without needing a custom model at all (we never advertised this because it feels hacky). So nothing has changed here also. The significance of the security is that someone who has access only to yaml should not be able to inject an arbitrary Python function, and that is still the case now as it was before. (if you have access to Python then you can easily do damage through millions of ways anyway, so doing it through custom graphs is pretty much irrelevant)

So the only difference this change really makes is that now that subclassing is possible, there's a much more obvious way to inject a custom function than there was before. From a security perspective this is fine, because it still requires access to Python. The interesting thing here is that, as @petar-qb pointed out (but I had completely forgotten about so thank you very much for saying it!!), from a usability perspective there's now a much cleaner route for a user to inject a custom function than before 💯

So should we announce this new route? Not sure. Let me just look back on my notes to see if there's any other routes we should prefer instead. e.g. imagine the capture decorator could register the function as usable from YAML configuration like this:

@capture("graph") # or maybe you need to explicitly say register=True to register it as available from YAML def f(): ... # in capture function somewhere setattr(vizro.tables, captured_function) # or maybe we make a new namespace for it setattr(my_custom_stuff, captured_function) # vizro.tables.captured_function exists and hence can be used from # YAML without any subclassing or any new models needed

The advantage of the subclassing approach is that it's already possible (and will always remain possible, even if we enable other routes in future) and requires no effort from us.

So what I think we should do is a make a ticket for this to consider possible solutions and whether we want to advertise this one or if we think actually there are better approaches that we would prefer. While it would be really nice to have complete parity between YAML and Python, I don't think it's super urgent tbh. But if we like we can already advertise this new subclassing approach as one way of doing it (since it will always remain possible) and then in the future we might develop an even better solution. The YAML code should stay the same whatever the solution so it's just a question of writing the subclassing part up in docs.

antonymilne added 2 commits July 30, 2024 17:32

Make Graph import_path a string rather than module

64f1ea1

Make Graph import_path a string rather than module

3a05f16

petar-qb self-requested a review July 31, 2024 07:33

Changes in all other figure components + tests

f7ded0d

petar-qb assigned antonymilne and petar-qb Aug 1, 2024

Merge branch 'main' of https://github.com/mckinsey/vizro into fix/mak…

48bcbd6

…e-figures-subclassable

petar-qb marked this pull request as ready for review August 1, 2024 09:39

petar-qb requested review from Joseph-Perkins, huong-li-nguyen and maxschulz-COL as code owners August 1, 2024 09:39

petar-qb and others added 2 commits August 1, 2024 11:41

changelog message

c8bc3bb

[pre-commit.ci] auto fixes from pre-commit.com hooks

73011a0

for more information, see https://pre-commit.ci

huong-li-nguyen changed the title ~~[Fix] Make figure models subclassable~~ [Bug] Make figure models subclassable Aug 1, 2024

petar-qb and others added 6 commits August 1, 2024 14:06

scratch dev example changed

b64b506

[pre-commit.ci] auto fixes from pre-commit.com hooks

ff58b2f

for more information, see https://pre-commit.ci

Return changes for scratch_dev example

b0a74ff

Merge branch 'fix/make-figures-subclassable' of https://github.com/mc…

81a1b99

…kinsey/vizro into fix/make-figures-subclassable

Merge branch 'main' of https://github.com/mckinsey/vizro into fix/mak…

502934d

…e-figures-subclassable

Fix scratch_dev example

68cd1ef

huong-li-nguyen approved these changes Aug 1, 2024

View reviewed changes

vizro-core/src/vizro/models/_components/ag_grid.py Outdated Show resolved Hide resolved

vizro-core/src/vizro/models/_components/graph.py Outdated Show resolved Hide resolved

vizro-core/src/vizro/models/_components/table.py Outdated Show resolved Hide resolved

maxschulz-COL approved these changes Aug 1, 2024

View reviewed changes

petar-qb and others added 8 commits August 2, 2024 07:50

Fix integration tests

6e4fe88

Merge branch 'main' of https://github.com/mckinsey/vizro into fix/mak…

1157aff

…e-figures-subclassable

Update vizro-core/src/vizro/models/_components/ag_grid.py

5c551ed

Co-authored-by: Li Nguyen <[email protected]>

Update vizro-core/src/vizro/models/_components/graph.py

3a2b6f9

Co-authored-by: Li Nguyen <[email protected]>

Update vizro-core/src/vizro/models/_components/table.py

c6568bb

Co-authored-by: Li Nguyen <[email protected]>

Update vizro-core/changelog.d/20240801_113636_petar_pejovic_make_figu…

9a88f18

…res_subclassable.md Co-authored-by: Maximilian Schulz <[email protected]>

Merge branch 'fix/make-figures-subclassable' of https://github.com/mc…

8a8b346

…kinsey/vizro into fix/make-figures-subclassable

Tests added

6296f32

antonymilne commented Aug 2, 2024

View reviewed changes

petar-qb added 2 commits August 2, 2024 15:06

Add invalid_import_path test

22a0ff6

Merge main with the feature branch

c8111cf

petar-qb merged commit 8eb353e into main Aug 2, 2024
30 checks passed

petar-qb deleted the fix/make-figures-subclassable branch August 2, 2024 13:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Make figure models subclassable #606

[Bug] Make figure models subclassable #606

antonymilne commented Jul 30, 2024 •

edited by petar-qb

Loading

huong-li-nguyen left a comment

maxschulz-COL left a comment

maxschulz-COL Aug 1, 2024

petar-qb Aug 2, 2024 •

edited

Loading

maxschulz-COL Aug 2, 2024

antonymilne Aug 2, 2024

petar-qb Aug 2, 2024

maxschulz-COL Aug 6, 2024

antonymilne left a comment

antonymilne Aug 2, 2024

[Bug] Make figure models subclassable #606

[Bug] Make figure models subclassable #606

Conversation

antonymilne commented Jul 30, 2024 • edited by petar-qb Loading

Description

Screenshot

Notice

huong-li-nguyen left a comment

Choose a reason for hiding this comment

maxschulz-COL left a comment

Choose a reason for hiding this comment

maxschulz-COL Aug 1, 2024

Choose a reason for hiding this comment

petar-qb Aug 2, 2024 • edited Loading

Choose a reason for hiding this comment

maxschulz-COL Aug 2, 2024

Choose a reason for hiding this comment

antonymilne Aug 2, 2024

Choose a reason for hiding this comment

petar-qb Aug 2, 2024

Choose a reason for hiding this comment

maxschulz-COL Aug 6, 2024

Choose a reason for hiding this comment

antonymilne left a comment

Choose a reason for hiding this comment

antonymilne Aug 2, 2024

Choose a reason for hiding this comment

antonymilne commented Jul 30, 2024 •

edited by petar-qb

Loading

petar-qb Aug 2, 2024 •

edited

Loading