
SNOW-1636678 add server side param for complexity bounds #2273

Conversation

sfc-gh-aalam (Contributor)

  1. Which Jira issue is this PR addressing? Make sure that there is an accompanying issue to your PR.

    Fixes SNOW-1636678

  2. Fill out the following pre-review checklist:

    • I am adding a new automated test(s) to verify correctness of my new code
      • If this test skips Local Testing mode, I'm requesting review from @snowflakedb/local-testing
    • I am adding new logging messages
    • I am adding a new telemetry message
    • I am adding new credentials
    • I am adding a new dependency
    • If this is a new feature/behavior, I'm adding the Local Testing parity changes.
  3. Please describe how your code solves the related issue.

    This PR makes the following changes:

    1. Read default values for complexity score bounds from session parameters. These bounds are used in large query breakdown optimization.
    2. Allow users to easily change the bounds using session object.
    3. Send telemetry about updates made to bounds by the user.

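The three changes above can be sketched roughly as follows. The lower-bound default (10,000,000) and the lower-bound parameter name appear in this PR's diff; the upper-bound parameter name, the upper-bound default value, and the `SessionSketch` class are illustrative assumptions, not the actual Snowpark implementation:

```python
from typing import Dict, Optional, Tuple

# The lower-bound default and parameter name come from the PR diff; the
# upper-bound name and value below are hypothetical placeholders.
DEFAULT_COMPLEXITY_SCORE_LOWER_BOUND = 10_000_000
DEFAULT_COMPLEXITY_SCORE_UPPER_BOUND = 12_000_000  # assumed value
LOWER_PARAM = "PYTHON_SNOWPARK_LARGE_QUERY_BREAKDOWN_COMPLEXITY_LOWER_BOUND"
UPPER_PARAM = "PYTHON_SNOWPARK_LARGE_QUERY_BREAKDOWN_COMPLEXITY_UPPER_BOUND"  # assumed name


class SessionSketch:
    """Simplified stand-in for the Snowpark Session object (hypothetical)."""

    def __init__(self, server_params: Optional[Dict[str, int]] = None) -> None:
        params = server_params or {}
        # Step 1: read default bounds from server-side session parameters,
        # falling back to the hardcoded defaults when absent.
        self._complexity_bounds: Tuple[int, int] = (
            params.get(LOWER_PARAM, DEFAULT_COMPLEXITY_SCORE_LOWER_BOUND),
            params.get(UPPER_PARAM, DEFAULT_COMPLEXITY_SCORE_UPPER_BOUND),
        )
        self.telemetry_events = []  # stand-in for the real telemetry client

    @property
    def large_query_breakdown_complexity_bounds(self) -> Tuple[int, int]:
        return self._complexity_bounds

    @large_query_breakdown_complexity_bounds.setter
    def large_query_breakdown_complexity_bounds(self, value: Tuple[int, int]) -> None:
        # Step 2: let users override the bounds through the session object.
        if len(value) != 2 or value[0] >= value[1]:
            raise ValueError("bounds must be a (lower, upper) tuple with lower < upper")
        self._complexity_bounds = (value[0], value[1])
        # Step 3: record the user-made update (the PR sends a telemetry message).
        self.telemetry_events.append(("complexity_bounds_updated", self._complexity_bounds))
```

With no server overrides, reading `large_query_breakdown_complexity_bounds` yields the defaults; assigning a new tuple validates it and records a telemetry event.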
@sfc-gh-aalam added the NO-CHANGELOG-UPDATES label (This pull request does not need to update CHANGELOG.md) Sep 11, 2024
@sfc-gh-aalam marked this pull request as ready for review September 11, 2024 22:56
@sfc-gh-aalam requested a review from a team as a code owner September 11, 2024 22:56
)
# The complexity score lower bound is set to match COMPILATION_MEMORY_LIMIT
# in Snowflake. This is the limit where we start seeing compilation errors.
DEFAULT_COMPLEXITY_SCORE_LOWER_BOUND = 10_000_000
@sfc-gh-helmeleegy (Contributor), Sep 12, 2024

I'm assuming the compilation memory limit is measured in bytes? Can we also assume that our complexity score is measured in bytes? Wasn't it more based on the number of plan nodes?

If they don't use the same units, then is there a mapping between such units that we use?

"PYTHON_SNOWPARK_LARGE_QUERY_BREAKDOWN_COMPLEXITY_LOWER_BOUND"
)
# The complexity score lower bound is set to match COMPILATION_MEMORY_LIMIT
# in Snowflake. This is the limit where we start seeing compilation errors.
Contributor

Do we know if this is a soft limit or a hard limit? In other words, does the compiler intentionally error out as soon as the limit is exceeded, or does it keep going anyway with undefined behavior?

)
# The complexity score lower bound is set to match COMPILATION_MEMORY_LIMIT
# in Snowflake. This is the limit where we start seeing compilation errors.
DEFAULT_COMPLEXITY_SCORE_LOWER_BOUND = 10_000_000
Contributor

I'm also assuming that this limit is configurable on the compiler side. Is there a way to pull the currently configured value instead of hard coding it here - to make sure the two configurations are in sync?

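If the compiler-side limit were exposed as a queryable session parameter (an assumption — the PR does not confirm that COMPILATION_MEMORY_LIMIT is visible this way), the sync the reviewer asks about could look roughly like the sketch below. The `fetch_server_limit` helper, the cursor protocol, and the result-row layout are all illustrative:

```python
def fetch_server_limit(cursor, name: str, fallback: int) -> int:
    """Try to read a server-side parameter value; fall back to the
    hardcoded default if the parameter is not exposed or the query fails.

    `cursor` is any DB-API-style cursor. The SHOW PARAMETERS statement is
    real Snowflake SQL, but whether this particular limit is listed there,
    and the assumed row layout (key, value, ...), are not confirmed.
    """
    try:
        cursor.execute(f"SHOW PARAMETERS LIKE '{name}' IN SESSION")
        row = cursor.fetchone()
        if row is not None:
            return int(row[1])  # assumed layout: (key, value, default, ...)
    except Exception:
        pass
    return fallback
```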
Collaborator

@sfc-gh-yzou left a comment

@@ -784,6 +810,24 @@ def large_query_breakdown_enabled(self, value: bool) -> None:
"value for large_query_breakdown_enabled must be True or False!"
)

@large_query_breakdown_complexity_bounds.setter
def large_query_breakdown_complexity_bounds(self, value: Tuple[int, int]) -> None:
Collaborator

It might be easier to take a lower and an upper bound here; then you can reconstruct the tuple internally, and we could also make lower or upper optional.

Contributor (Author)

@sfc-gh-yzou this is a @setter method. I'm afraid we have to abide by this format.

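For context on that exchange: a Python `@property` setter is invoked by attribute assignment and therefore receives exactly one value, so separate `lower`/`upper` arguments cannot be expressed through it. A plain method (the hypothetical `set_bounds` below, not Snowpark code) would be needed to make each bound optional as the reviewer suggests:

```python
class BoundsDemo:
    """Minimal illustration of the @setter constraint; not Snowpark code."""

    def __init__(self) -> None:
        self._bounds = (10_000_000, 12_000_000)

    @property
    def bounds(self):
        return self._bounds

    @bounds.setter
    def bounds(self, value):
        # Attribute assignment passes a single object, so both bounds
        # must travel together as one tuple.
        lower, upper = value
        self._bounds = (lower, upper)

    def set_bounds(self, lower=None, upper=None):
        # A regular method can take each bound optionally and rebuild
        # the tuple internally, which a @setter cannot do.
        cur_lower, cur_upper = self._bounds
        self._bounds = (
            lower if lower is not None else cur_lower,
            upper if upper is not None else cur_upper,
        )
```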
@sfc-gh-aalam enabled auto-merge (squash) September 13, 2024 21:41
@sfc-gh-aalam merged commit 08ff293 into main Sep 13, 2024
36 checks passed
@sfc-gh-aalam deleted the aalam-SNOW-1636678-add-server-side-param-for-complexity-bounds branch September 13, 2024 22:07
@github-actions bot locked and limited conversation to collaborators Sep 13, 2024