EXPERIMENTAL: PPL to catalyst plan translator #2041

YANG-DB · 2023-08-30T20:47:06Z

Description

In Apache Spark, the DataFrame API serves as a programmatic interface for data manipulation and queries, allowing the construction of complex operations using a chain of method calls. This API can work in tandem with other query languages like SQL or PPL.

For instance, if you have a PPL query and a translator, you can convert it into DataFrame operations to generate an optimized execution plan. Spark's underlying Catalyst optimizer will convert these DataFrame transformations and actions into an optimized physical plan executed over RDDs or Datasets.

The pi is addressing the translating the PPL query (using the logical plan) into the spark corespondent logical plan - Catalyst based execution library.

Issues Resolved

#1875

Check List

New functionality includes testing.
- All tests pass, including unit test, integration test and doctest
New functionality has been documented.
- New functionality has javadoc added
- New functionality has user manual doc added
Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: YANGDB <[email protected]>

codecov · 2023-08-30T20:55:32Z

Codecov Report

Merging #2041 (5a4e59d) into main (627189b) will decrease coverage by 0.83%.
Report is 5 commits behind head on main.
The diff coverage is 71.42%.

❗ Current head 5a4e59d differs from pull request most recent head acaf59c. Consider uploading reports for the commit acaf59c to get more accurate results

@@             Coverage Diff              @@
##               main    #2041      +/-   ##
============================================
- Coverage     97.30%   96.47%   -0.83%     
- Complexity     4623     4718      +95     
============================================
  Files           407      412       +5     
  Lines         11934    12327     +393     
  Branches        828      864      +36     
============================================
+ Hits          11612    11893     +281     
- Misses          315      397      +82     
- Partials          7       37      +30

Flag	Coverage Δ
sql-engine	`96.47% <71.42%> (-0.83%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Changed	Coverage Δ
.../opensearch/sql/spark/ppl/CatalystPlanContext.java	`56.25% <56.25%> (ø)`
...search/sql/spark/ppl/CatalystQueryPlanVisitor.java	`66.94% <66.94%> (ø)`
...rc/main/java/org/opensearch/sql/utils/Builder.java	`74.50% <74.50%> (ø)`

... and 5 files with indirect coverage changes

…QL translation path Adding multiple PPL queries for different test purpose Signed-off-by: YANGDB <[email protected]>

…e cases Signed-off-by: YANGDB <[email protected]>

Signed-off-by: YANGDB <[email protected]>

include initial data-type transformations Signed-off-by: YANGDB <[email protected]>

vmmusings · 2023-10-03T23:05:33Z

@YANG-DB Closing this pull request. Reopen if required.

YANG-DB added 2 commits August 29, 2023 17:11

add ppl to catalyst logical plan transformer

82f5193

Signed-off-by: YANGDB <[email protected]>

update tests

9a8700e

Signed-off-by: YANGDB <[email protected]>

YANG-DB added 5 commits August 30, 2023 17:06

update with an SQL query builder for future support in the PPL into S…

839debc

…QL translation path Adding multiple PPL queries for different test purpose Signed-off-by: YANGDB <[email protected]>

Adding multiple diverse PPL queries for different test purpose and us…

d7337b6

…e cases Signed-off-by: YANGDB <[email protected]>

Adding multiple diverse PPL queries for different test purpose and us…

5a4e59d

…e cases Signed-off-by: YANGDB <[email protected]>

add ComparatorTransformer skeleton

58febc3

Signed-off-by: YANGDB <[email protected]>

support basic filter plans with basic literals add ComparatorTransformer

acaf59c

include initial data-type transformations Signed-off-by: YANGDB <[email protected]>

vmmusings closed this Oct 3, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

EXPERIMENTAL: PPL to catalyst plan translator #2041

EXPERIMENTAL: PPL to catalyst plan translator #2041

YANG-DB commented Aug 30, 2023

codecov bot commented Aug 30, 2023 •

edited

Loading

vmmusings commented Oct 3, 2023

EXPERIMENTAL: PPL to catalyst plan translator #2041

EXPERIMENTAL: PPL to catalyst plan translator #2041

Conversation

YANG-DB commented Aug 30, 2023

Description

Issues Resolved

Check List

codecov bot commented Aug 30, 2023 • edited Loading

Codecov Report

vmmusings commented Oct 3, 2023

codecov bot commented Aug 30, 2023 •

edited

Loading