[RFC] CDK to deploy EMR GLUE dev enviornment #1000
Labels
enhancement
New feature or request
Lang:PPL
Pipe Processing Language support
testing
test related feature
Is your feature request related to a problem?
Currently there is no easy way to spin up an environment to test spark sql/ppl commands against EMR spark instance leveraging GLUE catalogue.
What solution would you like?
Provide CDK to deploy AWS stack needed for manual testing within spark repo under docs with instructions for deployment.
CDK will deploy the following :
opensearch-spark-ppl_2.12-0.7.0-SNAPSHOT.jar
such that spark instance can leverage opensearch-sparktest.csv
containing data for test tables that GLUE will integrate.test.csv
inopensearch-spark-ppl_2.12-0.7.0-SNAPSHOT.jar
Implementation details
For user to run spark.sql() commands with PPL support there are 2 options:
What alternatives have you considered?
Do you have any additional context?
Possible CDK stack (stack has not been tested as I currently do not have access to AWS account):
The text was updated successfully, but these errors were encountered: