Skip to content

Commit

Permalink
e2e-Normalize-ITN
Browse files Browse the repository at this point in the history
  • Loading branch information
AnkitCLI committed Jun 6, 2024
1 parent 1ecdbcf commit ac1dd45
Show file tree
Hide file tree
Showing 8 changed files with 83 additions and 4 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -147,3 +147,52 @@ Feature: Normalize transform - Verify File source data transfer using Normalize
Then Verify the pipeline status is "Succeeded"
Then Close the pipeline logs
Then Validate output file generated by file sink plugin "fileSinkTargetBucket" is equal to expected output file "normalizeCsvAllDataTypeOutputFile"

@NORMALIZE_TEST3 @FILE_SINK_TEST
Scenario: To verify data is getting transferred from File source to File sink successfully with Normalized fields using empty attribute
Given Open Datafusion Project to configure pipeline
When Select plugin: "File" from the plugins list as: "Source"
When Expand Plugin group in the LHS plugins list: "Transform"
When Select plugin: "Transpose" from the plugins list as: "Transform"
Then Connect plugins: "File" and "Transpose" to establish connection
When Expand Plugin group in the LHS plugins list: "Sink"
When Select plugin: "File" from the plugins list as: "Sink"
Then Connect plugins: "Transpose" and "File2" to establish connection
Then Navigate to the properties page of plugin: "File"
Then Enter input plugin property: "referenceName" with value: "FileReferenceName"
Then Enter input plugin property: "path" with value: "normalizeTest3"
Then Select dropdown plugin property: "format" with option value: "csv"
Then Click plugin property: "skipHeader"
Then Click on the Get Schema button
Then Verify the Output Schema matches the Expected Schema: "normalizeCsvAllDataTypeFileSchema"
Then Validate "File" plugin properties
Then Close the Plugin Properties page
Then Navigate to the properties page of plugin: "Transpose"
Then Enter Normalize plugin Fields to be Mapped "normalizeFileValidFieldsMapping"
Then Enter Normalize plugin Fields to be Normalized "normalizeFieldsToBeNormalizedFile"
Then Select Normalize plugin output schema action: "clear"
Then Enter Normalize plugin outputSchema "normalizeBQValidOutputSchema"
Then Validate "Transpose" plugin properties
Then Close the Plugin Properties page
Then Navigate to the properties page of plugin: "File2"
Then Enter input plugin property: "referenceName" with value: "FileReferenceName"
Then Enter input plugin property: "path" with value: "fileSinkTargetBucket"
Then Replace input plugin property: "pathSuffix" with value: "yyyy-MM-dd-HH-mm-ss"
Then Select dropdown plugin property: "format" with option value: "csv"
Then Click plugin property: "writeHeader"
Then Validate "File2" plugin properties
Then Close the Plugin Properties page
Then Save the pipeline
Then Preview and run the pipeline
Then Wait till pipeline preview is in running state
Then Open and capture pipeline preview logs
Then Verify the preview run status of pipeline in the logs is "succeeded"
Then Close the pipeline logs
Then Close the preview
Then Deploy the pipeline
Then Run the Pipeline in Runtime
Then Wait till pipeline is in running state
Then Open and capture logs
Then Verify the pipeline status is "Succeeded"
Then Close the pipeline logs
Then Validate output file generated by file sink plugin "fileSinkTargetBucket" is equal to expected output file "normalizeCsvAllDataTypeOutputFile1"
Original file line number Diff line number Diff line change
Expand Up @@ -244,7 +244,16 @@ public static void createBucketWithNormalizeTest1File() throws IOException, URIS
BeforeActions.scenario.write("Normalize 1st bucket name - " + gcsSourceBucketName1);
}

@After(order = 1, value = "@NORMALIZE_TEST1")
@Before(order = 1, value = "@NORMALIZE_TEST3")
public static void createBucketWithNormalizeTest3File() throws IOException, URISyntaxException {
gcsSourceBucketName1 = createGCSBucketWithFile(PluginPropertyUtils.pluginProp(
"normalizeCsvAllDataTypeFile3"));
PluginPropertyUtils.addPluginProp("normalizeTest3", "gs://" + gcsSourceBucketName1 + "/" +
PluginPropertyUtils.pluginProp("normalizeCsvAllDataTypeFile3"));
BeforeActions.scenario.write("Normalize 1st bucket name - " + gcsSourceBucketName1);
}

@After(order = 1, value = "@NORMALIZE_TEST1 or @NORMALIZE_TEST3")
public static void deleteSourceBucketWithNormalizeTest1File() {
deleteGCSBucket(gcsSourceBucketName1);
gcsSourceBucketName1 = StringUtils.EMPTY;
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -131,7 +131,9 @@ normalizeCsvFileOutputSchema={ "type": "record", "name": "text", "fields": [ \
normalizeBQValidDatatypeOutputSchema=[{"key":"ID","value":"int"},{"key":"AttributeType","value":"string"},\
{"key":"AttributeValue","value":"string"},{"key":"Date","value":"string"}]
normalizeCsvAllDataTypeFile=testdata/file/CSV_Normalize_TEST_1.csv
normalizeCsvAllDataTypeFile3=testdata/file/CSV_Normalize_TEST_3.csv
normalizeCsvAllDataTypeOutputFile=e2e-tests/expected_outputs/CSV_NormalizeOutput.csv
normalizeCsvAllDataTypeOutputFile1=e2e-tests/file/expected_outputs/OUTPUT_FOR_NORMALIZE_TEST.csv
normalizeCsvAllDataTypeFileSchema=[{"key":"id","value":"int"},{"key":"name","value":"string"},\
{"key":"yearofbirth","value":"int"},{"key":"isdeleted","value":"boolean"},{"key":"email","value":"string"},\
{"key":"createddate","value":"string"},{"key":"revenue","value":"string"},{"key":"points","value":"string"},\
Expand Down Expand Up @@ -279,6 +281,7 @@ joinerInputTest1=dummy
joinerCsvNullFileInputTest1=dummy
joinerCsvNullFileInputTest2=dummy
normalizeTest1=dummy
normalizeTest3=dummy
csvTest=dummy
csvAllDataTypeTestFile=dummy
csvNoHeaderTestFile=dummy
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
id,name,yearofbirth,isdeleted,email,createddate,revenue,points,longdatatype,doubledatatype,date,null,BytesData
1,albert einstein,1879,true,[email protected],,900750000.01,3.14235678,-9223372036854770000,22.8,1996-07-21,,10111011101110111011
2,isaac newton,1643,false,[email protected],,900750000.01,3.14235678906787648,-9223372036854770000,123.08,1996-07-21,str,10111011101110111011
3,marie curie,1867,true,[email protected],2021-09-20 11:27:50 UTC,900750000.01,3.14235678,-9223372036854770000,124.97,1996-07-21,,10111011101110111011
4,galilée,1564,false,[email protected],2021-09-20 11:27:50 UTC,900750000.01,3.14235678,-2^63,234.89,1996-07-21,,10111011101110111011
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
AttributeValue,AttributeType,ID,Date
[email protected],email,1,null
900750000.01,revenue,1,null
3.14235678,points,1,null
[email protected],email,2,null
900750000.01,revenue,2,null
3.14235678906787648,points,2,null
[email protected],email,3,2021-09-20 11:27:50 UTC
900750000.01,revenue,3,2021-09-20 11:27:50 UTC
3.14235678,points,3,2021-09-20 11:27:50 UTC
[email protected],email,4,2021-09-20 11:27:50 UTC
900750000.01,revenue,4,2021-09-20 11:27:50 UTC
3.14235678,points,4,2021-09-20 11:27:50 UTC
2 changes: 1 addition & 1 deletion google-cloud
Submodule google-cloud updated 123 files
2 changes: 1 addition & 1 deletion wrangler-transform
Submodule wrangler-transform updated 79 files
+2 −2 .github/workflows/build.yml
+6 −12 .github/workflows/e2e.yml
+2 −2 pom.xml
+50 −0 wrangler-api/src/main/java/io/cdap/wrangler/api/RemoteDirectiveResponse.java
+5 −0 wrangler-core/pom.xml
+1 −1 wrangler-core/src/main/antlr4/io/cdap/wrangler/parser/Directives.g4
+3 −3 wrangler-core/src/main/java/io/cdap/directives/column/Copy.java
+65 −9 wrangler-core/src/main/java/io/cdap/directives/column/SetType.java
+1 −1 wrangler-core/src/main/java/io/cdap/directives/datamodel/DataModelMapColumn.java
+2 −0 wrangler-core/src/main/java/io/cdap/directives/row/SendToErrorAndContinue.java
+11 −1 wrangler-core/src/main/java/io/cdap/directives/xml/XmlToJson.java
+3 −2 wrangler-core/src/main/java/io/cdap/wrangler/parser/MigrateToV2.java
+3 −0 wrangler-core/src/main/java/io/cdap/wrangler/schema/DirectiveOutputSchemaGenerator.java
+1 −0 wrangler-core/src/main/java/io/cdap/wrangler/schema/TransientStoreKeys.java
+46 −19 wrangler-core/src/main/java/io/cdap/wrangler/utils/ColumnConverter.java
+100 −0 wrangler-core/src/main/java/io/cdap/wrangler/utils/KryoSerializer.java
+1 −1 wrangler-core/src/main/java/io/cdap/wrangler/utils/SchemaConverter.java
+3 −1 wrangler-core/src/test/java/io/cdap/directives/column/CopyTest.java
+44 −9 wrangler-core/src/test/java/io/cdap/directives/column/SetTypeTest.java
+54 −0 wrangler-core/src/test/java/io/cdap/directives/parser/XmlToJsonTest.java
+25 −0 wrangler-core/src/test/java/io/cdap/directives/row/SendToErrorAndContinueTest.java
+1 −1 wrangler-core/src/test/java/io/cdap/wrangler/parser/GrammarMigratorTest.java
+1 −0 wrangler-core/src/test/java/io/cdap/wrangler/utils/JsonTestData.java
+146 −0 wrangler-core/src/test/java/io/cdap/wrangler/utils/KryoSerializerTest.java
+18 −0 wrangler-core/src/test/java/io/cdap/wrangler/utils/ObjectSerDeTest.java
+3 −1 wrangler-docs/directives/parse-xml-to-json.md
+3 −2 wrangler-docs/directives/set-type.md
+1 −1 wrangler-service/src/main/java/io/cdap/wrangler/service/directive/AbstractDirectiveHandler.java
+8 −1 wrangler-service/src/main/java/io/cdap/wrangler/service/directive/RemoteDirectiveRequest.java
+31 −4 wrangler-service/src/main/java/io/cdap/wrangler/service/directive/RemoteExecutionTask.java
+19 −2 wrangler-service/src/main/java/io/cdap/wrangler/service/directive/WorkspaceHandler.java
+2 −2 wrangler-transform/pom.xml
+6 −5 wrangler-transform/src/e2e-test/features/Wrangler/DataTypeParsers.feature
+43 −0 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsAvro.feature
+4 −4 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsCsv.feature
+40 −0 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsExcel.feature
+3 −3 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsFixedLength.feature
+4 −5 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsHl7.feature
+43 −0 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsJson.feature
+43 −0 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsLog.feature
+43 −0 wrangler-transform/src/e2e-test/features/Wrangler/ParseAsXmlToJson.feature
+43 −0 wrangler-transform/src/e2e-test/features/Wrangler/Runtime.feature
+101 −19 wrangler-transform/src/e2e-test/java/io/cdap/plugin/common/stepsdesign/TestSetupHooks.java
+3 −3 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_DateTime
+3 −0 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_avro
+2 −0 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_excel
+2 −2 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_fixedlength
+3 −0 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_json
+1 −0 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_log
+6 −0 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_parse_xmltojson
+5 −0 wrangler-transform/src/e2e-test/resources/BQValidationExpectedFiles/Directive_wrangler_GroupBy
+2 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryCreateTableQuery.txt
+1 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryCreateTableQueryAvro.txt
+1 −1 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryCreateTableQueryDatetime.txt
+1 −1 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryCreateTableQueryFxdlen.txt
+1 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryCreateTableQueryLog.txt
+1 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryCreateTableQueryXml.txt
+10 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQuery.txt
+3 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQueryAvro.txt
+3 −3 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQueryDatetime.txt
+1 −1 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQueryFxdlen.txt
+3 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQueryLog.txt
+5 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQueryXml.txt
+6 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQueryInsertDataQueryparsejson.txt
+1 −0 wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/BigQuerycreateTableQueryjson.txt
+ wrangler-transform/src/e2e-test/resources/BQtesdata/BigQuery/test1.xlsx
+1 −0 wrangler-transform/src/e2e-test/resources/errorMessage.properties
+27 −3 wrangler-transform/src/e2e-test/resources/pluginParameters.properties
+215 −0 wrangler-transform/src/e2e-test/resources/testData/Wrangler/BQ2BQwithWrnglerNGrpby-cdap-data-pipeline (1).json
+704 −0 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parseAsAvro-cdap-data-pipeline (1).json
+4 −10 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_HL7_Wrangler-cdap-data-pipeline (1).json
+4 −10 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_csv_wrangle-cdap-data-pipeline.json
+167 −0 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_datetime_wrangler-cdap-data-pipeline.json
+180 −0 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_excel_wrangler_copy-cdap-data-pipeline.json
+26 −38 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_fixedlength_wrangler-cdap-data-pipeline.json
+467 −0 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_json_wrangler1-cdap-data-pipeline.json
+168 −0 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_log_wrangler_copy-cdap-data-pipeline.json
+4 −10 wrangler-transform/src/e2e-test/resources/testData/Wrangler/parse_timestamp_wrangle-cdap-data-pipeline.json
+28 −32 ...ler-transform/src/e2e-test/resources/testData/Wrangler/parse_xmltojson_wrangler-cdap-data-pipeline (1).json

0 comments on commit ac1dd45

Please sign in to comment.