Automatically read JSON types #1203

Guilherme-B · 2024-03-25T23:14:23Z

As noted in the repository, Spark does not support JSON types; for this reason, the BQ connector converts a JSON record into a String. Spark 3.4.0 introduced a new method, `to`, which allows us to cast a DataFrame to a target schema. However, the cast fails and forces us to use the fromJson function, which essentially means storing a separate StructType for each column that needs to be parsed; in other words, hardcoding.

I was wondering if there is any other way to do this? Can we somehow determine if a column is of the JSON type in BigQuery (metadata does not seem to be available on the read DataFrame) and if so, retrieve only the relevant schema portion for that specific column and convert it into a StructType?
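
To make the "hardcoding" concern concrete, here is a minimal sketch of one possible workaround: deriving the schema description from a sample JSON string instead of storing it by hand. It uses only the Python standard library and produces a dictionary in the layout that PySpark's `StructType.fromJson` accepts. The helper names `spark_type_of` and `schema_dict_from_sample` are hypothetical, the sample record is made up, and the sketch assumes the caller already knows which columns hold JSON (it does not detect BigQuery JSON columns) and that a single sample is representative of the whole column.

```python
import json


def spark_type_of(value):
    """Return a Spark schema JSON fragment for one decoded JSON value.

    Hypothetical helper: a simplification that looks at one sample only.
    """
    if isinstance(value, bool):  # check bool first: bool is a subclass of int
        return "boolean"
    if isinstance(value, int):
        return "long"
    if isinstance(value, float):
        return "double"
    if isinstance(value, dict):
        return {
            "type": "struct",
            "fields": [
                {"name": k, "type": spark_type_of(v), "nullable": True, "metadata": {}}
                for k, v in value.items()
            ],
        }
    if isinstance(value, list):
        # Assumption: the first element is representative; empty arrays fall back to string.
        element = spark_type_of(value[0]) if value else "string"
        return {"type": "array", "elementType": element, "containsNull": True}
    return "string"  # strings and nulls fall back to string


def schema_dict_from_sample(sample_json):
    """Build a struct schema dictionary from one sample JSON object string."""
    schema = spark_type_of(json.loads(sample_json))
    if not isinstance(schema, dict) or schema.get("type") != "struct":
        raise ValueError("sample must be a JSON object")
    return schema


# Made-up sample standing in for one value of a JSON column read as a String.
sample = '{"id": 7, "tags": ["a"], "owner": {"name": "x", "active": true}}'
schema = schema_dict_from_sample(sample)
```

With PySpark available, the resulting dictionary could then be passed to `StructType.fromJson(schema)` and the struct used to parse the string column. Spark's own `schema_of_json` function in `pyspark.sql.functions` covers similar ground inside Spark. Neither approach answers the question of how to discover which columns are JSON in BigQuery.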

davidrabinowitz added the enhancement New feature or request label Mar 26, 2024
