use TypeInfo to serialize to MetacatType instead of ObjectInspector #585
Previously, for both Hive and Iceberg tables, we used the Hive ObjectInspector when serializing the schema into Metacat types. However, the Hive ObjectInspector internally lowercases every field name, which breaks the fidelity of the field names.
Even though the Hive ObjectInspector contains field name and field type information, its core purpose is to inspect the actual data of a Hive field, which is not needed for schema description.
Instead, we can serialize the schema directly from the Hive TypeInfo, which also carries the field name and field type. An advantage of using TypeInfo is that it preserves the fidelity of the field names, including their original casing.
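To make the difference concrete, here is a minimal, self-contained sketch of the two paths. It does not use the real Hive or Metacat classes: `StructDescription`, `namesViaInspectorLikePath`, and `namesViaTypeInfoLikePath` are hypothetical stand-ins invented for illustration, and the lowercasing step only models the behavior described above rather than reproducing the actual converter code in this PR.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class FieldNameFidelitySketch {

    /** Hypothetical stand-in for a struct type description: ordered field name -> type name. */
    static final class StructDescription {
        final Map<String, String> fields = new LinkedHashMap<>();

        StructDescription field(String name, String typeName) {
            fields.put(name, typeName);
            return this;
        }
    }

    /** Models the inspector-based path: every field name is lowercased on the way through. */
    static List<String> namesViaInspectorLikePath(StructDescription struct) {
        List<String> names = new ArrayList<>();
        for (String name : struct.fields.keySet()) {
            names.add(name.toLowerCase(Locale.ROOT));
        }
        return names;
    }

    /** Models the type-description path: field names are copied exactly as declared. */
    static List<String> namesViaTypeInfoLikePath(StructDescription struct) {
        return new ArrayList<>(struct.fields.keySet());
    }

    public static void main(String[] args) {
        StructDescription schema = new StructDescription()
                .field("userId", "bigint")
                .field("eventTS", "string");

        System.out.println(namesViaInspectorLikePath(schema)); // prints [userid, eventts]
        System.out.println(namesViaTypeInfoLikePath(schema));  // prints [userId, eventTS]
    }
}
```

In this sketch a mixed-case field such as `userId` survives only on the second path, which is the property the change relies on.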