[
https://issues.apache.org/jira/browse/IMPALA-5982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Armstrong updated IMPALA-5982:
----------------------------------
Component/s: Backend
> String columns saved to Parquet files should be annotated with the UTF8
> logical type
> ------------------------------------------------------------------------------------
>
> Key: IMPALA-5982
> URL: https://issues.apache.org/jira/browse/IMPALA-5982
> Project: IMPALA
> Issue Type: Bug
> Components: Backend
> Reporter: Zoltan Ivanfi
> Priority: Major
> Labels: parquet, utf8
>
> When creating Parquet files, Impala doesn't add the proper logical type
> corresponding to the string SQL type. String columns should be annotated with
> the
> [{{UTF8}}|https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#utf8-strings]
> logical type.
> The lack of the {{UTF8}} logical type annotation makes it harder to consume
> the data using other tools, or even by Impala itself if the files are
> manually moved around in the filesystem and a new table has to be created
> based on their metadata.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]