[ 
https://issues.apache.org/jira/browse/IMPALA-5982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Armstrong updated IMPALA-5982:
----------------------------------
    Labels: parquet utf8  (was: )

> String columns saved to Parquet files should be annotated with the UTF8 
> logical type
> ------------------------------------------------------------------------------------
>
>                 Key: IMPALA-5982
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5982
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Zoltan Ivanfi
>            Priority: Major
>              Labels: parquet, utf8
>
> When creating Parquet files, Impala doesn't add the proper logical type 
> corresponding to the string SQL type. String columns should be annotated with 
> the [{{UTF8}}|https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#utf8-strings]
> logical type.
> The lack of the {{UTF8}} logical type annotation makes it harder to consume 
> the data using other tools, or even by Impala itself if the files are 
> manually moved around in the filesystem and a new table has to be created 
> based on their metadata.
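
To illustrate why the missing annotation matters, here is a minimal sketch of how a schema-inference tool might map a Parquet column to a SQL type. The function name `infer_sql_type` and the returned type names are hypothetical and are not taken from Impala or any other tool. The sketch relies only on the Parquet format rule that a BYTE_ARRAY column carrying the UTF8 annotation holds text, while an unannotated BYTE_ARRAY is just raw bytes.

```python
from typing import Optional


def infer_sql_type(physical_type: str, logical_type: Optional[str]) -> str:
    """Hypothetical mapping a schema-inference tool might apply to one column."""
    if physical_type != "BYTE_ARRAY":
        raise ValueError(f"this sketch only covers BYTE_ARRAY, got {physical_type}")
    # With the UTF8 annotation, the bytes are known to be text.
    if logical_type == "UTF8":
        return "STRING"
    # Without it, the reader only knows that these are raw bytes.
    return "BINARY"


# An annotated column can safely be exposed as a string column.
print(infer_sql_type("BYTE_ARRAY", "UTF8"))
# An unannotated column (as described in this issue) gives the tool
# no way to tell text from arbitrary binary data.
print(infer_sql_type("BYTE_ARRAY", None))
```

Under these assumptions, a tool that recreates a table from file metadata alone would have to guess at the type of every unannotated string column.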



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

