[ 
https://issues.apache.org/jira/browse/DRILL-7825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17253599#comment-17253599
 ] 

James Turton commented on DRILL-7825:
-------------------------------------

As a workaround until proper support lands, storing UUIDs in a VARBINARY 
column would incur no space penalty.  Encoding and decoding may be a bit 
awkward, but the functions `STRING_BINARY` and `BINARY_STRING` may be of 
interest.
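
To illustrate the space argument outside of Drill, here is a minimal Python 
sketch that packs a UUID into the 16 raw bytes a VARBINARY column would hold 
and compares that with the 36-character text form.  The helper names and the 
sample UUID are hypothetical, chosen only for illustration, and the sketch says 
nothing about how Drill itself encodes the values.

```python
import uuid

# Hypothetical helpers for illustration only; they are not part of Drill.
def uuid_to_bytes(text: str) -> bytes:
    """Pack a canonical UUID string into its 16 raw bytes."""
    return uuid.UUID(text).bytes

def bytes_to_uuid(raw: bytes) -> str:
    """Unpack 16 raw bytes back into the canonical UUID string."""
    return str(uuid.UUID(bytes=raw))

# Arbitrary sample value, not taken from the attached uuid.parquet file.
sample = "123e4567-e89b-12d3-a456-426614174000"

raw = uuid_to_bytes(sample)
text_size = len(sample.encode("ascii"))  # canonical text form, hyphens included
binary_size = len(raw)                   # raw form, as in fixed_len_byte_array(16)
size_ratio = text_size / binary_size     # how much larger the text form is
```

With the canonical hyphenated form, the text is 36 bytes against 16 for the raw 
value, so the ratio is 2.25, which is presumably where the 225% figure in the 
report comes from.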

> Error: SYSTEM ERROR: RuntimeException: Unknown logical type <LogicalType 
> UUID:UUIDType()>
> -----------------------------------------------------------------------------------------
>
>                 Key: DRILL-7825
>                 URL: https://issues.apache.org/jira/browse/DRILL-7825
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>    Affects Versions: 1.17.0
>         Environment: Windows 10 single local node.
>            Reporter: ian
>            Priority: Major
>         Attachments: uuid.parquet
>
>
> Parquet logical type UUID fails on read.  The only workaround is to store the 
> values as text, at 225% of the binary size (36 characters versus 16 bytes).
> Here is the schema dump for the attached test parquet file.  I can read the 
> file okay from R and natively through C++.
> {code:java}
> 3961 $ parquet-dump-schema uuid.parquet
> required group field_id=0 schema {
>  required fixed_len_byte_array(16) field_id=1 uuid_req1 (UUID);
>  optional fixed_len_byte_array(16) field_id=2 uuid_opt1 (UUID);
>  required fixed_len_byte_array(16) field_id=3 uuid_req2 (UUID);
> }{code}
> I'm new. I set this to MAJOR after reading the severity definitions, but 
> will gladly defer to those who know better how to classify it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
