[ 
https://issues.apache.org/jira/browse/ARROW-12203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17374891#comment-17374891
 ] 

Jorge Leitão commented on ARROW-12203:
--------------------------------------

I am of the opinion that it is time to move on and make version 2.0 the 
default (leaving NANOS out for now).

For the data pages, I do not think there are that many differences between v1 
and v2, right? It is mostly where the compression is applied and where the byte 
lengths of the def and rep levels are declared (in the page data or in the 
header).

So, in that context, keeping data pages v1 by default seems OK.
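
A minimal sketch of the data page difference described above, in plain Python. 
It is a simplified illustration, not the actual Parquet encoding: the function 
names and the dict-based "header" are hypothetical, zlib stands in for whatever 
codec is configured, and the level bytes are treated as opaque. It only shows 
that v1 stores the level lengths inside the page data and compresses 
everything together, while v2 declares the lengths in the header and 
compresses only the values.

```python
import struct
import zlib


def build_page_v1(rep_levels: bytes, def_levels: bytes, values: bytes) -> dict:
    """v1-style layout: level lengths live inside the page data."""
    # Each level block gets a 4-byte little-endian length prefix in the body.
    body = (
        struct.pack("<I", len(rep_levels)) + rep_levels
        + struct.pack("<I", len(def_levels)) + def_levels
        + values
    )
    # Levels and values are compressed together as a single buffer.
    return {"header": {}, "data": zlib.compress(body)}


def build_page_v2(rep_levels: bytes, def_levels: bytes, values: bytes) -> dict:
    """v2-style layout: level lengths live in the page header."""
    header = {
        "rep_levels_byte_length": len(rep_levels),
        "def_levels_byte_length": len(def_levels),
    }
    # Levels stay uncompressed; only the values are compressed.
    return {"header": header, "data": rep_levels + def_levels + zlib.compress(values)}


def read_page_v1(page: dict) -> tuple:
    # The whole body must be decompressed before the levels can be located.
    body = zlib.decompress(page["data"])
    (rep_len,) = struct.unpack_from("<I", body, 0)
    pos = 4
    rep_levels = body[pos:pos + rep_len]
    pos += rep_len
    (def_len,) = struct.unpack_from("<I", body, pos)
    pos += 4
    def_levels = body[pos:pos + def_len]
    return rep_levels, def_levels, body[pos + def_len:]


def read_page_v2(page: dict) -> tuple:
    # The header says where the levels end, so they can be read
    # without decompressing anything.
    rep_len = page["header"]["rep_levels_byte_length"]
    def_len = page["header"]["def_levels_byte_length"]
    data = page["data"]
    rep_levels = data[:rep_len]
    def_levels = data[rep_len:rep_len + def_len]
    return rep_levels, def_levels, zlib.decompress(data[rep_len + def_len:])
```

Both layouts carry the same three pieces of information, which is why the 
choice of data page version is largely independent of the format version 
discussed in this issue.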


> [C++][Python] Switch default Parquet version to 2.0
> ---------------------------------------------------
>
>                 Key: ARROW-12203
>                 URL: https://issues.apache.org/jira/browse/ARROW-12203
>             Project: Apache Arrow
>          Issue Type: Wish
>          Components: C++, Python
>            Reporter: Antoine Pitrou
>            Priority: Major
>             Fix For: 6.0.0
>
>
> Currently, Parquet write APIs default to maximum-compatibility Parquet 
> version "1.0", which disables some logical types such as UINT32. We may want 
> to switch the default to "2.0" instead, to allow faithful representation of 
> more types.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
