[ 
https://issues.apache.org/jira/browse/IMPALA-7936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16890554#comment-16890554
 ] 

Alex Rodoni commented on IMPALA-7936:
-------------------------------------

[~boroknagyz] . [~lv] The PARQUET_READ_PAGE_INDEX query option is not 
documented. Does it need to be documented along with these 2 new query options? 
Or is it undocumented for a reason?

> Enable better control over Parquet writing
> ------------------------------------------
>
>                 Key: IMPALA-7936
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7936
>             Project: IMPALA
>          Issue Type: Improvement
>            Reporter: Zoltán Borók-Nagy
>            Assignee: Zoltán Borók-Nagy
>            Priority: Major
>             Fix For: Impala 3.3.0
>
>
> With the introduction of the Parquet page indexes it became desirable to have 
> more control over how Impala writes Parquet files.
> These configuration options (probably implemented as query options) would be:
>  * enable/disable Parquet page index writing (currently we can do it with a 
> command-line argument)
>  * set page-size limits based on row count
>  * -Set truncation length for statistics about string values-   (current 
> truncation length is 64, it is unlikely to have user data that needs longer 
> truncation than that)
> They'd enable writing more complete tests for page filtering. They'd be also 
> useful for fine-tuning the page index for better performance.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to