[jira] [Commented] (IMPALA-5842) Write page index in Parquet files
[ https://issues.apache.org/jira/browse/IMPALA-5842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16480162#comment-16480162 ]

ASF subversion and git services commented on IMPALA-5842:
----------------------------------------------------------

Commit 5f9641043aed8590cad37f003921c462cda934af in impala's branch refs/heads/2.x from [~boroknagyz]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=5f96410 ]

IMPALA-5842: Write page index in Parquet files

This commit builds on the previous work of
Pooja Nilangekar: https://gerrit.cloudera.org/#/c/7464/

The commit implements the write path of PARQUET-922:
"Add column indexes to parquet.thrift". As specified in the
parquet-format, Impala writes the page indexes just before
the footer. This allows much more efficient page filtering
than using the same information from the 'statistics' field
of DataPageHeader.

I updated Pooja's python tests as well.

Change-Id: Icbacf7fe3b7672e3ce719261ecef445b16f8dec9
Reviewed-on: http://gerrit.cloudera.org:8080/9693
Reviewed-by: Zoltan Borok-Nagy
Tested-by: Impala Public Jenkins

> Write page index in Parquet files
> ---------------------------------
>
>                 Key: IMPALA-5842
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5842
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Backend
>    Affects Versions: Impala 2.10.0
>            Reporter: Lars Volker
>            Assignee: Zoltán Borók-Nagy
>            Priority: Critical
>              Labels: parquet
>
> Once PARQUET-922 has been resolved, we should start writing page indices to
> Parquet files.

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org
[jira] [Commented] (IMPALA-5842) Write page index in Parquet files
[ https://issues.apache.org/jira/browse/IMPALA-5842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16479803#comment-16479803 ]

ASF subversion and git services commented on IMPALA-5842:
----------------------------------------------------------

Commit ccf19f9f8f2914639b6997849a56c13cfd2399b8 in impala's branch refs/heads/master from [~boroknagyz]
[ https://git-wip-us.apache.org/repos/asf?p=impala.git;h=ccf19f9 ]

IMPALA-5842: Write page index in Parquet files

This commit builds on the previous work of
Pooja Nilangekar: https://gerrit.cloudera.org/#/c/7464/

The commit implements the write path of PARQUET-922:
"Add column indexes to parquet.thrift". As specified in the
parquet-format, Impala writes the page indexes just before
the footer. This allows much more efficient page filtering
than using the same information from the 'statistics' field
of DataPageHeader.

I updated Pooja's python tests as well.

Change-Id: Icbacf7fe3b7672e3ce719261ecef445b16f8dec9
Reviewed-on: http://gerrit.cloudera.org:8080/9693
Reviewed-by: Zoltan Borok-Nagy
Tested-by: Impala Public Jenkins

> Write page index in Parquet files
> ---------------------------------
>
>                 Key: IMPALA-5842
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5842
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Backend
>    Affects Versions: Impala 2.10.0
>            Reporter: Lars Volker
>            Assignee: Zoltán Borók-Nagy
>            Priority: Critical
>              Labels: parquet
>
> Once PARQUET-922 has been resolved, we should start writing page indices to
> Parquet files.
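
For readers unfamiliar with why a page index helps filtering, here is a minimal sketch of the idea, not Impala's implementation. It assumes a simplified in-memory ColumnIndex whose per-page min/max values are already decoded to integers (in parquet.thrift they are stored as binary), and the helper name pages_matching_range is hypothetical. With the per-page bounds collected in one place before the footer, a reader can decide which pages to fetch without touching each DataPageHeader.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ColumnIndex:
    """Simplified stand-in for the per-column-chunk ColumnIndex struct.

    Assumption: values are plain ints here; the real format stores
    binary-encoded min/max values.
    """
    null_pages: List[bool]  # True if the page contains only nulls
    min_values: List[int]   # per-page minimum
    max_values: List[int]   # per-page maximum


def pages_matching_range(index: ColumnIndex, lo: int, hi: int) -> List[int]:
    """Return ordinals of pages whose [min, max] overlaps [lo, hi].

    Hypothetical helper: pages outside the range, and all-null pages,
    can be skipped without being read.
    """
    selected = []
    bounds = zip(index.null_pages, index.min_values, index.max_values)
    for page, (all_null, page_min, page_max) in enumerate(bounds):
        if all_null:
            continue  # no non-null value can satisfy the predicate
        if page_max >= lo and page_min <= hi:
            selected.append(page)
    return selected


# Four pages of one column chunk; page 2 holds only nulls.
index = ColumnIndex(
    null_pages=[False, False, True, False],
    min_values=[1, 50, 0, 120],
    max_values=[40, 110, 0, 300],
)
print(pages_matching_range(index, 100, 150))  # [1, 3]
```

A predicate such as "x BETWEEN 100 AND 150" would then only need pages 1 and 3 of this chunk.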