[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet
[ https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544018#comment-14544018 ] Sergio Peña commented on HIVE-10256: The patch is failing with the following error: {noformat} [ERROR] Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.1:testCompile (default-testCompile) on project hive-exec: Compilation failure [ERROR] /opt/local/hive/commit/hive/ql/src/test/org/apache/hadoop/hive/ql/io/parquet/TestParquetBlockElimination.java:[60,17] cannot find symbol [ERROR] symbol: method getObjectInspector(java.lang.String,java.lang.String) [ERROR] location: class org.apache.hadoop.hive.ql.io.parquet.TestParquetBlockElimination [ERROR] - [Help 1] {noformat} Filter row groups based on the block statistics in Parquet -- Key: HIVE-10256 URL: https://issues.apache.org/jira/browse/HIVE-10256 Project: Hive Issue Type: Sub-task Reporter: Dong Chen Assignee: Dong Chen Attachments: HIVE-10256-parquet.1.patch, HIVE-10256-parquet.patch In Parquet PPD, the not matched row groups should be eliminated. See {{TestOrcSplitElimination}} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet
[ https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14538010#comment-14538010 ] Sergio Peña commented on HIVE-10256: Looks good. +1 Filter row groups based on the block statistics in Parquet -- Key: HIVE-10256 URL: https://issues.apache.org/jira/browse/HIVE-10256 Project: Hive Issue Type: Sub-task Reporter: Dong Chen Assignee: Dong Chen Attachments: HIVE-10256-parquet.1.patch, HIVE-10256-parquet.patch In Parquet PPD, the not matched row groups should be eliminated. See {{TestOrcSplitElimination}} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet
[ https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14534715#comment-14534715 ] Sergio Peña commented on HIVE-10256: Is this method name correct {{recordReader.getFiltedBlocks()}} ? Isn't getFilteredBlocks? Filter row groups based on the block statistics in Parquet -- Key: HIVE-10256 URL: https://issues.apache.org/jira/browse/HIVE-10256 Project: Hive Issue Type: Sub-task Reporter: Dong Chen Assignee: Dong Chen Attachments: HIVE-10256-parquet.patch In Parquet PPD, the not matched row groups should be eliminated. See {{TestOrcSplitElimination}} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet
[ https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531891#comment-14531891 ] Dong Chen commented on HIVE-10256: -- After HIVE-10252 was fixed, the row groups can be filtered. So this patch has no code change and add 2 tests for regression. Tests passed and no bugs were found. The tests ensure: 1. multiple row groups can be eliminated, if satisfying predicate. 2. combined expression predicate can work on filtering row groups. Filter row groups based on the block statistics in Parquet -- Key: HIVE-10256 URL: https://issues.apache.org/jira/browse/HIVE-10256 Project: Hive Issue Type: Sub-task Reporter: Dong Chen Assignee: Dong Chen Attachments: HIVE-10256-parquet.patch In Parquet PPD, the not matched row groups should be eliminated. See {{TestOrcSplitElimination}} -- This message was sent by Atlassian JIRA (v6.3.4#6332)