[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet

2015-05-14 Thread JIRA

[ 
https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14544018#comment-14544018
 ] 

Sergio Peña commented on HIVE-10256:


The patch is failing with the following error:

{noformat}
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.1:testCompile 
(default-testCompile) on project hive-exec: Compilation failure
[ERROR] 
/opt/local/hive/commit/hive/ql/src/test/org/apache/hadoop/hive/ql/io/parquet/TestParquetBlockElimination.java:[60,17]
 cannot find symbol
[ERROR] symbol:   method getObjectInspector(java.lang.String,java.lang.String)
[ERROR] location: class 
org.apache.hadoop.hive.ql.io.parquet.TestParquetBlockElimination
[ERROR] - [Help 1]
{noformat}

 Filter row groups based on the block statistics in Parquet
 --

 Key: HIVE-10256
 URL: https://issues.apache.org/jira/browse/HIVE-10256
 Project: Hive
  Issue Type: Sub-task
Reporter: Dong Chen
Assignee: Dong Chen
 Attachments: HIVE-10256-parquet.1.patch, HIVE-10256-parquet.patch


 In Parquet PPD, the not matched row groups should be eliminated. See 
 {{TestOrcSplitElimination}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet

2015-05-11 Thread JIRA

[ 
https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14538010#comment-14538010
 ] 

Sergio Peña commented on HIVE-10256:


Looks good.
+1

 Filter row groups based on the block statistics in Parquet
 --

 Key: HIVE-10256
 URL: https://issues.apache.org/jira/browse/HIVE-10256
 Project: Hive
  Issue Type: Sub-task
Reporter: Dong Chen
Assignee: Dong Chen
 Attachments: HIVE-10256-parquet.1.patch, HIVE-10256-parquet.patch


 In Parquet PPD, the not matched row groups should be eliminated. See 
 {{TestOrcSplitElimination}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet

2015-05-08 Thread JIRA

[ 
https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14534715#comment-14534715
 ] 

Sergio Peña commented on HIVE-10256:


Is this method name correct {{recordReader.getFiltedBlocks()}} ?  Isn't 
getFilteredBlocks?

 Filter row groups based on the block statistics in Parquet
 --

 Key: HIVE-10256
 URL: https://issues.apache.org/jira/browse/HIVE-10256
 Project: Hive
  Issue Type: Sub-task
Reporter: Dong Chen
Assignee: Dong Chen
 Attachments: HIVE-10256-parquet.patch


 In Parquet PPD, the not matched row groups should be eliminated. See 
 {{TestOrcSplitElimination}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (HIVE-10256) Filter row groups based on the block statistics in Parquet

2015-05-06 Thread Dong Chen (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-10256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14531891#comment-14531891
 ] 

Dong Chen commented on HIVE-10256:
--

After HIVE-10252 was fixed, the row groups can be filtered. So this patch has 
no code change and add 2 tests for regression. Tests passed and no bugs were 
found.
The tests ensure: 1. multiple row groups can be eliminated, if satisfying 
predicate. 2. combined expression predicate can work on filtering row groups.

 Filter row groups based on the block statistics in Parquet
 --

 Key: HIVE-10256
 URL: https://issues.apache.org/jira/browse/HIVE-10256
 Project: Hive
  Issue Type: Sub-task
Reporter: Dong Chen
Assignee: Dong Chen
 Attachments: HIVE-10256-parquet.patch


 In Parquet PPD, the not matched row groups should be eliminated. See 
 {{TestOrcSplitElimination}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)