David Mollitor created HIVE-24833:
-------------------------------------

             Summary: Hive Should Only Pushdown EQ Predicate on 
HBaseStorageHandler
                 Key: HIVE-24833
                 URL: https://issues.apache.org/jira/browse/HIVE-24833
             Project: Hive
          Issue Type: Improvement
            Reporter: David Mollitor


I believe that a Hive query with an HBase Storage Handler is incorrectly 
applies a predicate pushdown into the storage handler.

I observed a FETCH optimization that took a long time to complete because it 
was performing a table scan across the entire HBase table.

The only case in which a predicate should be pushed down the storage layer is 
for

`SELECT * FROM TABLE my_hbase_table WHERE row_key=?`

This would be appropriate (EQ on the row key).  Anything else will involve a 
scan of the table and there is no way to easily calculate how small a scan it 
will require and therefore should always be passed to the compute engine (Tez).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to