[ 
https://issues.apache.org/jira/browse/HIVE-1643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12977472#action_12977472
 ] 

John Sichi commented on HIVE-1643:
----------------------------------

Notes for working on this:

Background is in

http://wiki.apache.org/hadoop/Hive/FilterPushdownDev

* In HiveHBaseTableInputFormat, newIndexPredicateAnalyzer needs to add 
additional operators (and stop restricting the allowed column names).  And then 
convertFilter needs to set up corresponding HBase filter conditions based on 
the predicates it finds.  Note that for inequality conditions on the key, it's 
necessary to muck with startRow/stopRow (not just the filter evaluator).

* See also the comment in HBaseStorageHandler.decomposePredicate.  Currently, 
it can only accept a single predicate.  If you want to be able to support AND 
of multiple predicates (using HBase's FilterList) then this will need to be 
relaxed.

* Beware of the fact that until HIVE-1538 gets committed, it is more difficult 
to make sure that the HBase-level filtering is working as expected.  The reason 
is that without HIVE-1538, a second copy of the filter gets applied within Hive 
(regardless of how the filter was decomposed when it was pushed down to HBase). 
 So even if HBase doesn't filter out everything you're expecting it to, you 
won't notice in the results since Hive will do the filtering again.


> support range scans and non-key columns in HBase filter pushdown
> ----------------------------------------------------------------
>
>                 Key: HIVE-1643
>                 URL: https://issues.apache.org/jira/browse/HIVE-1643
>             Project: Hive
>          Issue Type: Improvement
>          Components: HBase Handler
>    Affects Versions: 0.7.0
>            Reporter: John Sichi
>            Assignee: John Sichi
>             Fix For: 0.7.0
>
>
> HIVE-1226 added support for WHERE rowkey=3.  We would like to support WHERE 
> rowkey BETWEEN 10 and 20, as well as predicates on non-rowkeys (plus 
> conjunctions etc).  Non-rowkey conditions can't be used to filter out entire 
> ranges, but they can be used to push the per-row filter processing as far 
> down as possible.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to