[
https://issues.apache.org/jira/browse/HBASE-28622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Istvan Toth updated HBASE-28622:
--------------------------------
Description:
org.apache.hadoop.hbase.filter.FilterListWithAND.filterRowKey(Cell) will return
true if ANY of the filters returns true for Filter#filterRowKey().
However, the SEEK_NEXT_USING_HINT mechanism relies on filterRowKey() returning
false, so that filterCell() can return SEEK_NEXT_USING_HINT.
If none of the filters matches, but one of them returns true for
filterRowKey(), then the filter(s) that returned false, so that they can
return SEEK_NEXT_USING_HINT in filterCell() never get a chance to return
SEEK_NEXT_USING_HINT, and instead of seeking to the hint, FilterListWithAND
will do very slow full scan.
was:
org.apache.hadoop.hbase.filter.FilterListWithAND.filterRowKey(Cell) will return
true if ANY of the filters returns true for Filter#filterRowKey().
However, the SEEK_NEXT_USING_HINT mechanism relies on filterRowKey() returning
false, so that filterCell() can return SEEK_NEXT_USING_HINT.
If none of the filters matches, but one of them returns true for
filterRowKey(), then the filter(s) that returned to false, so that they can
return SEEK_NEXT_USING_HINT in filterCell() never get a chance to return
SEEK_NEXT_USING_HINT, and instead of seeking, FilterListWithAND will do very
slow full scan.
> FilterListWithAND can swallow SEEK_NEXT_USING_HINT
> --------------------------------------------------
>
> Key: HBASE-28622
> URL: https://issues.apache.org/jira/browse/HBASE-28622
> Project: HBase
> Issue Type: Bug
> Components: Filters
> Reporter: Istvan Toth
> Assignee: Istvan Toth
> Priority: Major
>
> org.apache.hadoop.hbase.filter.FilterListWithAND.filterRowKey(Cell) will
> return true if ANY of the filters returns true for Filter#filterRowKey().
> However, the SEEK_NEXT_USING_HINT mechanism relies on filterRowKey()
> returning false, so that filterCell() can return SEEK_NEXT_USING_HINT.
> If none of the filters matches, but one of them returns true for
> filterRowKey(), then the filter(s) that returned false, so that they can
> return SEEK_NEXT_USING_HINT in filterCell() never get a chance to return
> SEEK_NEXT_USING_HINT, and instead of seeking to the hint, FilterListWithAND
> will do very slow full scan.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)