[
https://issues.apache.org/jira/browse/HBASE-18165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16037784#comment-16037784
]
Lars Hofhansl commented on HBASE-18165:
---------------------------------------
I was thinking Filters indeed. We already have string-to-filter converters for
the HBase shell. Could "simply" store those in an HBase table and read before
compactions.
Here's another crazy idea: An HFileReader that produces Cells arithmetically.
Now one could potentially write some metadata into a file and have an
HFileReader that produces (say) the delete marker you want - would need to
insert one of these for each affected region. Now everything would just work,
HBase merge-sort would do the right thing.
> Predicate based deletion during major compactions
> -------------------------------------------------
>
> Key: HBASE-18165
> URL: https://issues.apache.org/jira/browse/HBASE-18165
> Project: HBase
> Issue Type: Brainstorming
> Reporter: Lars Hofhansl
>
> In many cases it is expensive to place a delete per version, column, or
> family.
> HBase should have way to specify a predicate and remove all Cells matching
> the predicate during the next compactions (major and minor).
> Nothing more concrete. The tricky part would be to know when it is safe to
> remove the predicate, i.e. when we can be sure that all Cells matching the
> predicate actually have been removed.
> Could potentially use HBASE-12859 for that.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)