And for any of the cases - if not, then why not? Because that hasn't been implemented, or there's an actual reason that HBase would not want to do it? With reads for a custom time range, it's possible to still read data that is waiting to be GCed from one of the above mechanisms and will disappear after that happens. Doing the GC during minor compactions as well as major ones would change that visibility window, but doesn't seem to change that odd behavior that is there to begin with.
On Wed, Jun 14, 2017 at 5:51 PM, Dave Latham <[email protected]> wrote: > What cells, if any, are removed during minor compactions? > > Cells that > (a) are beyond the TTL? > (b) are shadowed by a delete marker? (from the files compacted) > (c) are shadowed by newer versions? (assuming numVersions configured < num > versions of the cell found) >
