[ 
https://issues.apache.org/jira/browse/HBASE-15368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15176063#comment-15176063
 ] 

Dave Latham commented on HBASE-15368:
-------------------------------------

This is still not going to work well.  Using the min timestamp does not fix it.

With relative windows, there's nothing to prevent a fully compacted HFile for 
one window from sliding across the window boundary, being compacted into the 
neighboring window, and then quickly sliding on to the next one.  There would 
be no reliable behavior of actual data staying partitioned in a well-defined 
way.  For example, suppose you have windows of 0.2 days, 1 day, 5 days, and 
25 days, each with a single HFile containing well-patterned data spread across 
most of its window.  A few seconds later the 0.2 day HFile could be in the 
1 day window and be compacted with the HFile there; the resulting HFile could 
then slide to the 5 day window and be compacted, and then to the 25 day 
window.  Now all of your data has quickly wound up in a single window and a 
single HFile.  This is just one example of how the relative windows could go 
wrong.  To make it work you would probably need to control more rigidly which 
files within a window are allowed to be compacted together.  I'm not sure a 
reasonable scheme exists that doesn't wind up approximating the fixed windows 
anyway.
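
To make the cascade concrete, here is a rough toy simulation of the example 
above.  It is only a sketch and is not HBase code: the names (WINDOW_BOUNDS, 
window_of, compact, simulate), the choice of minutes as the time unit, the 
starting file ages, and the rule "merge every file that shares a window" are 
all assumptions made for illustration.  A file is placed in a window by the 
age of its oldest cell (its min timestamp), and no new data arrives.

```python
# Toy model, not HBase code. All names and numbers are illustrative assumptions.
# Upper age bounds of the relative windows, in minutes: 0.2, 1, 5 and 25 days.
WINDOW_BOUNDS = [288, 1440, 7200, 36000]


def window_of(min_ts, now):
    """Index of the relative window a file falls in, by age of its oldest cell."""
    age = now - min_ts
    for i, bound in enumerate(WINDOW_BOUNDS):
        if age < bound:
            return i
    return len(WINDOW_BOUNDS)  # older than every window


def compact(files, now):
    """Merge all files sharing a window into one (min_ts, max_ts) file."""
    by_window = {}
    for min_ts, max_ts in files:
        w = window_of(min_ts, now)
        if w in by_window:
            old_min, old_max = by_window[w]
            by_window[w] = (min(old_min, min_ts), max(old_max, max_ts))
        else:
            by_window[w] = (min_ts, max_ts)
    return sorted(by_window.values())


def simulate(files, start, minutes):
    """Run one compaction pass per minute; record (elapsed, file count)."""
    history = []
    for now in range(start, start + minutes + 1):
        files = compact(files, now)
        history.append((now - start, len(files)))
    return files, history


START = 100_000
# One HFile per window, each covering most of its window at time START.
initial = [
    (START - 287, START),            # 0.2 day window, about to cross over
    (START - 1400, START - 288),     # 1 day window
    (START - 7000, START - 1440),    # 5 day window
    (START - 30000, START - 7200),   # 25 day window
]

final_files, history = simulate(initial, START, 200)
# Keep only the moments at which the number of files changed.
drops = [history[0]] + [
    cur for prev, cur in zip(history, history[1:]) if cur[1] != prev[1]
]
print(drops)        # [(0, 4), (1, 3), (40, 2), (200, 1)]
print(final_files)  # [(70000, 100000)]
```

In this toy run the four files collapse into one within 200 minutes, even 
though the data spans almost 21 days.  Each merged file inherits the oldest 
timestamp of its inputs, so it sits close to the next boundary and crosses it 
soon after.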



> Add relative window support
> ---------------------------
>
>                 Key: HBASE-15368
>                 URL: https://issues.apache.org/jira/browse/HBASE-15368
>             Project: HBase
>          Issue Type: Sub-task
>          Components: Compaction
>            Reporter: Duo Zhang
>            Assignee: Duo Zhang
>         Attachments: HBASE-15368-v1.patch, HBASE-15368.patch
>
>
> To better determine 'hot' data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
