[ 
https://issues.apache.org/jira/browse/HBASE-14468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-14468:
--------------------------------------
    Description: 
FIFO compaction policy selects only files which have all cells expired. The 
column family MUST have non-default TTL. One of the use cases for this policy 
is when we need to store raw data which will be post-processed later and 
discarded completely after quite short period of time. Raw time-series vs. 
time-based rollup aggregates and compacted time-series. We collect raw 
time-series and store them into CF with FIFO compaction policy, periodically we 
run  task which creates rollup aggregates and compacts time-series, the 
original raw data can be discarded after that.

See: https://github.com/facebook/rocksdb/wiki/FIFO-compaction-style

  was:FIFO compaction policy selects only files which have all cells expired. 
The column family MUST have non-default TTL. One of the use cases for this 
policy is when we need to store raw data which will be post-processed later and 
discarded completely after quite short period of time. Raw time-series vs. 
time-based rollup aggregates and compacted time-series. We collect raw 
time-series and store them into CF with FIFO compaction policy, periodically we 
run  task which creates rollup aggregates and compacts time-series, the 
original raw data can be discarded after that.


> Compaction improvements: FIFO compaction policy
> -----------------------------------------------
>
>                 Key: HBASE-14468
>                 URL: https://issues.apache.org/jira/browse/HBASE-14468
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>             Fix For: 2.0.0
>
>
> FIFO compaction policy selects only files which have all cells expired. The 
> column family MUST have non-default TTL. One of the use cases for this policy 
> is when we need to store raw data which will be post-processed later and 
> discarded completely after quite short period of time. Raw time-series vs. 
> time-based rollup aggregates and compacted time-series. We collect raw 
> time-series and store them into CF with FIFO compaction policy, periodically 
> we run  task which creates rollup aggregates and compacts time-series, the 
> original raw data can be discarded after that.
> See: https://github.com/facebook/rocksdb/wiki/FIFO-compaction-style



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to