[ 
https://issues.apache.org/jira/browse/KYLIN-750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14558061#comment-14558061
 ] 

Shaofeng SHI commented on KYLIN-750:
------------------------------------

The first version has been developed (see MergeCuboidFromHBaseJob.java in 
branch 0.8.0); While to gain better performance and minimal the impact to HBase 
server, we need seek the way to run MR over snapshot files; see 
https://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/TableSnapshotInputFormat.html

Currently hbase doesn't support running MR over multiple snapshots; there is an 
open JIRA with it: 
https://issues.apache.org/jira/browse/HBASE-13356 

We need figure out how to do the cube merge with this way.

> Merge cube segments from HBase table
> ------------------------------------
>
>                 Key: KYLIN-750
>                 URL: https://issues.apache.org/jira/browse/KYLIN-750
>             Project: Kylin
>          Issue Type: Sub-task
>          Components: Job Engine
>    Affects Versions: v0.8.1
>            Reporter: Shaofeng SHI
>            Assignee: Shaofeng SHI
>             Fix For: v0.8.1
>
>
> With the new cubing algorithm, there is no intermediate cuboid files 
> persisted, so when merge the cube segments, need read the data from HBase 
> table directly.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to