[ 
https://issues.apache.org/jira/browse/HUDI-1399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17250765#comment-17250765
 ] 

liwei commented on HUDI-1399:
-----------------------------

[~vinoth] Is the code freeze on Dec 31?

As with asynchronous compaction, there are four options for running clustering:
1. Option one: inline clustering in Spark.
A base implementation exists in https://github.com/apache/hudi/pull/2263/files, 
but it does not yet support running in Spark. [~satishkotha] 
2. Option two: an independent Spark job that runs clustering asynchronously, 
just like HoodieCompactor.
3. Option three: clustering support in the Hudi CLI.
4. Option four: clustering support in DeltaStreamer continuous mode.

For functional coverage, I think we can support options one and two first.
Since https://github.com/apache/hudi/pull/2263/files has not been merged yet, 
I can land these two on the satishkotha:sk/clustering branch. I plan to work on 
it this weekend and submit a PR next week. [~vinoth] What do you think? Does my 
plan conflict with yours, [~satishkotha]? cc [~nagarwal] 

> support clustering operation can run asynchronously
> ---------------------------------------------------
>
>                 Key: HUDI-1399
>                 URL: https://issues.apache.org/jira/browse/HUDI-1399
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: liwei
>            Assignee: liwei
>            Priority: Blocker
>             Fix For: 0.7.0
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)
