[
https://issues.apache.org/jira/browse/HUDI-538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17019265#comment-17019265
]
vinoyang edited comment on HUDI-538 at 1/20/20 7:14 AM:
--------------------------------------------------------
[~vinoth] OK, another thing we may need to consider. Based on our discussion,
we agreed on put {{hudi-utilities}} aside. However, for both Flink and Spark,
they follow {{source -> transform -> sink}} mode. Currently, the sources host
in {{hudi-utilities}} package and they are not Spark-free. So, it seems we also
need to consider it. WDYT?
was (Author: yanghua):
[~vinoth] OK, another thing we may need to consider. Based on our discussion,
we agreed on put {{hudi-utilities}} aside. However, for both Flink and Spark,
they observe {{source -> transform -> sink}} mode. Currently, the sources host
in {{hudi-utilities}} package and they are not Spark-free. So, it seems we also
need to consider it. WDYT?
> Restructuring hudi client module for multi engine support
> ---------------------------------------------------------
>
> Key: HUDI-538
> URL: https://issues.apache.org/jira/browse/HUDI-538
> Project: Apache Hudi (incubating)
> Issue Type: Wish
> Components: Code Cleanup
> Reporter: vinoyang
> Priority: Major
>
> Hudi is currently tightly coupled with the Spark framework. It caused the
> integration with other computing engine more difficult. We plan to decouple
> it with Spark. This umbrella issue used to track this work.
> Some thoughts wrote here:
> https://docs.google.com/document/d/1Q9w_4K6xzGbUrtTS0gAlzNYOmRXjzNUdbbe0q59PX9w/edit?usp=sharing
> The feature branch is {{restructure-hudi-client}}.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)