[jira] [Commented] (MAHOUT-1570) Adding support for Apache Flink as a backend for the Mahout DSL

Dmitriy Lyubimov (JIRA) Tue, 09 Jun 2015 10:22:02 -0700

    [ 
https://issues.apache.org/jira/browse/MAHOUT-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14579260#comment-14579260
 ]


Dmitriy Lyubimov commented on MAHOUT-1570:
------------------------------------------

I would like to emphasize that this work has all of us in Mahout really 
excited. 

Yes, given the current state of the code, I personally would encourage you to 
open a PR.

I think [~ssc] is missing the point that a PR is a tool for collaboration, not 
just a "request to commit". Making suggestions on a PR is no different from 
making suggestions on Jira, except that it is more powerful since one can 
"point and elaborate" on the code. Nobody will rush to merge your work before 
you say so, but a PR enables collaboration on the issue to a greater extent 
than a published GitHub branch or Jira comments alone, IMO.

Ultimately, it is up to you whether you want to limit the scope of external 
input and collaboration before you present things to the public. This is a 
totally valid approach if you want to stay 100% focused on your work. 

However, since you have published some ongoing work on this Jira, it seems you 
are at least somewhat ready to engage with reviews and suggestions. If so, a PR 
would facilitate that to a greater extent than this Jira. 

Another reason to open a PR is that, if we eventually all agree that something 
needs to be changed, it is less work to change it while things are still fluid 
than after everything is nailed down.


> Adding support for Apache Flink as a backend for the Mahout DSL
> ---------------------------------------------------------------
>
>                 Key: MAHOUT-1570
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1570
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Till Rohrmann
>            Assignee: Suneel Marthi
>              Labels: DSL, flink, scala
>
> With the finalized abstraction of the Mahout DSL plans from the backend 
> operations (MAHOUT-1529), it should be possible to integrate further backends 
> for the Mahout DSL. Apache Flink would be a suitable candidate to act as a 
> good execution backend. 
> With respect to the implementation, the biggest difference between Spark and 
> Flink at the moment is probably the incremental rollout of plans, which is 
> triggered by Spark's actions and which is not supported by Flink yet. 
> However, the Flink community is working on this issue. For the moment, it 
> should be possible to circumvent this problem by writing intermediate results 
> required by an action to HDFS and reading from there.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
