Till Rohrmann created MAHOUT-1570:
-------------------------------------
Summary: Adding support for Stratosphere as a backend for the
Mahout DSL
Key: MAHOUT-1570
URL: https://issues.apache.org/jira/browse/MAHOUT-1570
Project: Mahout
Issue Type: Improvement
Reporter: Till Rohrmann
With the finalized abstraction of logical Mahout DSL plans from the backend
operations (MAHOUT-1529), it should be possible to integrate further backends
for the Mahout DSL.
I would like to evaluate to what extent this can already be done for
Stratosphere and what can be done to solve any problems that occur.
The biggest difference between Spark and Stratosphere at the moment is probably
the incremental rollout of plans, which is triggered by Spark's actions and
which is not supported by Stratosphere yet. However, the Stratosphere team is
working on this issue. For the moment, it should be possible to circumvent this
problem by writing intermediate results required by an action to HDFS and
reading from there.
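
A minimal sketch of this workaround, not tied to the Mahout or Stratosphere
APIs: the Plan class and the checkpoint and collect helpers are hypothetical
names introduced for illustration, and a local temporary directory stands in
for HDFS. An action executes the whole plan as one batch job, persists the
result, and hands back a new plan that starts from the stored data.

```python
import json
import os
import tempfile


class Plan:
    """Hypothetical lazy logical plan: a source plus a chain of row-wise operators."""

    def __init__(self, source, ops=()):
        self.source = source      # zero-argument callable returning a list of rows
        self.ops = tuple(ops)     # functions applied to every row, in order

    def map(self, fn):
        # Extends the plan without executing anything.
        return Plan(self.source, self.ops + (fn,))

    def execute(self):
        # Runs the whole plan in one go, as a batch-only backend would.
        rows = self.source()
        for fn in self.ops:
            rows = [fn(r) for r in rows]
        return rows


def checkpoint(plan, directory, name):
    """Execute `plan`, persist its result, and return a plan that reads it back."""
    path = os.path.join(directory, name + ".json")
    with open(path, "w") as f:
        json.dump(plan.execute(), f)

    def read_back():
        with open(path) as f:
            return json.load(f)

    return Plan(read_back)


def collect(plan, directory, name):
    """Hypothetical action: materialize the plan, return (result, continuation plan)."""
    materialized = checkpoint(plan, directory, name)
    return materialized.execute(), materialized


workdir = tempfile.mkdtemp()  # assumption: stand-in for an HDFS directory
base = Plan(lambda: [1, 2, 3]).map(lambda x: x * 2)
result, base = collect(base, workdir, "step1")    # the action forces execution
follow_up = base.map(lambda x: x + 1).execute()   # continues from the stored result
```

The cost of this approach is one full write and one full read of the
intermediate result per action, which is why it is only a stopgap until
incremental plan rollout is available.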
This work should therefore be considered a proof of concept rather than an
efficient implementation. Its purpose is to evaluate where the logical plan
abstraction might need to be refined in order to support different backends.
--
This message was sent by Atlassian JIRA
(v6.2#6252)