[ https://issues.apache.org/jira/browse/MAHOUT-1544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14001041#comment-14001041 ]

Anand Avati commented on MAHOUT-1544:
-------------------------------------

[~ssc] - I investigated how Spark achieves the task of exposing user-defined 
data types and closures to the cluster. It turns out Spark uses a heavily 
refactored and modified copy of the Scala 2.10 REPL. Those modifications are 
now part of Scala 2.11, in the form of two compiler options: 
-Yrepl-class-based and -Yrepl-outdir. So if Mahout were to use Scala 2.11 
(which in turn requires Spark to be available for Scala 2.11), the proposal 
in this JIRA could be achieved with much more simplicity. (It also turns out 
that Spark's own REPL can be made much simpler once it moves to Scala 2.11, 
for the same reasons.) Right now I'm working on getting Spark's dependencies 
available for Scala 2.11.
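
For reference, here is a minimal sketch (not Mahout code) of what driving 
the stock Scala 2.11 REPL with those two options might look like; the 
object name and the output directory below are just placeholders:

    import scala.tools.nsc.GenericRunnerSettings
    import scala.tools.nsc.interpreter.ILoop

    object ReplSketch {
      def main(args: Array[String]): Unit = {
        val settings = new GenericRunnerSettings(msg => sys.error(msg))
        // -Yrepl-class-based wraps each REPL line in a class instead of an
        // object, so serialized closures capture only what they reference.
        // -Yrepl-outdir writes the generated wrapper classes to a real
        // directory, from which a backend can ship them to remote workers.
        settings.processArguments(
          List("-Yrepl-class-based", "-Yrepl-outdir", "/tmp/repl-classes"),
          processAll = true)
        settings.usejavacp.value = true  // reuse the launching JVM classpath
        new ILoop().process(settings)
      }
    }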

So in any case, this proposal can be achieved at the earliest with Spark 1.1 
(the Spark devs are considering Scala 2.11 support for that release). To do 
it any earlier, we would need to inherit a refactored and modified copy of 
the Scala REPL, just like Spark does - which is just not worth the effort.

What's the right "process" here now? Should we close this JIRA, or leave it 
open at a lower priority?

> make Mahout DSL shell depend dynamically on Spark
> -------------------------------------------------
>
>                 Key: MAHOUT-1544
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1544
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Anand Avati
>             Fix For: 1.0
>
>         Attachments: 0001-spark-shell-rename-to-shell.patch, 
> 0002-shell-make-dependency-on-Spark-optional-and-dynamic.patch
>
>
> Today Mahout's Scala shell depends on Spark.
> Create a cleaner separation between the shell and Spark. For example, the 
> in-core scalabindings and operators do not need Spark. So make Spark a 
> runtime "addon" to the shell. Similarly, in the future, new distributed 
> backend engines can transparently (dynamically) be made available through 
> the DSL shell (see the sketch after this quote).
> The new shell works, looks and feels exactly like the shell before, but 
> has a cleaner modular architecture.
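
To make the "runtime addon" idea above concrete, here is a rough sketch of 
dynamic backend discovery. Every name in it (DistributedBackend, 
BackendLoader, the Spark binding class path) is invented for illustration 
and is not necessarily what the attached patches do:

    // Hypothetical names throughout; this only illustrates loading an
    // optional backend reflectively so the shell itself need not link Spark.
    trait DistributedBackend {
      def name: String
    }

    object BackendLoader {
      // Returns the backend if its classes are on the classpath, else None.
      def load(className: String): Option[DistributedBackend] =
        try Some(Class.forName(className).newInstance()
                   .asInstanceOf[DistributedBackend])
        catch { case _: ClassNotFoundException => None }

      def main(args: Array[String]): Unit =
        load("org.apache.mahout.sparkbindings.shell.SparkBackend") match {
          case Some(b) => println(s"Loaded distributed backend: ${b.name}")
          case None    => println("No backend on classpath; in-core only.")
        }
    }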


