I would rather see a matrix that looks local but acts global, so that coders can write very simple code that is still parallelized.

On Apr 1, 2014, at 11:09, "Anand Avati (JIRA)" <[email protected]> wrote:

> [ https://issues.apache.org/jira/browse/MAHOUT-1500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13956283#comment-13956283 ]
>
> Anand Avati commented on MAHOUT-1500:
> -------------------------------------
>
> Thanks for your feedback, Dmitry.
>
> Now it seems to me (with my limited exploring of Mahout) that it might
> actually be viable to provide a "hadoop alternative" in the form of an
> alternate implementation of DistributedRowMatrix (instead of AbstractMatrix)
> and AbstractJob (by internally using h2o's Frame/Vec and MRTask2 APIs), and
> thereby allow for a runtime choice of Hadoop vs H2O. This seems like a
> reasonable first step?
>
>> H2O integration
>> ---------------
>>
>> Key: MAHOUT-1500
>> URL: https://issues.apache.org/jira/browse/MAHOUT-1500
>> Project: Mahout
>> Issue Type: Improvement
>> Reporter: Anand Avati
>> Fix For: 1.0
>>
>>
>> Integration with h2o (github.com/0xdata/h2o) in order to exploit its high
>> performance computational abilities.
>> Start with providing implementations of AbstractMatrix and AbstractVector,
>> and more as we make progress.
