Mahout would have CSR support in the upcoming release as well as hybrid
GPU/CPU execution depending on the work load.

This discussion is better moved to dev@mahout, since the Mahout project has
long abstracted out Spark, Flink and H2O for distributed linear algebra;
and we are now adding swappable Math backend - ViennaCL, cublas, etc..




On Wed, Jan 11, 2017 at 3:49 PM, Kam Kasravi <kamkasr...@gmail.com> wrote:

> Thanks Andrew -
>
> Since you're quite familiar with how Mahout backends (flink, spark, h20)
> bind and enable DRM and the API Mahout/Samsara exposes -
> I think the end goal would be to surface a JAVA/Python API as well as
> outline a declarative syntax that the various runners can adhere to
> (perhaps via some kind of execution plan).
>
> This would be added to the current design document within a 'low-level /
> linear algebra' section.
>
> What are your thoughts (and Mahout's thoughts) on SystemML's approach to
> DRM and Declarative Machine Learning (DML) in general?
> http://www.vldb.org/pvldb/vol9/p960-elgohary.pdf
> Does Mahout plan on supporting SystemML compressed linear algebra (CLA)?
>
> Would you be willing to help (along with others including Vladisav) in
> providing a common ML design document for Beam?
>
>
>
> On Wed, Jan 11, 2017 at 12:07 PM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
>
> > That's right; what other info do you think would be useful?
> >
> > On Tue, Jan 10, 2017 at 11:09 AM, Kam Kasravi <kamkasr...@gmail.com>
> > wrote:
> >
> > > Thanks Andrew
> > > I think more information about the DRM operations and how persistence
> > > would be done at the runner level. It looks like HDFS or spark caching
> is
> > > currently being used?
> > >
> > >     On Monday, January 9, 2017 6:04 PM, Andrew Musselman <
> a...@apache.org
> > >
> > > wrote:
> > >
> > >
> > >  Hello Beam Team,
> > >
> > > Thought you might be interested in the work we've been doing on Mahout,
> > > such as the distributed linear algebra DSL/front-end that can use
> > multiple
> > > back-ends for compute (Spark, Flink, H2O now). See
> > > https://mahout.apache.org/users/environment/out-of-core-reference.html
> > for
> > > an intro.
> > >
> > > We also are working on native CPU/GPU hybrid support and we're close to
> > an
> > > initial release. Let us know if you'd like to know more.
> > >
> > > Thanks and best of luck!
> > >
> > > Best
> > > Andrew Musselman
> > >
> > > On 2017-01-09 12:00 (-0800), Kam Kasravi <kamkasr...@gmail.com> wrote:
> > > > Hi Vladisav
> > > >
> > > > I'm the author of the design document. An area we stalled on was
> > creating
> > > a
> > > > common low level linear algebra library that would also include
> > > > optimizations like MKL but across platforms and GPUs.
> > > > Additionally there are efforts underway that provide a scoring API
> vs a
> > > > training API.
> > > >
> > > >    - PredictionIO http://predictionio.incubator.apache.org/ (now
> part
> > of
> > > >    Salesforce)
> > > >    - MLeap https://github.com/combust/mleap
> > > >    - PFA - Portable Format for Analytics http://dmg.org/pfa/
> > > >
> > > > Any ML effort needs to also include deep learning and the ability to
> > > > integrate various types of neural networks. Apache has several early
> > > > efforts in this regard (mxnet, singa).
> > > >
> > > > Thanks
> > > > Kam
> > > >
> > > > On Fri, Jan 6, 2017 at 7:07 AM, Vladisav Jelisavcic <
> > vladis...@gmail.com
> > > >
> > > > wrote:
> > > >
> > > > > Hi everyone,
> > > > >
> > > > > what is the current status on BEAM-478 and BEAM-303 (machine
> learning
> > > > > learning DSL and related functions)?
> > > > > I would like to start contributing in this direction.
> > > > >
> > > > > I found this design document:
> > > > > https://docs.google.com/document/d/17cRZk_
> > > yqHm3C0fljivjN66MbLkeKS1yjo4PB
> > > > > ECHb-xA/edit#heading=h.n51rhya8bv4f
> > > > >
> > > > > Are there any other docs/advances related to this?
> > > > >
> > > > >
> > > > > Best regards,
> > > > > Vladisav
> > > > >
> > > >
> > >
> > >
> > >
> > >
> >
>

Reply via email to