Author: apalumbo
Date: Sun Apr 12 01:05:19 2015
New Revision: 1672958
URL: http://svn.apache.org/r1672958
Log:
update for release
Modified:
mahout/site/mahout_cms/trunk/content/users/environment/h2o-internals.mdtext
Modified:
mahout/site/mahout_cms/trunk/content/users/environment/h2o-internals.mdtext
URL:
http://svn.apache.org/viewvc/mahout/site/mahout_cms/trunk/content/users/environment/h2o-internals.mdtext?rev=1672958&r1=1672957&r2=1672958&view=diff
==============================================================================
--- mahout/site/mahout_cms/trunk/content/users/environment/h2o-internals.mdtext
(original)
+++ mahout/site/mahout_cms/trunk/content/users/environment/h2o-internals.mdtext
Sun Apr 12 01:05:19 2015
@@ -1,6 +1,6 @@
# Introduction
-This document provides an overview of how the Mahout Scala DSL (distributed
algebraic operators) is implemented over the H2O backend engine. The document
is aimed at Mahout developers, to give a high level description of the design
so that one can explore the code inside `h2o/` with some context.
+This document provides an overview of how the Mahout Samsara environment is
implemented over the H2O backend engine. The document is aimed at Mahout
developers, to give a high level description of the design so that one can
explore the code inside `h2o/` with some context.
## H2O Overview
@@ -13,10 +13,10 @@ The data model of the H2O math engine is
## Mahout DRM
-The Mahout DRM, or Distributed Row Matrix, is an abstraction for storing a
large matrix of numbers in-memory in a cluster by distributing logical rows
among servers. The DSL provides an abstract API on DRMs for backend engines to
provide implementations of this API. Examples are the Spark and H2O backend
engines. Each engine has it's own design of mapping the abstract API onto its
data model and provides implementations for algebraic operators over that
mapping.
+The Mahout DRM, or Distributed Row Matrix, is an abstraction for storing a
large matrix of numbers in-memory in a cluster by distributing logical rows
among servers. Mahout's scala DSL provides an abstract API on DRMs for backend
engines to provide implementations of this API. Examples are the Spark and H2O
backend engines. Each engine has it's own design of mapping the abstract API
onto its data model and provides implementations for algebraic operators over
that mapping.
-## H2O DSL Engine
+## H2O Environment Engine
The H2O backend implements the abstract DRM as an H2O Frame. Each logical
column in the DRM is an H2O Vector. All elements of a logical DRM row are
guaranteed to be homed on the same server. A set of rows stored on a server are
presented as a read-only virtual in-core Matrix (i.e BlockMatrix) for the
closure method in the `mapBlock(...)` API.