[ 
https://issues.apache.org/jira/browse/MAHOUT-593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Lyubimov updated MAHOUT-593:
------------------------------------

    Attachment: MAHOUT-593.patch.gz

* minor renaming, final patch 
 git diff trunk --stat
core/pom.xml                                       |    6 +-
.../java/org/apache/mahout/common/IOUtils.java     |   90 +++
.../mahout/math/hadoop/stochasticsvd/BBtJob.java   |  188 ++++++
.../mahout/math/hadoop/stochasticsvd/BtJob.java    |  303 ++++++++++
.../hadoop/stochasticsvd/DenseBlockWritable.java   |   85 +++
.../hadoop/stochasticsvd/GivensThinSolver.java     |  621 ++++++++++++++++++++
.../mahout/math/hadoop/stochasticsvd/Omega.java    |  124 ++++
.../hadoop/stochasticsvd/PartialRowEmitter.java    |   32 +
.../mahout/math/hadoop/stochasticsvd/QJob.java     |  351 +++++++++++
.../mahout/math/hadoop/stochasticsvd/SSVDCli.java  |  126 ++++
.../math/hadoop/stochasticsvd/SSVDPrototype.java   |  383 ++++++++++++
.../math/hadoop/stochasticsvd/SSVDSolver.java      |  489 +++++++++++++++
.../mahout/math/hadoop/stochasticsvd/UJob.java     |  156 +++++
.../math/hadoop/stochasticsvd/UpperTriangular.java |  151 +++++
.../mahout/math/hadoop/stochasticsvd/VJob.java     |  150 +++++
.../hadoop/stochasticsvd/LocalSSVDSolverTest.java  |  172 ++++++
.../hadoop/stochasticsvd/SSVDPrototypeTest.java    |  112 ++++
math/pom.xml                                       |    2 +-
.../mahout/math/ssvd/EigenSolverWrapper.java       |   61 ++
src/conf/driver.classes.props                      |    1 +
src/conf/ssvd.props                                |   13 +
21 files changed, 3613 insertions(+), 3 deletions(-)

> Backport of Stochastic SVD patch (Mahout-376) to hadoop 0.20 to ensure 
> compatibility with current Mahout dependencies.
> ----------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-593
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-593
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Math
>    Affects Versions: 0.4
>            Reporter: Dmitriy Lyubimov
>             Fix For: 0.5
>
>         Attachments: MAHOUT-593.patch.gz, MAHOUT-593.patch.gz, 
> MAHOUT-593.patch.gz, MAHOUT-593.patch.gz, SSVD-givens-CLI.pdf, 
> ssvdclassdiag.png
>
>
> Current Mahout-376 patch requries 'new' hadoop API.  Certain elements of that 
> API (namely, multiple outputs) are not available in standard hadoop 0.20.2 
> release. As such, that may work only with either CDH or 0.21 distributions. 
>  In order to bring it into sync with current Mahout dependencies, a backport 
> of the patch to 'old' API is needed. 
> Also, some work is needed to resolve math dependencies. Existing patch relies 
> on apache commons-math 2.1 for eigen decomposition of small matrices. This 
> dependency is not currently set up in the mahout core. So, certain snippets 
> of code are either required to go to mahout-math or use Colt eigen 
> decompositon (last time i tried, my results were mixed with that one. It 
> seems to produce results inconsistent with those from mahout-math 
> eigensolver, at the very least, it doesn't produce singular values in sorted 
> order).
> So this patch is mainly moing some Mahout-376 code around.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to