[
https://issues.apache.org/jira/browse/MAHOUT-593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dmitriy Lyubimov updated MAHOUT-593:
------------------------------------
Attachment: MAHOUT-593.patch.gz
* minor renaming, final patch
git diff trunk --stat
core/pom.xml | 6 +-
.../java/org/apache/mahout/common/IOUtils.java | 90 +++
.../mahout/math/hadoop/stochasticsvd/BBtJob.java | 188 ++++++
.../mahout/math/hadoop/stochasticsvd/BtJob.java | 303 ++++++++++
.../hadoop/stochasticsvd/DenseBlockWritable.java | 85 +++
.../hadoop/stochasticsvd/GivensThinSolver.java | 621 ++++++++++++++++++++
.../mahout/math/hadoop/stochasticsvd/Omega.java | 124 ++++
.../hadoop/stochasticsvd/PartialRowEmitter.java | 32 +
.../mahout/math/hadoop/stochasticsvd/QJob.java | 351 +++++++++++
.../mahout/math/hadoop/stochasticsvd/SSVDCli.java | 126 ++++
.../math/hadoop/stochasticsvd/SSVDPrototype.java | 383 ++++++++++++
.../math/hadoop/stochasticsvd/SSVDSolver.java | 489 +++++++++++++++
.../mahout/math/hadoop/stochasticsvd/UJob.java | 156 +++++
.../math/hadoop/stochasticsvd/UpperTriangular.java | 151 +++++
.../mahout/math/hadoop/stochasticsvd/VJob.java | 150 +++++
.../hadoop/stochasticsvd/LocalSSVDSolverTest.java | 172 ++++++
.../hadoop/stochasticsvd/SSVDPrototypeTest.java | 112 ++++
math/pom.xml | 2 +-
.../mahout/math/ssvd/EigenSolverWrapper.java | 61 ++
src/conf/driver.classes.props | 1 +
src/conf/ssvd.props | 13 +
21 files changed, 3613 insertions(+), 3 deletions(-)
> Backport of Stochastic SVD patch (Mahout-376) to hadoop 0.20 to ensure
> compatibility with current Mahout dependencies.
> ----------------------------------------------------------------------------------------------------------------------
>
> Key: MAHOUT-593
> URL: https://issues.apache.org/jira/browse/MAHOUT-593
> Project: Mahout
> Issue Type: New Feature
> Components: Math
> Affects Versions: 0.4
> Reporter: Dmitriy Lyubimov
> Fix For: 0.5
>
> Attachments: MAHOUT-593.patch.gz, MAHOUT-593.patch.gz,
> MAHOUT-593.patch.gz, MAHOUT-593.patch.gz, SSVD-givens-CLI.pdf,
> ssvdclassdiag.png
>
>
> The current Mahout-376 patch requires the 'new' Hadoop API. Certain elements of that
> API (namely, multiple outputs) are not available in the standard Hadoop 0.20.2
> release, so the patch may work only with either the CDH or 0.21 distributions.
> To bring it into sync with current Mahout dependencies, a backport
> of the patch to the 'old' API is needed.
> Also, some work is needed to resolve math dependencies. The existing patch relies
> on Apache commons-math 2.1 for eigendecomposition of small matrices. This
> dependency is not currently set up in the Mahout core, so certain snippets
> of code must either move to mahout-math or use the Colt eigendecomposition
> (the last time I tried that one, my results were mixed: it
> seems to produce results inconsistent with those from the mahout-math
> eigensolver; at the very least, it doesn't produce singular values in sorted
> order).
> So this patch is mainly moving some Mahout-376 code around.
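
The sorted-order concern in the description can be handled independently of
which eigensolver is used. The following is a minimal sketch, not code from
the patch: the class name SortedEigen and its methods are hypothetical. It
assumes the eigenvalues come from a small symmetric positive semidefinite
matrix (such as B*B^T in stochastic SVD), so that the singular values are the
square roots of the eigenvalues, and that eigenvectors are stored as columns.

```java
import java.util.Arrays;

// Hypothetical helper (not part of MAHOUT-593): reorders the output of any
// eigensolver so that values are descending and vectors stay aligned.
public class SortedEigen {

    /** Returns indices that list the eigenvalues from largest to smallest. */
    public static int[] descendingOrder(double[] eigenvalues) {
        Integer[] boxed = new Integer[eigenvalues.length];
        for (int i = 0; i < boxed.length; i++) {
            boxed[i] = i;
        }
        // Compare b against a to get descending order.
        Arrays.sort(boxed, (a, b) -> Double.compare(eigenvalues[b], eigenvalues[a]));
        int[] order = new int[boxed.length];
        for (int i = 0; i < order.length; i++) {
            order[i] = boxed[i];
        }
        return order;
    }

    /** Singular values as square roots of the reordered eigenvalues. */
    public static double[] singularValues(double[] eigenvalues, int[] order) {
        double[] sigma = new double[order.length];
        for (int j = 0; j < order.length; j++) {
            // Clamp tiny negative values caused by round-off before the root.
            sigma[j] = Math.sqrt(Math.max(0.0, eigenvalues[order[j]]));
        }
        return sigma;
    }

    /** Reorders eigenvector columns to match the sorted eigenvalues. */
    public static double[][] permuteColumns(double[][] vectors, int[] order) {
        double[][] result = new double[vectors.length][order.length];
        for (int r = 0; r < vectors.length; r++) {
            for (int j = 0; j < order.length; j++) {
                result[r][j] = vectors[r][order[j]];
            }
        }
        return result;
    }

    public static void main(String[] args) {
        double[] eigenvalues = {1.0, 9.0, 4.0};            // unsorted solver output
        double[][] eigenvectors = {{1, 0, 0}, {0, 1, 0}, {0, 0, 1}};
        int[] order = descendingOrder(eigenvalues);
        System.out.println(Arrays.toString(singularValues(eigenvalues, order)));
        System.out.println(Arrays.deepToString(permuteColumns(eigenvectors, order)));
    }
}
```

Sorting an index array rather than the values themselves keeps each
eigenvector paired with its eigenvalue, which matters when the U and V
factors are later computed from those vectors.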