[
https://issues.apache.org/jira/browse/MAHOUT-344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13144493#comment-13144493
]
Hudson commented on MAHOUT-344:
-------------------------------
Integrated in Mahout-Quality #1150 (See
[https://builds.apache.org/job/Mahout-Quality/1150/])
MAHOUT-344: added minhash to build-asf-email.sh and to driver.classes.props
gsingers :
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1197803
Files :
* /mahout/trunk/examples/bin/build-asf-email.sh
* /mahout/trunk/src/conf/driver.classes.props
> Minhash based clustering
> -------------------------
>
> Key: MAHOUT-344
> URL: https://issues.apache.org/jira/browse/MAHOUT-344
> Project: Mahout
> Issue Type: Bug
> Components: Clustering
> Affects Versions: 0.3
> Reporter: Ankur
> Assignee: Ankur
> Fix For: 0.4
>
> Attachments: MAHOUT-344-v1.patch, MAHOUT-344-v2.patch,
> MAHOUT-344-v3.patch, MAHOUT-344-v4.patch, MAHOUT-344-v5.patch,
> MAHOUT-344-v6.patch, MAHOUT-344-v7.patch
>
>
> Minhash clustering performs probabilistic dimension reduction of high
> dimensional data. The essence of the technique is to hash each item using
> multiple independent hash functions such that the probability of collision of
> similar items is higher. Multiple such hash tables can then be constructed
> to answer near neighbor type of queries efficiently.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira