[ 
https://issues.apache.org/jira/browse/MAHOUT-157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12751990#action_12751990
 ] 

Robin Anil commented on MAHOUT-157:
-----------------------------------

What I am merging are not integers its a pair of a list of integers(the pattern 
found) and an integer(the support of the pattern).  My Comparison operator is 
first based on support(higher to lower), then sizes of the pattern(longer to 
shorter), then the lexicographic ordering if the patterns are of same size and 
support. 

I have been reading up a couple of optmisations to fpgrowth

* Compressing the FPTree data structure representation for faster growth 
recursion (http://ftp1.de.freebsd.org/Publications/CEUR-WS/Vol-90/liu.pdf       
            
                                                                                
                                             
http://www.computing.edu.au/~sucahyoy/article/hpdm04.pdf )
* Detecting Frequent Closed item-sets.(if a pattern (a1 a2 .... an) is there 
with some support. Instead of outputting 2^n sets in the power-set of that set, 
output just the pattern provided the item-set is closed(definition here 
http://ftp1.de.freebsd.org/Publications/CEUR-WS/Vol-90/liu.pdf )

> Frequent Pattern Mining using Parallel FP-Growth
> ------------------------------------------------
>
>                 Key: MAHOUT-157
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-157
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>            Reporter: Robin Anil
>             Fix For: 0.2
>
>         Attachments: MAHOUT-157-August-17.patch, MAHOUT-157-August-24.patch, 
> MAHOUT-157-August-31.patch, MAHOUT-157-August-6.patch, 
> MAHOUT-157-Combinations-BSD-License.patch, 
> MAHOUT-157-Combinations-BSD-License.patch, 
> MAHOUT-157-inProgress-August-5.patch, MAHOUT-157-September-5.patch
>
>
> Implement: http://infolab.stanford.edu/~echang/recsys08-69.pdf

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to