FP-Growth Redundant patterns
----------------------------
Key: MAHOUT-709
URL: https://issues.apache.org/jira/browse/MAHOUT-709
Project: Mahout
Issue Type: Bug
Components: Frequent Itemset/Association Rule Mining
Affects Versions: 0.4
Reporter: Yarco Hayduk
Fix For: 0.5
The algorithm outputs more patterns than necessary.
I have compared Mahout's PFP-Growth algorithm against the FP-Growth
implementation at http://www.borgelt.net/fpgrowth.html. That
implementation also has an option to generate closed patterns.
When I filtered the sub-patterns out of the Parallel FP-Growth output, I
arrived at the same result as http://www.borgelt.net/fpgrowth.html
In short, the output is not limited to closed patterns.
I am attaching the dummy DB along with the output of both algorithms.
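
For reference, the filtering step described above can be sketched roughly as
follows. This is only an illustration, not Mahout code and not the exact
script used for the comparison: it assumes the usual definition of a closed
pattern (no proper superset with the same support), and the function name
closed_patterns and the toy data are hypothetical, not the attached DB.

```python
def closed_patterns(patterns):
    """Keep only patterns that have no proper superset with equal support.

    `patterns` is an iterable of (items, support) pairs.
    """
    itemsets = [(frozenset(items), support) for items, support in patterns]
    closed = []
    for itemset, support in itemsets:
        # `itemset < other` is True only when other is a proper superset.
        redundant = any(
            itemset < other and support == other_support
            for other, other_support in itemsets
        )
        if not redundant:
            closed.append((itemset, support))
    return closed


# Hypothetical frequent patterns for the transactions {a,b}, {a,b}, {a}.
frequent = [
    (["a"], 3),
    (["b"], 2),
    (["a", "b"], 2),
]

# {b} is dropped because {a, b} has the same support of 2.
result = closed_patterns(frequent)
```

The pairwise comparison is quadratic in the number of patterns, which is fine
for a dummy DB but would need an index by support for large outputs.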
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira