[
https://issues.apache.org/jira/browse/MAHOUT-157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12742033#action_12742033
]
Isabel Drost commented on MAHOUT-157:
-------------------------------------
Patch applies to trunk but I run into problems when trying to get it to
compile. I needed to apply MAHOUT-124 first. Then I got an error that indicated
that you are using the Combinations class not only in the tests (where it is
put by the diff) but also in the regular source code. After copying the class
to src/main/java, I get the following error:
MAHOUT-157/core/src/main/java/org/apache/mahout/fpm/pfpgrowth/ParallelFPGrowthReducer.java:[92,41]
get(java.lang.String) in org.apache.mahout.common.Parameters cannot be applied
to (java.lang.String,java.lang.String)
I guess I have done something wrong when applying the patches one after another?
Other than that I only have some general comments before going into more detail
for the review: I am missing some documentation, both JavaDoc and package.html,
at least a link to the original paper would be nice to have.
PFPGrowth - seems like you do quite a lot of work in your constructor. I think
it is no good idea to start map reduce jobs from within a constructor. Maybe I
am reading something wrong here?
Is it possible to break up the test into unit tests? I think that would make
changing the code and tracking where the change actually broke the code by far
easier.
AggregatorReducer, line 88: Please avoid calls to <Exception>.printStackTrace()
- usually those messages get lost when the system is in production. Better log
the message with Logger.<fatal|error|warn>("your message", <Exception>) - maybe
rethrow the exception if you cannot handle it properly.
TreeNode - the class seems to contain public attributes only but no methods.
Please at least explain which type of tree these nodes are supposed to be a
part of. From the code alone I am not able to understand the its usage...
> Frequent Pattern Mining using Parallel FP-Growth
> ------------------------------------------------
>
> Key: MAHOUT-157
> URL: https://issues.apache.org/jira/browse/MAHOUT-157
> Project: Mahout
> Issue Type: New Feature
> Affects Versions: 0.2
> Reporter: Robin Anil
> Fix For: 0.2
>
> Attachments: MAHOUT-157-August-6.patch,
> MAHOUT-157-Combinations-BSD-License.patch,
> MAHOUT-157-Combinations-BSD-License.patch,
> MAHOUT-157-inProgress-August-5.patch
>
>
> Implement: http://infolab.stanford.edu/~echang/recsys08-69.pdf
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.