I agree with you. In main method of FPGrowthDriver, numGroupsOpt set its description with default value 1000, however PFPGrowth uses 50 as default value for numGroup when divide the features into groups.
That's why the --help option shows the default numGroups value 1000, but exactly 50 is used in runtime. 2011/9/8 yuji anzai <[email protected]> > Thanks for the response. > > The help shows "Default Value:1000". But I think the default value isn't > correct. > > org.apache.mahout.fpm.pfpgrowth.PFPGrowth > line: 285 > Integer numGroups = Integer.valueOf(params.get(NUM_GROUPS, "50")); > > Yuji > > > you can do --help on the command to see the current default value for any > flag > > On Tue, Sep 6, 2011 at 2:43 PM, yuji anzai <[email protected]> wrote: > > >> Hi, > >> > >> I tried FPGrowthJob using mahout command below. > >> 1. $MAHOUT_HOME/bin/mahout fpg -i /input/retail.dat -o /output/numgdef/ > >> -method mapreduce -regex '[\s]' -s 100 > >> 2. $MAHOUT_HOME/bin/mahout fpg -i /input/retail.dat -o /output/numg50/ > >> -method mapreduce -regex '[\s]' -s 100 --numGroups 50 > >> 3. $MAHOUT_HOME/bin/mahout fpg -i /input/retail.dat -o /output/numg1000/ > >> -method mapreduce -regex '[\s]' -s 100 --numGroups 1000 > >> > >> 1. and 2. were same results. but 3.(--numGroups 1000) was different. > >> > >> The document page shows numGroups default 1000. > >> > >> > > https://cwiki.apache.org/confluence/display/MAHOUT/Parallel+Frequent+Pattern+Mining > >> > >> I think default value is 50. Is my thinking right? > >> > >> Yuji > -- Gmail/talk: [email protected]
