Hi,

I tried FPGrowthJob using the three mahout commands below.

1. $MAHOUT_HOME/bin/mahout fpg -i /input/retail.dat -o /output/numgdef/ -method mapreduce -regex '[\s]' -s 100
2. $MAHOUT_HOME/bin/mahout fpg -i /input/retail.dat -o /output/numg50/ -method mapreduce -regex '[\s]' -s 100 --numGroups 50
3. $MAHOUT_HOME/bin/mahout fpg -i /input/retail.dat -o /output/numg1000/ -method mapreduce -regex '[\s]' -s 100 --numGroups 1000
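
Runs 1 and 2 produced the same results, but run 3 (--numGroups 1000) produced different results. Since run 1 does not set --numGroups, the default appears to behave like 50 rather than 1000.

The documentation page says the default for numGroups is 1000:
https://cwiki.apache.org/confluence/display/MAHOUT/Parallel+Frequent+Pattern+Mining

I think the actual default value is 50. Is that right?

Yuji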
