Hi All,
While kicking the tires on the 0.7 release, I discovered that the
clusterdump commands in examples/bin/cluster-reuters.sh pass -s, which
clusterdump rejects.
When running what's checked in, I get:
12/06/09 08:10:47 ERROR common.AbstractJob: Unexpected -s while processing Job-Specific Options:
usage: <command> [Generic Options] [Job-Specific Options]
The current dump commands look like:
$MAHOUT clusterdump \
-s ${WORK_DIR}/reuters-kmeans/clusters-*-final \
-d ${WORK_DIR}/reuters-out-seqdir-sparse-kmeans/dictionary.file-0 \
-dt sequencefile -b 100 -n 20 --evaluate \
-dm org.apache.mahout.common.distance.CosineDistanceMeasure \
--pointsDir ${WORK_DIR}/reuters-kmeans/clusteredPoints
I think they should use -i for the input and add an explicit output
path (-o) and output format (-of):
$MAHOUT clusterdump \
-i ${WORK_DIR}/reuters-kmeans/clusters-*-final \
-o ${WORK_DIR}/reuters-kmeans/clusters-dump -of TEXT \
-d ${WORK_DIR}/reuters-out-seqdir-sparse-kmeans/dictionary.file-0 \
-dt sequencefile -b 100 -n 20 --evaluate \
-dm org.apache.mahout.common.distance.CosineDistanceMeasure \
--pointsDir ${WORK_DIR}/reuters-kmeans/clusteredPoints
Anyone opposed to getting this fix in for 0.7?
Drew