adam35413 wrote:
I did some testing and determined that a local copy of the results files AND
an HDFS copy has to exist. From looking at the code, it appears that
ClusterDump checks if the files exist locally at Line 129, but then tries to
read the files from HDFS aroudn Line 135. If the files don't exist at both
places mahout will bomb out.
Whats the correct process to rebuild bin/mahout to make sure it is using the
most recent version of the ClusterDump code?
I'd do a checkout from trunk and a clean install. I don't know why the
file handling you describe is working that way. Seems it might be a bug.
I would think the file locations should come with the path argument,
local or HDFS.