Hello,
I have Mahout 0.9 and a single-node Hadoop 1.2.1 running on a Mac.
I am trying to create vectors for clustering from a collection of
text documents, so I ran:
$MAHOUT_HOME/bin/mahout seqdirectory \
    --input /Users/hadoop/fuzzyjoin-results/NOTES/progress_notes \
    --output /tmp/mahout-vectors/
However, this gives me an error:
Exception in thread "main" java.io.FileNotFoundException: File does not exist: /Users/hadoop/fuzzyjoin-results/NOTES/progress_notes
even though the directory definitely exists and contains lots of files.
After a lot of searching, I found that adding "-xm sequential" to the
above command makes the error go away; however, the output directory
(/tmp/mahout-vectors) is then empty.
Any help would be appreciated.
Thank you,
Natalia Connolly