I bet you're missing part of your stack trace which gives you more helpful info. I see you've run seq2sparse, but not rowId - this step turns your SequenceFile<Text, VectorWritable> to SequenceFile<IntWritable,VectorWritable>, and is currently necessary for input to the LDA (cvb0) training. I'd imagine you should probably have an error related to that somewhere in your output.
p.s. your email header has your name set as "Gmail". I'm guessing that's not actually your name? :) On Mon, Jul 8, 2013 at 10:43 PM, Gmail <[email protected]> wrote: > Hi > > I am trying to run the mahout cvb on hadoop cluster using some text files > as input . I am getting the following error : > > Exception in thread "main" java.lang.IllegalStateException: No part files > found in model path 'temp/topicModelState/model-1' > > My script for running mahout cvb looks like this : > > export work_dir=/home/mahout > > rm -rf $work_dir/input_seqfiles > > ./mahout seqdirectory --input $work_dir/lda_input --output > $work_dir/input_seqfiles -c UTF8 > > rm -rf $work_dir/input_seqparse > > ./mahout seq2sparse -i $work_dir/input_seqfiles -o > $work_dir/input_seqparse -wt tf > > ./mahout cvb -i $work_dir/input_seqparse -o $work_dir/lda_output -k 3 -nt > 10 --maxIter 200 > > > Is there something i am missing ? Any help or suggestion is greatly > appreciated . > > Thanks > > -- -jake
