Hi all,
A noob question I'm sure but I'm stuck. I'm using CVB to cluster a text index
of articles.
Here's the CVB call:
bin/mahout cvb \ -i /opt/mahout/lucene-sparse-vectors-cvb/matrix \ -dict
/opt/mahout/cvb-output/dict.file-* \ -o /opt/mahout/cvb-output/topic_terms.out
\ -dt /opt/mahout/cvb-output/topic_dist.out \ -k 200 \-mt
/opt/mahout/output/iterations/ \-x 20 -a .25 -ow
I'm trying to access the topics using ldatopics per
https://cwiki.apache.org/confluence/display/MAHOUT/Latent+Dirichlet+Allocation.
My latest combination was: bin/mahout ldatopics -i opt/mahout/cvb-output/ -d
/opt/mahout/cvb-output/dict.file-*
However, it returns an error stating: ERROR driver.MahoutDriver: : Try the new
Collapsed Variation Bayes LDA, try bin/mahout cvb or bin/mahout cvb0_local
The spec is:bin/mahout ldatopics \ -i <input vectors directory> \ -d
<input dictionary file> \
What is the vectors directory supposed to be? Many thanks in advance.
Cheers!
Chris