Thanks for the reply. After using 6 C L as the token, now I am getting a different error "wrong number of attributes in the string". I have tried using a csv file or a tab deliminated file and got the same error. I also converted a csv file into an ARFF file but got the same error message. I created my files on a windows machine and copied them over. Could that be a problem?
________________________________ From: deneche abdelhakim <[email protected]> To: [email protected]; Lingxiang Cheng <[email protected]> Sent: Sunday, January 1, 2012 12:17 AM Subject: Re: Bad Token put spaces between the letters : 6 C L On Sun, Jan 1, 2012 at 5:16 AM, Lingxiang Cheng <[email protected]>wrote: > Hi, > I prepared my own data and tried to run it through Decision Forest but I > got stuck before I could run > the actually algorithm. I followed the steps listed at: > https://cwiki.apache.org/MAHOUT/partial-implementation.html. > When I typed in the following command (well not exactly the same because > my paths and filenames are different) I > got a "Bad Token" error. In my data I have 6 categories seperated by comma > and the last colume is the lable so I > put in 6CL as my token. Is it not correct? > > $HADOOP_HOME/bin/hadoop jar > $MAHOUT_HOME/core/target/mahout-core-<VERSION>-job.jar > org.apache.mahout.df.tools.Describe -p testdata/KDDTrain+.arff -f > testdata/KDDTrain+.info > -d 6CL > > Thanks and Happy New Year! > Lingxiang
