Sorry, the raw exception from my nabble post didn't come through in the email
Exception in thread "main" java.io.UTFDataFormatException: malformed input around byte 5 at java.io.DataInputStream.readUTF(DataInputStream.java:617) at java.io.DataInputStream.readUTF(DataInputStream.java:547) at org.apache.mahout.classifier.sgd.PolymorphicWritable.read(PolymorphicWritable.java:41) at org.apache.mahout.classifier.sgd.ModelSerializer.readBinary(ModelSerializer.java:69) at com.memonews.mahout.sentiment.TestNewsGroups.run(TestNewsGroups.java:67) at com.memonews.mahout.sentiment.TestNewsGroups.main(TestNewsGroups.java:59) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at com.intellij.rt.execution.application.AppMain.main(AppMain.java:120 On Jan 29, 2013, at 5:14 PM, Suneel Marthi <[email protected]> wrote: > What's the exception u r seeing? > > > > > ________________________________ > From: vybe3142 <[email protected]> > To: [email protected] > Sent: Tuesday, January 29, 2013 5:26 PM > Subject: Classifying a single file after training the model > > 1. Index the training data that I've pre-classified manually . Then perform > training and testing. Everything works fine to this point > > > > This seems to work (looking at the confusion matrix even though these are > plan old text snippets as opposed to newsgroup text articles. > > 2. At this point, I want to classify individual files that are not part of > the training set. I've tried a bunch of things that don't seem to work. > For example, .. I try to invoke main() on TestNewsGroups.java > <http://svn.apache.org/repos/asf/mahout/trunk/examples/src/main/java/org/apache/mahout/classifier/sgd/TestNewsGroups.java> > > with the args > > > > and end up with an Exception > > > Any idea what I can do to fix this? Thanks > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Classifying-a-single-file-after-training-the-model-tp4037203.html > Sent from the Mahout User List mailing list archive at Nabble.com.
