Hi,
I've been trying to convert a simple arff file and I'm getting the following
error:
-bash-3.1$ java -cp
mahout-core-0.3-SNAPSHOT.jar:mahout-utils-0.3-SNAPSHOT.jar:$(echo
dependency/*.jar . | sed 's/ /:/g') org.apache.mahout.utils.vectors.arff.Driver
-d vehicle.arff -o iris -t iris/dict.txt
Jan 19, 2010 8:58:36 PM org.slf4j.impl.JCLLoggerAdapter info
INFO: Output Dir: iris
Jan 19, 2010 8:58:36 PM org.slf4j.impl.JCLLoggerAdapter info
INFO: Converting File: vehicle.arff
outfile: iris/vehicle.arff.mvc
Exception in thread "main" java.lang.NullPointerException
at
org.apache.hadoop.io.serializer.SerializationFactory.getSerializer(SerializationFactory.java:73)
at org.apache.hadoop.io.SequenceFile$Writer.init(SequenceFile.java:910)
at
org.apache.hadoop.io.SequenceFile$RecordCompressWriter.<init>(SequenceFile.java:1074)
at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:397)
at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:284)
at org.apache.hadoop.io.SequenceFile.createWriter(SequenceFile.java:265)
at
org.apache.mahout.utils.vectors.arff.Driver.getSeqFileWriter(Driver.java:180)
at org.apache.mahout.utils.vectors.arff.Driver.writeFile(Driver.java:167)
at org.apache.mahout.utils.vectors.arff.Driver.main(Driver.java:132)
My data is:
@relation iris
@attribute f1 numeric
@attribute f2 numeric
@attribute f3 numeric
@attribute f4 numeric
@data
5.1,3.5,1.4,0.2
4.9,3.0,1.4,0.2
4.7,3.2,1.3,0.2
4.6,3.1,1.5,0.2
5.0,3.6,1.4,0.2
5.4,3.9,1.7,0.4
4.6,3.4,1.4,0.3
5.0,3.4,1.5,0.2
4.4,2.9,1.4,0.2
4.9,3.1,1.5,0.1
Any guidance? Thanks.
- jerry