It's weird, because the error seems to be on the class attribute. Maybe some error propagates down the line while parsing?
-- Gianmarco On 26 April 2015 at 06:21, Albert Bifet <[email protected]> wrote: > It seems that the problem is with the Dir attribute. Without this > attribute it's working fine. It contains non standard characters. We > should check how this attribute is parsed. > > On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <[email protected]> > wrote: > > Simon Dugas created SAMOA-26: > > -------------------------------- > > > > Summary: VHT throws NumberFormatException on class attribute > > Key: SAMOA-26 > > URL: https://issues.apache.org/jira/browse/SAMOA-26 > > Project: SAMOA > > Issue Type: Bug > > Components: SAMOA-Local > > Environment: MAC OSX 10.10.3 > > java version "1.7.0_71" > > Java(TM) SE Runtime Environment (build 1.7.0_71-b14) > > Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode) > > Reporter: Simon Dugas > > > > > > I'm trying to debug the following error, PrequentialEvaluation with VHT > (classification) throws a NumberFormatException for the class attribute. > Why is it trying to parse the class attribute as an integer? I can't find a > format error in my ARFF file. It was created with > weka.core.converters.CSVLoader. Other datasets (nominal only) work fine > with my install of SAMOA. This configuration runs fine in MOA. > > > > Command Line Argument > > > > bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar > "PrequentialEvaluation -l > com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s > (ArffFileStream -f test.arff) -f 1" > > > > ARFF File > > > > @relation test.txt > > > > @attribute Dur numeric > > @attribute Proto > {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt} > > @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'} > > @attribute State {CON,PA_PA,PA_FRA, ...} > > @attribute sTos numeric > > @attribute dTos numeric > > @attribute TotPkts numeric > > @attribute TotBytes numeric > > @attribute SrcBytes numeric > > @attribute class {Background,Normal,Botnet} > > > > @data > > > > 1065.731934,udp,' <->',CON,0,0,2,252,145,Background > > 1471.787109,udp,' <->',CON,0,0,2,252,145,Background > > ... > > > > Error Output > > > > Command line string = PrequentialEvaluation -l > com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s > (ArffFileStream -f test.arff) -f 1 > > 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask > (LocalDoTask.java:80) - Successfully instantiating > com.yahoo.labs.samoa.tasks.PrequentialEvaluation > > Exception in thread "main" java.lang.NumberFormatException: For input > string: "Background" > > at > sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250) > > at java.lang.Double.valueOf(Double.java:504) > > at > com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105) > > at > com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77) > > at > com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182) > > at > com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183) > > at > com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145) > > at > com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118) > > at > com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46) > > at > com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70) > > at > com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197) > > at > com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170) > > at > com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161) > > at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87) > > > > > > > > -- > > This message was sent by Atlassian JIRA > > (v6.3.4#6332) >
