Yes, the problem is in the parsing of the Dir attribute that propagates down to the other attributes.
Cheers, Albert On Mon, Apr 27, 2015 at 3:31 PM, Gianmarco De Francisci Morales <[email protected]> wrote: > It's weird, because the error seems to be on the class attribute. > Maybe some error propagates down the line while parsing? > > -- > Gianmarco > > On 26 April 2015 at 06:21, Albert Bifet <[email protected]> wrote: > >> It seems that the problem is with the Dir attribute. Without this >> attribute it's working fine. It contains non standard characters. We >> should check how this attribute is parsed. >> >> On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <[email protected]> >> wrote: >> > Simon Dugas created SAMOA-26: >> > -------------------------------- >> > >> > Summary: VHT throws NumberFormatException on class attribute >> > Key: SAMOA-26 >> > URL: https://issues.apache.org/jira/browse/SAMOA-26 >> > Project: SAMOA >> > Issue Type: Bug >> > Components: SAMOA-Local >> > Environment: MAC OSX 10.10.3 >> > java version "1.7.0_71" >> > Java(TM) SE Runtime Environment (build 1.7.0_71-b14) >> > Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode) >> > Reporter: Simon Dugas >> > >> > >> > I'm trying to debug the following error, PrequentialEvaluation with VHT >> (classification) throws a NumberFormatException for the class attribute. >> Why is it trying to parse the class attribute as an integer? I can't find a >> format error in my ARFF file. It was created with >> weka.core.converters.CSVLoader. Other datasets (nominal only) work fine >> with my install of SAMOA. This configuration runs fine in MOA. >> > >> > Command Line Argument >> > >> > bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar >> "PrequentialEvaluation -l >> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s >> (ArffFileStream -f test.arff) -f 1" >> > >> > ARFF File >> > >> > @relation test.txt >> > >> > @attribute Dur numeric >> > @attribute Proto >> {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt} >> > @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'} >> > @attribute State {CON,PA_PA,PA_FRA, ...} >> > @attribute sTos numeric >> > @attribute dTos numeric >> > @attribute TotPkts numeric >> > @attribute TotBytes numeric >> > @attribute SrcBytes numeric >> > @attribute class {Background,Normal,Botnet} >> > >> > @data >> > >> > 1065.731934,udp,' <->',CON,0,0,2,252,145,Background >> > 1471.787109,udp,' <->',CON,0,0,2,252,145,Background >> > ... >> > >> > Error Output >> > >> > Command line string = PrequentialEvaluation -l >> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s >> (ArffFileStream -f test.arff) -f 1 >> > 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask >> (LocalDoTask.java:80) - Successfully instantiating >> com.yahoo.labs.samoa.tasks.PrequentialEvaluation >> > Exception in thread "main" java.lang.NumberFormatException: For input >> string: "Background" >> > at >> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250) >> > at java.lang.Double.valueOf(Double.java:504) >> > at >> com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105) >> > at >> com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77) >> > at >> com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182) >> > at >> com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183) >> > at >> com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145) >> > at >> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118) >> > at >> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46) >> > at >> com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70) >> > at >> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197) >> > at >> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170) >> > at >> com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161) >> > at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87) >> > >> > >> > >> > -- >> > This message was sent by Atlassian JIRA >> > (v6.3.4#6332) >>
