It's weird, because the error seems to be on the class attribute.
Maybe some error propagates down the line while parsing?

--
Gianmarco

On 26 April 2015 at 06:21, Albert Bifet <[email protected]> wrote:

> It seems that the problem is with the Dir attribute. Without this
> attribute it's working fine. It contains non standard characters. We
> should check how this attribute is parsed.
>
> On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <[email protected]>
> wrote:
> > Simon Dugas created SAMOA-26:
> > --------------------------------
> >
> >              Summary: VHT throws NumberFormatException on class attribute
> >                  Key: SAMOA-26
> >                  URL: https://issues.apache.org/jira/browse/SAMOA-26
> >              Project: SAMOA
> >           Issue Type: Bug
> >           Components: SAMOA-Local
> >          Environment: MAC OSX 10.10.3
> > java version "1.7.0_71"
> > Java(TM) SE Runtime Environment (build 1.7.0_71-b14)
> > Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode)
> >             Reporter: Simon Dugas
> >
> >
> > I'm trying to debug the following error, PrequentialEvaluation with VHT
> (classification) throws a NumberFormatException for the class attribute.
> Why is it trying to parse the class attribute as an integer? I can't find a
> format error in my ARFF file. It was created with
> weka.core.converters.CSVLoader. Other datasets (nominal only) work fine
> with my install of SAMOA. This configuration runs fine in MOA.
> >
> > Command Line Argument
> >
> > bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar
> "PrequentialEvaluation -l
> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
> (ArffFileStream -f test.arff) -f 1"
> >
> > ARFF File
> >
> > @relation test.txt
> >
> > @attribute Dur numeric
> > @attribute Proto
> {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt}
> > @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'}
> > @attribute State {CON,PA_PA,PA_FRA, ...}
> > @attribute sTos numeric
> > @attribute dTos numeric
> > @attribute TotPkts numeric
> > @attribute TotBytes numeric
> > @attribute SrcBytes numeric
> > @attribute class {Background,Normal,Botnet}
> >
> > @data
> >
> > 1065.731934,udp,' <->',CON,0,0,2,252,145,Background
> > 1471.787109,udp,' <->',CON,0,0,2,252,145,Background
> > ...
> >
> > Error Output
> >
> > Command line string = PrequentialEvaluation -l
> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
> (ArffFileStream -f test.arff) -f 1
> > 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask
> (LocalDoTask.java:80) - Successfully instantiating
> com.yahoo.labs.samoa.tasks.PrequentialEvaluation
> > Exception in thread "main" java.lang.NumberFormatException: For input
> string: "Background"
> > at
> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250)
> > at java.lang.Double.valueOf(Double.java:504)
> > at
> com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105)
> > at
> com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77)
> > at
> com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118)
> > at
> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46)
> > at
> com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70)
> > at
> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197)
> > at
> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170)
> > at
> com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161)
> > at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87)
> >
> >
> >
> > --
> > This message was sent by Atlassian JIRA
> > (v6.3.4#6332)
>

Reply via email to