Yes, the problem is in the parsing of the Dir attribute that
propagates down to the other attributes.

Cheers, Albert

On Mon, Apr 27, 2015 at 3:31 PM, Gianmarco De Francisci Morales
<[email protected]> wrote:
> It's weird, because the error seems to be on the class attribute.
> Maybe some error propagates down the line while parsing?
>
> --
> Gianmarco
>
> On 26 April 2015 at 06:21, Albert Bifet <[email protected]> wrote:
>
>> It seems that the problem is with the Dir attribute. Without this
>> attribute it's working fine. It contains non standard characters. We
>> should check how this attribute is parsed.
>>
>> On Fri, Apr 24, 2015 at 4:42 PM, Simon Dugas (JIRA) <[email protected]>
>> wrote:
>> > Simon Dugas created SAMOA-26:
>> > --------------------------------
>> >
>> >              Summary: VHT throws NumberFormatException on class attribute
>> >                  Key: SAMOA-26
>> >                  URL: https://issues.apache.org/jira/browse/SAMOA-26
>> >              Project: SAMOA
>> >           Issue Type: Bug
>> >           Components: SAMOA-Local
>> >          Environment: MAC OSX 10.10.3
>> > java version "1.7.0_71"
>> > Java(TM) SE Runtime Environment (build 1.7.0_71-b14)
>> > Java HotSpot(TM) 64-Bit Server VM (build 24.71-b01, mixed mode)
>> >             Reporter: Simon Dugas
>> >
>> >
>> > I'm trying to debug the following error, PrequentialEvaluation with VHT
>> (classification) throws a NumberFormatException for the class attribute.
>> Why is it trying to parse the class attribute as an integer? I can't find a
>> format error in my ARFF file. It was created with
>> weka.core.converters.CSVLoader. Other datasets (nominal only) work fine
>> with my install of SAMOA. This configuration runs fine in MOA.
>> >
>> > Command Line Argument
>> >
>> > bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar
>> "PrequentialEvaluation -l
>> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
>> (ArffFileStream -f test.arff) -f 1"
>> >
>> > ARFF File
>> >
>> > @relation test.txt
>> >
>> > @attribute Dur numeric
>> > @attribute Proto
>> {udp,tcp,icmp,arp,ipx/spx,ipv6-icmp,pim,esp,igmp,rtcp,rtp,ipv6,udt}
>> > @attribute Dir {' <->',' <?>',' ->',' ?>',' who',' <-',' <?'}
>> > @attribute State {CON,PA_PA,PA_FRA, ...}
>> > @attribute sTos numeric
>> > @attribute dTos numeric
>> > @attribute TotPkts numeric
>> > @attribute TotBytes numeric
>> > @attribute SrcBytes numeric
>> > @attribute class {Background,Normal,Botnet}
>> >
>> > @data
>> >
>> > 1065.731934,udp,' <->',CON,0,0,2,252,145,Background
>> > 1471.787109,udp,' <->',CON,0,0,2,252,145,Background
>> > ...
>> >
>> > Error Output
>> >
>> > Command line string = PrequentialEvaluation -l
>> com.yahoo.labs.samoa.learners.classifiers.trees.VerticalHoeffdingTree -s
>> (ArffFileStream -f test.arff) -f 1
>> > 2015-04-23 12:05:45,277 [main] INFO com.yahoo.labs.samoa.LocalDoTask
>> (LocalDoTask.java:80) - Successfully instantiating
>> com.yahoo.labs.samoa.tasks.PrequentialEvaluation
>> > Exception in thread "main" java.lang.NumberFormatException: For input
>> string: "Background"
>> > at
>> sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1250)
>> > at java.lang.Double.valueOf(Double.java:504)
>> > at
>> com.yahoo.labs.samoa.instances.ArffLoader.readInstanceDense(ArffLoader.java:105)
>> > at
>> com.yahoo.labs.samoa.instances.ArffLoader.readInstance(ArffLoader.java:77)
>> > at
>> com.yahoo.labs.samoa.instances.Instances.readInstance(Instances.java:182)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.getNextInstanceFromFile(ArffFileStream.java:183)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.readNextInstanceFromFile(ArffFileStream.java:145)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:118)
>> > at
>> com.yahoo.labs.samoa.moa.streams.ArffFileStream.nextInstance(ArffFileStream.java:46)
>> > at
>> com.yahoo.labs.samoa.streams.StreamSource.nextInstance(StreamSource.java:70)
>> > at
>> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.initStreamSource(PrequentialSourceProcessor.java:197)
>> > at
>> com.yahoo.labs.samoa.streams.PrequentialSourceProcessor.getDataset(PrequentialSourceProcessor.java:170)
>> > at
>> com.yahoo.labs.samoa.tasks.PrequentialEvaluation.init(PrequentialEvaluation.java:161)
>> > at com.yahoo.labs.samoa.LocalDoTask.main(LocalDoTask.java:87)
>> >
>> >
>> >
>> > --
>> > This message was sent by Atlassian JIRA
>> > (v6.3.4#6332)
>>

Reply via email to