[ 
https://issues.apache.org/jira/browse/SAMOA-58?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15193353#comment-15193353
 ] 

ASF GitHub Bot commented on SAMOA-58:
-------------------------------------

Github user edi-bice commented on a diff in the pull request:

    https://github.com/apache/incubator-samoa/pull/48#discussion_r56007266
  
    --- Diff: 
samoa-api/src/main/java/org/apache/samoa/evaluation/F1ClassificationPerformanceEvaluator.java
 ---
    @@ -0,0 +1,157 @@
    +package org.apache.samoa.evaluation;
    +
    +/*
    + * #%L
    + * SAMOA
    + * %%
    + * Copyright (C) 2014 - 2016 Apache Software Foundation
    + * %%
    + * Licensed under the Apache License, Version 2.0 (the "License");
    + * you may not use this file except in compliance with the License.
    + * You may obtain a copy of the License at
    + * 
    + *      http://www.apache.org/licenses/LICENSE-2.0
    + * 
    + * Unless required by applicable law or agreed to in writing, software
    + * distributed under the License is distributed on an "AS IS" BASIS,
    + * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
    + * See the License for the specific language governing permissions and
    + * limitations under the License.
    + * #L%
    + */
    +
    +
    +import org.apache.samoa.instances.Instance;
    +import org.apache.samoa.instances.Utils;
    +import org.apache.samoa.moa.AbstractMOAObject;
    +import org.apache.samoa.moa.core.Measurement;
    +
    +import java.util.Collections;
    +import java.util.List;
    +import java.util.Vector;
    +
    +/**
    + * Created by Edi Bice (edi.bice gmail com) on 2/22/2016.
    + */
    +public class F1ClassificationPerformanceEvaluator extends 
AbstractMOAObject implements
    --- End diff --
    
    I'm using Samoa for extremely unbalanced classification and needed 
precision, recall, F-measure along with supports. measures.F1 seemed to apply 
to clustering only (and I didn't know how to use in my scenario) though I can't 
see why we couldn't in theory use one F1 measure for both multiclass 
classification and clustering.


> Samoa AvroFileStream from HDFSFileStreamSource stops at end of first file
> -------------------------------------------------------------------------
>
>                 Key: SAMOA-58
>                 URL: https://issues.apache.org/jira/browse/SAMOA-58
>             Project: SAMOA
>          Issue Type: Bug
>          Components: SAMOA-Instances
>         Environment: RHEL 6.6, java 1.8.0_72
>            Reporter: Edi Bice
>
> It appears Samoa is capable of streaming a collection of files as a single 
> stream effectively concatenating the files. However using Samoa 
> AvroFileStream from HDFSFileStreamSource seems the stream stops at end of 
> first file:
> bin/samoa local target/SAMOA-Local-0.4.0-incubating-SNAPSHOT.jar 
> "PrequentialEvaluation -i -1 -l (classifiers.ensemble.Bagging -s 100) -s 
> (AvroFileStream -s HDFSFileStreamSource -f 
> /tmp/order_and_feats_flat_avro/2016_02_18/ -c 1 -e binary) -f 10000"
> 2016-02-18 20:43:20,991 [main] INFO  
> org.apache.samoa.evaluation.EvaluatorProcessor (EvaluatorProcessor.java:183) 
> - last event is received!
> 2016-02-18 20:43:20,991 [main] INFO  
> org.apache.samoa.evaluation.EvaluatorProcessor (EvaluatorProcessor.java:184) 
> - total count: 262144
> ...
> 2016-02-18 20:43:20,993 [main] INFO  
> org.apache.samoa.evaluation.EvaluatorProcessor (EvaluatorProcessor.java:191) 
> - total evaluation time: 34 seconds for 262144 instances
> bash-4.1$ hadoop fs -ls /tmp/order_and_feats_flat_avro/2016_02_18 | more
> Found 70 items
> -rw-r--r--   3 yarn hdfs  230855335 2016-02-18 16:01 
> /tmp/order_and_feats_flat_avro/2016_02_18/hdfs-1a238673-c4ec-4462-be67-78d573efa790-00001
> -rw-r--r--   3 yarn hdfs  229800273 2016-02-18 16:04 
> /tmp/order_and_feats_flat_avro/2016_02_18/hdfs-1a238673-c4ec-4462-be67-78d573efa790-00002
> ...



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to