[ 
https://issues.apache.org/jira/browse/SAMOA-68?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16057590#comment-16057590
 ] 

ASF GitHub Bot commented on SAMOA-68:
-------------------------------------

Github user nicolas-kourtellis commented on the issue:

    https://github.com/apache/incubator-samoa/pull/61
  
    Thank you @mgrzenda for the contribution! Very nice!
    I tested it with bagging and VHT, covtypeNorm and random-tree-generator.
    Quick feedback:
    - Can you please remove the copyright years etc., as per the other PRs?
    - Can you add some comments/help on what each of the new parameters mean?
    - I think the new parameter on the frequency is not included in the defined 
templates. Can you check and add it?
    - I think the printout of the last prediction is not printed in file. Not 
such a big deal but please check if there is something you can do to fix it (or 
if it's something bigger).
    - Interestingly, the higher the frequency of predicted labels printed 
(controlled by -h), the higher the overhead on the execution, and the longer it 
takes to finish the examples I tried. However, practically when outputing every 
say 10 instances, the impact is minimal. Any way we can make it impact less the 
execution time? Or is this due to I/O bottleneck to disk?


> Saving true and predicted labels to file
> ----------------------------------------
>
>                 Key: SAMOA-68
>                 URL: https://issues.apache.org/jira/browse/SAMOA-68
>             Project: SAMOA
>          Issue Type: New Feature
>          Components: SAMOA-API
>            Reporter: Maciej Grzenda
>              Labels: features
>
> Currently PrequentialEvaluation task supports dumpFile option.  With this 
> option model performance can be saved to a file. However, in some cases it 
> would be good to save also individual predictions made by a model.  This is 
> useful for model debugging and method development.
> This could be also used to visualize model output, calculate custom 
> performance indicators (e.g. model accuracy for instances of a certain class 
> or sharing the same feature value).  Such saving of model output (if done) 
> should be made for every instance. Hence, a new option making it possible to 
> dump predictions to a separate file seems justified.  For classification, it 
> should include votes made for individual classes, if available.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to