[ 
https://issues.apache.org/jira/browse/OPENNLP-338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13438538#comment-13438538
 ] 

Joern Kottmann commented on OPENNLP-338:
----------------------------------------

The best F-Measure on the conll data i could get was 0.05.
Here are the details:
Precision: 0.38461538461538464
Recall: 0.02844950213371266
F-Measure: 0.05298013245033113

In the QNModel the case that a feature does not has a param is much more 
frequent in the name finder than in the PPA data set I think. Another 
difference is that it uses beam search to look for a likely sequence. Maybe the 
SMOOTHING_VALUE needs to be adjusted. I changed it to 1 and used a cutoff of 5 
instead for the test results above.
Is the smoothing needed? Should it be disabled?

Do we need to initialize the results array in QNModel.eval with some prior 
values, like we in the GISModel?

Should the CONVERGE_TOLERANCE in QNTrainer be lower?
                
> Add L-BFGS parameter estimation training to maxent
> --------------------------------------------------
>
>                 Key: OPENNLP-338
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-338
>             Project: OpenNLP
>          Issue Type: New Feature
>          Components: Maxent
>            Reporter: Joern Kottmann
>            Assignee: Hyosup Shim
>             Fix For: tools-1.5.3
>
>         Attachments: nl-per.testa, nl-per.testb, nl-per.train, 
> patch20120814-lbfgs.txt, patch20120821-lbfgs, patch-lbfgs.txt
>
>
> Add support for the L-BFGS algorithm to train a maxent classifier.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to