[
https://issues.apache.org/jira/browse/OPENNLP-671?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Vinh Khuc updated OPENNLP-671:
------------------------------
Description:
L1-regularization is useful during training Maximum Entropy models since it
pushes parameters of irrelevant features to zero. Hence, the parameter vector
will be sparse and the trained model will be compact.
When the number of features is much larger than the number of training
examples, L1 often gives better accuracy than L2.
The implementation of L1-regularization for L-BFGS will follow the method
described in the paper:
http://research.microsoft.com/en-us/um/people/jfgao/paper/icml07scalable.pdf
was:
L1-regularization is useful during training Maximum Entropy models since it
pushes parameters of irrelevant features to zero. Hence, the trained model will
be sparse and compact.
When the number of features is much larger than the number of training
examples, L1 often gives better accuracy than L2.
The implementation of L1-regularization for L-BFGS will follow the method
described in the paper:
http://research.microsoft.com/en-us/um/people/jfgao/paper/icml07scalable.pdf
> Add L1-regularization into L-BFGS
> ---------------------------------
>
> Key: OPENNLP-671
> URL: https://issues.apache.org/jira/browse/OPENNLP-671
> Project: OpenNLP
> Issue Type: Improvement
> Components: Machine Learning
> Reporter: Vinh Khuc
>
> L1-regularization is useful during training Maximum Entropy models since it
> pushes parameters of irrelevant features to zero. Hence, the parameter vector
> will be sparse and the trained model will be compact.
> When the number of features is much larger than the number of training
> examples, L1 often gives better accuracy than L2.
> The implementation of L1-regularization for L-BFGS will follow the method
> described in the paper:
> http://research.microsoft.com/en-us/um/people/jfgao/paper/icml07scalable.pdf
--
This message was sent by Atlassian JIRA
(v6.2#6252)