[
https://issues.apache.org/jira/browse/OPENNLP-17?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13029305#comment-13029305
]
Jason Baldridge commented on OPENNLP-17:
----------------------------------------
How about having an object that specifies such a configuration and then using
JSON to store and recover it? This is similar to the serialization suggestion,
but allows text files to be shared, and it pretty straightforward.
I've only read through this quickly, so sorry if this suggestion is off the
mark, or is pretty much the same as the XML suggestion (w/ JSON instead).
> Add support for custom feature generator configuration embedded in the model
> package
> ------------------------------------------------------------------------------------
>
> Key: OPENNLP-17
> URL: https://issues.apache.org/jira/browse/OPENNLP-17
> Project: OpenNLP
> Issue Type: Improvement
> Components: Chunker, Name Finder, POS Tagger
> Affects Versions: tools-1.5.0-sourceforge
> Reporter: Jörn Kottmann
> Attachments: spring-featuregen.xml
>
>
> Add support for custom feature generator configuration embedded in the model
> package.
> The configuration of the feature generators for the name finder component can
> be quite complex and the configuration must
> be always done twice once for training and once for tagging. Doing it twice
> at two different points in time makes
> the feature generation very error prone. Small mistakes lead to a drop in
> detection performance which might
> be difficult to notice.
> To solve this issue add the configuration to the model, then it must only be
> specified during training and
> can be loaded from the model during tagging.
> Another advantage is that custom feature generation is difficult to use
> otherwise, because the integration
> code must deal itself with setting up the feature generators. In some cases
> the user even does not have control
> over the code, or does not want to change it, e.g. in the UIMA wrappers.
> The same logic should be used for the POS Tagger and Chunker.
> The issues is migrated from SourceForge:
> https://sourceforge.net/tracker/?func=detail&aid=1941380&group_id=3368&atid=353368
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira