Hi all,

in OpenNLP we have a couple of components which rely on sequence tagging.
Right now they are using a normal classifier and search for a good sequence via beam search.

I would like to propose that we change that a bit, all components which are based on sequence tagging should use a Sequence Classification Model instead of directly using an
Event Classification Model (currently named MaxentModel).

The change will have two advantages, It will be possible to integrate ml algorithm which operate on a sequence level (e.g. CRF) and it would be easy to exchange beam search against a similar (maybe enhanced) algorithm.

On the training side we already have support for training on sequences. Anyway the current implementation is a bit unlucky because the sequence training class can only return an Event Classification Model. I will change that so that a Sequence Classification Model has to be returned, and the Perceptron Sequence Model will be returned as a
Sequence Classification Model instead.

Any thoughts?

Jörn

Reply via email to