Define low level Classifier API which only works on ordered int features
------------------------------------------------------------------------

                 Key: OPENNLP-116
                 URL: https://issues.apache.org/jira/browse/OPENNLP-116
             Project: OpenNLP
          Issue Type: Improvement
          Components: Maxent
            Reporter: Jörn Kottmann


The maxent/perceptron code currently performs a mapping from String features to 
low level int features. Most of the code is clearly separated between these two 
different features, but the separation is not complete. There should be a 
clearly separated API for 
dealing with high level features and low level features. The API should also 
contain support to map high level features to low level features.

Goal of the separation is to allow also non-string features to be mapped to the 
low level int features, non string features could be hash int features, or hash 
long features. Or a different representation of a string e.g. UTF-8 bytes.

In previous discussions it turned out that having both levels of API are 
valuable.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira


Reply via email to