[ 
https://issues.apache.org/jira/browse/OPENNLP-695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13999840#comment-13999840
 ] 

William Colen commented on OPENNLP-695:
---------------------------------------

I did a commit a few days ago: 
http://svn.apache.org/viewvc?view=revision&revision=r1594287

The idea was to create a Map<String, Object>, where users could set any kind of 
information. Of course it is not possible to create a standard feature 
generator, but it is a lot flexible.

Maybe we could implement a feature generator that would assume the Map contains 
the fields of a MS Word document?

> Add extra information to DocumentSample
> ---------------------------------------
>
>                 Key: OPENNLP-695
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-695
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Doccat
>    Affects Versions: 1.6.0
>            Reporter: William Colen
>            Assignee: William Colen
>            Priority: Minor
>             Fix For: 1.6.0
>
>
> Often a document has additional information fields, such as title, sender, 
> date, key words. We should add field to the DocumentSample where to store 
> this information, and change the API in such a way that users could implement 
> feature generators using this information.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to