[
https://issues.apache.org/jira/browse/OPENNLP-239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13078560#comment-13078560
]
James Kosin commented on OPENNLP-239:
-------------------------------------
Okay,
(1) I'm going to implement an attribute for the case sensitivity setting for
the entries being saved to the dictionary. I'm going to keep them separate for
now and not try to combine them into the dictionary object. The reason being;
I'd like to keep the created Dictionary object default the user's required
default setting for the comparison and not for the original dictionary elements
as they where added. You'll understand as things progress.
This means the new dictionary output will be like this
<code>
<dictionary>
<entry CASE="true">
<token>Patrick</token>
</entry>
</code>
(2) Case sensitivity will be dependant on how the dictionary is created;
however, if the user selects case insensitive comparison from the command line,
the false value will take precidence to allow for a case insensitive compare on
equals(). This means a dictionary created with the false default will not
compare case sensitve regardless of any true value specified by the user. So,
create the dictionary accordingly to the true case sensitivity that is required.
(3) I'm hoping the attributes in the XML will allow us to be backward
compatible; since they will not exist otherwise. I'm also hoping the case
attribute will be backward compatible as well in that the older inplementation
never used an attribute value system.
> Case Sensitivie Flag & Custom Tag Dictionary
> --------------------------------------------
>
> Key: OPENNLP-239
> URL: https://issues.apache.org/jira/browse/OPENNLP-239
> Project: OpenNLP
> Issue Type: New Feature
> Components: Parser
> Affects Versions: tools-1.5.1-incubating
> Reporter: mark meiklejohn
> Assignee: James Kosin
> Fix For: tools-1.5.2-incubating
>
>
> Unable to set case sensitive flag as per TreebankParser 1.3.1 or use a custom
> tag dictionary
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira