[
https://issues.apache.org/jira/browse/OPENNLP-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13069349#comment-13069349
]
James Kosin commented on OPENNLP-236:
-------------------------------------
William,
Also, with the census data, it may end up being passed as multiple input files
to create the dictionary. Jorn at the time said it wasn't much to have the
createDictionary() function in my class when I wanted to be able to pass the
object stream directly to the Dictionary class. And it allows some finer
details to the CensusDictionaryCreator.
I think for the time being we can leave them separate and re-factor later if
really needed later. The object stream class is suppose to validate the input
also before passing back the results; which is really nice. We have actually
used it quite well for the converters; but, it is flexible to use on simple
standard input from files for things like sentences and such for training.
James
> Create a command line tool to create dictionaries
> -------------------------------------------------
>
> Key: OPENNLP-236
> URL: https://issues.apache.org/jira/browse/OPENNLP-236
> Project: OpenNLP
> Issue Type: Improvement
> Components: Command Line Interface
> Affects Versions: tools-1.5.2-incubating
> Reporter: William Colen
> Assignee: William Colen
> Priority: Minor
> Fix For: tools-1.5.2-incubating
>
>
> Should create a command line tool to create dictionaries. The input should be
> a plain text and the output a serialized Dictionary.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira