Dan,
I'm guessing that by "tokenized" you mean tagged with POS values. If so, a
better approach would be to use the JWNL library to look up the
dictionary terms. We use this with our coref component, and it isn't hard
to get working. The biggest thing with POS is selecting the right one.
It may be better to build a model for the POS tagger than to build a
dictionary for this, unless you mean this for a different language.
I guess I need more information from you on what you are trying to
accomplish.
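In the meantime, here is a rough sketch of what a basic lookup over
already-tokenized text could look like. It is plain Java and does not use the
OpenNLP Dictionary class at all; the class name DictionaryLookup, its methods,
and the sample entries are made up for illustration. It assumes the entries
are already in memory and that matching should be case-insensitive. For
reloading the serialized file itself, as far as I know
opennlp.tools.dictionary.Dictionary has a constructor that takes an
InputStream, but please check the Javadoc for your version.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of OpenNLP: greedy longest-match lookup
// of dictionary entries in a token array.
class DictionaryLookup {

    // Each entry is stored as its lower-cased tokens joined by one space.
    // Assumption: individual tokens never contain a space themselves.
    private final Set<String> entries = new HashSet<>();
    private int maxEntryLength = 0;

    void add(String... tokens) {
        entries.add(key(Arrays.asList(tokens)));
        maxEntryLength = Math.max(maxEntryLength, tokens.length);
    }

    private static String key(List<String> tokens) {
        return String.join(" ", tokens).toLowerCase(Locale.ROOT);
    }

    // Returns matched spans as {start, end} token indexes (end exclusive).
    // At each position the longest matching entry wins, and matches do
    // not overlap.
    List<int[]> findMatches(String[] tokens) {
        List<int[]> matches = new ArrayList<>();
        List<String> tokenList = Arrays.asList(tokens);
        int i = 0;
        while (i < tokens.length) {
            int matchedEnd = -1;
            int longest = Math.min(maxEntryLength, tokens.length - i);
            for (int len = longest; len >= 1; len--) {
                if (entries.contains(key(tokenList.subList(i, i + len)))) {
                    matchedEnd = i + len;
                    break;
                }
            }
            if (matchedEnd > 0) {
                matches.add(new int[] {i, matchedEnd});
                i = matchedEnd;
            } else {
                i++;
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        DictionaryLookup lookup = new DictionaryLookup();
        lookup.add("New", "York");
        lookup.add("Boston");

        String[] tokens = {"Flights", "from", "Boston", "to", "new", "york", "."};
        for (int[] span : lookup.findMatches(tokens)) {
            List<String> matched = Arrays.asList(tokens).subList(span[0], span[1]);
            System.out.println(String.join(" ", matched));
        }
    }
}
```

The token array would come from whatever tokenizer you already have working.
Trying the longest candidate first means a multi-token entry such as
"New York" is preferred over any shorter entry starting at the same token.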
James
On 3/8/2013 6:05 PM, Daniel Franc wrote:
Hello friends,
I am at a novice level for both OpenNLP and Java and have been fumbling
around to put together a working version of the software with some success
thanks to the documentation provided! My eventual goal is partially to
look up terms within a pre-defined dictionary, and I've been able to use
the dictionary creator to create a basic dictionary to look up from, as here:
dictionary.serialize(new FileOutputStream(
"/Applications/apache-opennlp-1.5.2-incubating/dictionarynames.txt"));
My particular questions are:
1. Can someone help me with loading this dictionary after it was previously
created?
2. Is there a straightforward way to implement a basic lookup mechanism for
tokenized text?
Thanks for your help!
-Dan