On 28/09/2011 12:32, Jörn Kottmann wrote:
We once discussed to implement bloom filter based dictionaries, maybe
that would also be an option for you.
Interesting, can you give me more insights about it?
Well, the current implementation cannot really be sub-classed. If you
need to replace it the way to go should be to implement
the TagDictionary interface yourself.
Of course, but I still have a problem. The POSTaggerME.train method
takes as a parameter a POSDictionary object, not a TagDictionary. What
should I do?
A List<String> or a String[] will not work as keys for a Map. We have
a StringList
object for this case. It contains a sequence of String objects.
Can you explain the structure a little, I cannot follow here.
Is the key a sequence of tokens?
Yes, I mean a sequence of tokens w.r.t. what you said in a previous
message. There could be the need to assign a tag to a certain sequence
of tokens.
Riccardo