On 7/6/11 4:38 PM, [email protected] wrote:
but it also consumes less memory after loading. This LGPL dictionary library
uses an FSA data structure that requires less memory than a Hashtable to store
500k words, and it is also fast enough at runtime.

Yeah, it would be nice to have a better dictionary in OpenNLP. We also
discussed using Bloom filters, which I believe might be good enough for
feature generation in many cases anyway.
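
For illustration only, a Bloom filter used as a dictionary membership test
could look roughly like the sketch below. This is not OpenNLP code: the class
name BloomFilterSketch, the seeded FNV-1a-style hash, and the size and hash
count parameters are all assumptions made for the example. A lookup can
return a false positive but never a false negative, which is why it may be
acceptable for feature generation.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Hypothetical sketch, not part of OpenNLP: a fixed-size Bloom filter
// that answers "might this word be in the dictionary?".
class BloomFilterSketch {
    private final BitSet bits;
    private final int size;       // number of bits in the filter
    private final int hashCount;  // number of probes per word

    BloomFilterSketch(int size, int hashCount) {
        if (size <= 0 || hashCount <= 0) {
            throw new IllegalArgumentException("size and hashCount must be positive");
        }
        this.bits = new BitSet(size);
        this.size = size;
        this.hashCount = hashCount;
    }

    // FNV-1a-style 32-bit hash, mixed with a seed (an assumption for this sketch).
    private static int hash(byte[] data, int seed) {
        int h = 0x811C9DC5 ^ seed;
        for (byte b : data) {
            h ^= (b & 0xff);
            h *= 16777619;
        }
        return h;
    }

    // Double hashing: probe i is derived from two base hashes.
    private int index(byte[] data, int i) {
        int h1 = hash(data, 0);
        int h2 = hash(data, 0x9747b28c);
        return Math.floorMod(h1 + i * h2, size);
    }

    void add(String word) {
        byte[] data = word.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < hashCount; i++) {
            bits.set(index(data, i));
        }
    }

    // true: the word may be present (false positives are possible).
    // false: the word was definitely never added.
    boolean mightContain(String word) {
        byte[] data = word.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < hashCount; i++) {
            if (!bits.get(index(data, i))) {
                return false;
            }
        }
        return true;
    }
}
```

The memory cost is fixed by the chosen bit count rather than by the length of
the stored words, so the trade-off is between filter size and false-positive
rate.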

Jörn
