Re: How to handle big dictionaries to find typos

Catalin Mititelu Sun, 13 Sep 2015 12:09:07 -0700

Hi Damiano,

You may try Lucene's fuzzy query, which is based on Levenshtein distance.
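
To make the idea concrete, here is a minimal sketch in plain Java of what matching by Levenshtein distance means. It is not Lucene's implementation (as far as I know, Lucene runs a Levenshtein automaton against the indexed terms instead of comparing entries one by one, and allows at most 2 edits). The class name FuzzyDictionaryLookup, the method findClose and the sample company list are made up for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class FuzzyDictionaryLookup {

    /** Two-row dynamic-programming Levenshtein distance (insert, delete, substitute). */
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        // Distance from the empty prefix of a to each prefix of b.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1), prev[j - 1] + cost);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Hypothetical helper: linear scan returning entries within maxEdits of the query. */
    static List<String> findClose(String query, List<String> dictionary, int maxEdits) {
        List<String> hits = new ArrayList<>();
        String q = query.toLowerCase();
        for (String entry : dictionary) {
            // Cheap filter: a length gap larger than maxEdits can never match.
            if (Math.abs(entry.length() - q.length()) > maxEdits) {
                continue;
            }
            if (levenshtein(q, entry.toLowerCase()) <= maxEdits) {
                hits.add(entry);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Sample data, assumed for the example only.
        List<String> companies = List.of("Google", "Apple", "Oracle");
        System.out.println(findClose("Gogle", companies, 2));
        System.out.println(findClose("Gooogle", companies, 2));
    }
}
```

A linear scan like this compares the query against every entry, so with about 3M names it will be slow for each lookup. That is the reason to index the names and let Lucene do the fuzzy matching rather than looping over the dictionary yourself.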


BR,
Catalin

On 09/13/2015 09:59 PM, Damiano Porta wrote:

Hello,

I have created a very big dictionary of companies; it has around 3M entries.
At the moment I am using the DictionaryNameFinder class, but I need to
implement something to find typos like Gogle/Gooogle Inc, etc.
I read something about Levenshtein distance; is this implemented in
OpenNLP?
It seems good, but I read that it takes a lot of time if there are many words (my
case).

What should I do?
Thanks!
Damiano
