2009/1/29 Olof Sjobergh <olo...@gmail.com>

>
> I think most problems could be solved by using a dictionary format
> similar to what you describe above, i.e. something like:
>
> match : candidate1 candidate2; frequency
> for example:
> vogel : Vogel Vögel; 123
>
> That would mean you can search on the normalised word where simple
> strcmp works fine and will be fast enough.


This dictionary would have hundreds of millions of rows even if you take
only reasonable user inputs. But what to do if the users inputs something
that's not in the dictionary? Of course I'm assuming you want to correct
typos, as it's doing now.

vogel: Vogel, Vögel
vigel: Vogel, Vögel
vpgel: Vogel, Vögel
wogel: Vogel, Vögel
wigel: Vogel, Vögel
vigem: Vogel, Vögel
vigwl: Vogel, Vögel
...
...
_______________________________________________
Openmoko community mailing list
community@lists.openmoko.org
http://lists.openmoko.org/mailman/listinfo/community

Reply via email to