Hello,

Sorry this is probably in the documentation somewhere, but I couldn't find it.

How to index and search accented words without accents?

For example: "Portégé" (a model for Toshiba laptops) would be indexed
as "portege"; and the search for "portégé" would be equivalent to the
search for "portege" and find either "Portégé", "Portegé", "portége",
"portege", etc.

This is how Google works; maybe Nutch do the same by default?

Currently, by default (0.7.1), "Portégé" is indexed as "portégé" and
found only if searched for "portégé" or "Portégé" (but not "portege").

This is all the most useful considering users in the US do not have
easy access to accented letters on their keywords...

Thanks,
Frank.

Reply via email to