Re: [Zope] searching with foreign characters in the ZCatalog

2007-10-11 Thread Dieter Maurer
Wohnlich, Eric (IMS) wrote at 2007-10-10 15:42 -0400:
As far as I can tell, using any of the default index types that come with Zope 
2 (FieldIndex, TextIndex, whatever) if you do a search for Jurgen it will 
not match Jürgen.  In my opinion, this is correct - Jurgen is spelled 
incorrectly.

There are two ways to handle this:

  *  either by normalization: you transform your words into a normal
     form and index that form.
     You apply the same normalization to search terms.
     This way, any two terms with the same normal form
     are treated as equivalent.

     In your case, the normalization could replace "ü" with "u".

  *  or by expansion: you expand each search term so that the search
     also covers similar words.

     This technique is often used to find words that sound
     similar, but it can be used for other purposes as well.

     Applied to your case: when someone searches for "Jurgen",
     you would effectively replace it with the query "Jurgen or Jürgen".
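
A minimal sketch of both approaches in plain Python, independent of
ZCatalog. It assumes Python 3 and uses only the standard library; the
names normalize, expand, VARIANTS and the toy dictionary index are
hypothetical and not part of any Zope API. Stripping combining marks
is only one possible normalization (mapping "ü" to "ue" would be
another).

```python
import itertools
import unicodedata


def normalize(word):
    """Strip combining marks and lowercase, e.g. 'Jürgen' -> 'jurgen'."""
    decomposed = unicodedata.normalize("NFKD", word)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.lower()


# Hypothetical table of look-alike characters used for query expansion.
VARIANTS = {"u": ["u", "ü"], "o": ["o", "ö"], "a": ["a", "ä"]}


def expand(term):
    """Return every spelling of `term` obtained by swapping in variants."""
    choices = [VARIANTS.get(ch, [ch]) for ch in term]
    return ["".join(combo) for combo in itertools.product(*choices)]


# Approach 1: indexed words and search terms both pass through normalize().
index = {}
for doc_id, name in enumerate(["Jürgen", "Jurgen", "Dieter"]):
    index.setdefault(normalize(name), set()).add(doc_id)

hits = index.get(normalize("Jurgen"), set())  # documents 0 and 1

# Approach 2: the index keeps the original spellings; the query is expanded
# and the alternatives are combined with "or".
alternatives = expand("Jurgen")  # ['Jurgen', 'Jürgen']
```

With normalization the index itself loses the distinction between the
two spellings; with expansion the index is unchanged and only the
query grows, at the cost of more alternatives for longer terms.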



-- 
Dieter
___
Zope maillist  -  Zope@zope.org
http://mail.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists - 
 http://mail.zope.org/mailman/listinfo/zope-announce
 http://mail.zope.org/mailman/listinfo/zope-dev )


Re: [Zope] searching with foreign characters in the ZCatalog

2007-10-10 Thread Andreas Jung



--On 10 October 2007 15:42:05 -0400 Wohnlich, Eric (IMS) 
[EMAIL PROTECTED] wrote:



As far as I can tell, using any of the default index types that come with
Zope 2 (FieldIndex, TextIndex, whatever) if you do a search for Jurgen
it will not match Jürgen.  In my opinion, this is correct - Jurgen
is spelled incorrectly.  However, I am having some clients push hard to
have it behave differently.  From what I've looked at, I haven't seen a
way to make FieldIndex ignore or somehow manipulate special characters,
nor have I seen any products that provide a new index type to
specifically deal with this.  Does anyone know if there is something out
there to help with this?


Look at TextIndexNG3 and its normalization support.

-aj
