Hi Garrett,

Garrett Dimon wrote:
> I've setup my sphinx.yml to include all of the possible values for the
> charset_table so that accented characters will be indexed as their
> unaccented counterparts. My sphinx.conf is being generated successfully
> and appears to have the correct value for the charset_table. However,
> when searching, it appears that the accented characters are being
> ignored as if they were not explicitly added to the charset_table
> setting.
>
> As a result, words with accented characters are getting broken into
> smaller words where the accented characters are being treated as if
> they are whitespace. I can search for subsets of the word that only
> include unaccented characters, but searching for the whole word or a
> section that includes the accented characters yields no results.
>
> Is there some additional configuration that I need to do in order to
> get the charset_table preferences to work successfully, or is there
> something else that I'm overlooking or missing?
You also need to make sure your database and indexes are set to utf-8. Thinking Sphinx should set your indexes to utf-8 by default, but double check that "charset_type" in your sphinx config file is set correctly.

It's probably also worth stopping the sphinx daemon, running "ts:config", reindexing, and restarting the daemon, just to make sure your charset_table is being used.

-- James Healy <jimmy-at-deefa-dot-com> Tue, 28 Apr 2009 15:07:10 +1000
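
For reference, here is a rough sketch of what to look for in the index section of the generated sphinx config file once "ts:config" has run. The index name "article_core" and the shortened charset_table are made up for illustration, so yours will differ; the point is only that "charset_type" reads utf-8 and that the accented code points map to their plain letters.

```
# Illustrative fragment only: "article_core" is a hypothetical index name,
# and this charset_table is cut down to a handful of mappings.
index article_core
{
  # Must be utf-8, otherwise the U+XXXX entries below cannot match anything
  charset_type  = utf-8

  # Digits, lowercase letters, uppercase folded to lowercase, then
  # accented characters folded to their unaccented counterparts
  charset_table = 0..9, a..z, _, A..Z->a..z, \
                  U+C0->a, U+E0->a, U+C9->e, U+E9->e, U+F1->n
}
```

If "charset_type" shows sbcs instead, any character outside the single-byte range is treated as a separator, which would match the word-splitting described above.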
