The charset_table in sphinx.yml is set to what James suggested: charset_table: "0..9, a..z, _, A..Z->a..z, U+002C"
Quick question: When I create settings in sphinx.yml and reindex, do these get put into the generated 'development.sphinx.conf' file? Over on the Sphinx forum I have been asking about this and they say adding comma to the charset_table will have unfortunate side effects. Their suggestion is to perform a proximity search by setting: phrase_boundary=, phrase_boundary_step=100 And then search for my phrase like so: *S**trong.search "\"dear love\"~50", :match_mode => :extended2* Unfortunately, that hasn't provided results either :( On Sat, Oct 18, 2008 at 11:45 PM, Pat Allan <[EMAIL PROTECTED]>wrote: > > What's your setting for charset_table in config/sphinx.yml? > If it was just to keep commas, I would have thought the following > would do the trick: > > charset_table: U+2C > > Although, you'd probably want to keep the defaults for utf-8 as well: > > charset_table: 0..9, A..Z->a..z, _, a..z, U+410..U+42F->U+430..U+44F, U > +430..U+44F, U+2C > > Or is that what you've tried, and it's not working? > > -- > Pat > > On 19/10/2008, at 4:49 AM, sant0sk1 wrote: > > > > > I just gave this a go and seems to not have given predicted results: > > > > Now any search I perform that doesn't include a comma returns 0 > > matches and any search I perform that includes a comma resturns > > exactly 1956 results. Every time. I posted a similar question over on > > the Sphinx forum awhile back and got the following reply: > > > > Use phrase_boundary and phrase_boundary_step features to make > > commas increase the word > > position, and then perform a proximity search. > > > > Eg. set phrase_boundary=, and phrase_boundary_step=100 and then > > search for "dear love"~50 > > > > I tried that as well (by setting those in sphinx.yml) but every search > > I run that ends in "~50 returns 0 matches. > > > > Any other ideas? > > > > > > > > On Oct 8, 8:13 pm, James Healy <[EMAIL PROTECTED]> wrote: > >> sant0sk1 wrote: > >>> All three of these searches returned a match on a record with "dear, > >>> love" in the text field. Any other ideas how to get Sphinx to take > >>> notice of the commas in my indexed fields and not return matches on > >>> them? > >> > >> I suspect the Sphinx treats a comma as whitespace when indexing. To > >> change the default behaviour, you'll want to specify charset_table in > >> your sphinx.yml file. > >> > >> I have a blog post on tweaking charset_table: > http://yob.id.au/blog/2008/05/08/thinking_sphinx_and_unicode/ > >> . It > >> doesn't explicitly mention commas, but you can just add a comma to > >> the > >> list, then rebuild your indexes. > >> > >> Anything that isn't specified in the charset_table list is ignored > >> when > >> sphinx builds its indexes. At the very least, you'll probably want > >> something like: > >> > >> charset_table: "0..9, a..z, _, A..Z->a..z, U+002C" > >> > >> -- James Healy <jimmy-at-deefa-dot-com> Thu, 09 Oct 2008 12:06:57 > >> +1100 > > > > > > > > --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Thinking Sphinx" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/thinking-sphinx?hl=en -~----------~----~----~----~------~----~------~--~---
