What's your setting for charset_table in config/sphinx.yml? If it was just to keep commas, I would have thought the following would do the trick:
charset_table: U+2C Although, you'd probably want to keep the defaults for utf-8 as well: charset_table: 0..9, A..Z->a..z, _, a..z, U+410..U+42F->U+430..U+44F, U +430..U+44F, U+2C Or is that what you've tried, and it's not working? -- Pat On 19/10/2008, at 4:49 AM, sant0sk1 wrote: > > I just gave this a go and seems to not have given predicted results: > > Now any search I perform that doesn't include a comma returns 0 > matches and any search I perform that includes a comma resturns > exactly 1956 results. Every time. I posted a similar question over on > the Sphinx forum awhile back and got the following reply: > > Use phrase_boundary and phrase_boundary_step features to make > commas increase the word > position, and then perform a proximity search. > > Eg. set phrase_boundary=, and phrase_boundary_step=100 and then > search for "dear love"~50 > > I tried that as well (by setting those in sphinx.yml) but every search > I run that ends in "~50 returns 0 matches. > > Any other ideas? > > > > On Oct 8, 8:13 pm, James Healy <[EMAIL PROTECTED]> wrote: >> sant0sk1 wrote: >>> All three of these searches returned a match on a record with "dear, >>> love" in the text field. Any other ideas how to get Sphinx to take >>> notice of the commas in my indexed fields and not return matches on >>> them? >> >> I suspect the Sphinx treats a comma as whitespace when indexing. To >> change the default behaviour, you'll want to specify charset_table in >> your sphinx.yml file. >> >> I have a blog post on tweaking >> charset_table:http://yob.id.au/blog/2008/05/08/thinking_sphinx_and_unicode/ >> . It >> doesn't explicitly mention commas, but you can just add a comma to >> the >> list, then rebuild your indexes. >> >> Anything that isn't specified in the charset_table list is ignored >> when >> sphinx builds its indexes. At the very least, you'll probably want >> something like: >> >> charset_table: "0..9, a..z, _, A..Z->a..z, U+002C" >> >> -- James Healy <jimmy-at-deefa-dot-com> Thu, 09 Oct 2008 12:06:57 >> +1100 > > --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "Thinking Sphinx" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [EMAIL PROTECTED] For more options, visit this group at http://groups.google.com/group/thinking-sphinx?hl=en -~----------~----~----~----~------~----~------~--~---
