What's your setting for charset_table in config/sphinx.yml?
If it was just to keep commas, I would have thought the following  
would do the trick:

charset_table: U+2C

Although, you'd probably want to keep the defaults for utf-8 as well:

charset_table: 0..9, A..Z->a..z, _, a..z, U+410..U+42F->U+430..U+44F, U+430..U+44F, U+2C
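
As a rough illustration of how these two settings differ, here is a simplified Python model. It is not Sphinx's actual tokenizer: `tokenize` and the three character sets are made up for the example, the Cyrillic ranges are left out, and it assumes that any character missing from the table acts as a word separator.

```python
import string


def tokenize(text, word_chars):
    """Split text into keywords, treating every character that is not in
    word_chars as a separator (a simplified stand-in for charset_table)."""
    tokens, current = [], []
    for ch in text.lower():  # models the A..Z->a..z case folding
        if ch in word_chars:
            current.append(ch)
        elif current:
            tokens.append("".join(current))
            current = []
    if current:
        tokens.append("".join(current))
    return tokens


# Hypothetical stand-ins for three charset_table settings.
DEFAULT_CHARS = set(string.digits + string.ascii_lowercase + "_")
COMMA_ONLY = {","}                  # like charset_table: U+2C
WITH_COMMA = DEFAULT_CHARS | {","}  # defaults plus U+2C

text = "Dear, love"
print(tokenize(text, DEFAULT_CHARS))  # ['dear', 'love']
print(tokenize(text, COMMA_ONLY))     # [',']
print(tokenize(text, WITH_COMMA))     # ['dear,', 'love']
```

If Sphinx behaves like this model, a table containing only U+2C would index nothing but commas, so a query without a comma would have no keywords to match. With the comma added to the defaults, "dear," becomes a single keyword that is distinct from "dear".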

Or is that what you've tried, and it's not working?

-- 
Pat

On 19/10/2008, at 4:49 AM, sant0sk1 wrote:

>
> I just gave this a go, and it doesn't seem to have given the expected results:
>
> Now any search I perform that doesn't include a comma returns 0
> matches, and any search I perform that includes a comma returns
> exactly 1956 results. Every time. I posted a similar question over on
> the Sphinx forum a while back and got the following reply:
>
>    Use phrase_boundary and phrase_boundary_step features to make
>    commas increase the word position, and then perform a proximity
>    search.
>
>    Eg. set phrase_boundary=, and phrase_boundary_step=100 and then
>    search for "dear love"~50
>
> I tried that as well (by setting those in sphinx.yml) but every search
> I run that ends in "~50 returns 0 matches.
>
> Any other ideas?
>
>
>
> On Oct 8, 8:13 pm, James Healy <[EMAIL PROTECTED]> wrote:
>> sant0sk1 wrote:
>>> All three of these searches returned a match on a record with "dear,
>>> love" in the text field. Any other ideas how to get Sphinx to take
>>> notice of the commas in my indexed fields and not return matches on
>>> them?
>>
>> I suspect Sphinx treats a comma as whitespace when indexing. To
>> change the default behaviour, you'll want to specify charset_table in
>> your sphinx.yml file.
>>
>> I have a blog post on tweaking charset_table:
>> http://yob.id.au/blog/2008/05/08/thinking_sphinx_and_unicode/
>> It doesn't explicitly mention commas, but you can just add a comma to
>> the list, then rebuild your indexes.
>>
>> Anything that isn't specified in the charset_table list is ignored
>> when Sphinx builds its indexes. At the very least, you'll probably want
>> something like:
>>
>> charset_table: "0..9, a..z, _, A..Z->a..z, U+002C"
>>
>> -- James Healy <jimmy-at-deefa-dot-com>  Thu, 09 Oct 2008 12:06:57 +1100

