The charset_table in sphinx.yml is set to what James suggested:
charset_table: "0..9, a..z, _, A..Z->a..z, U+002C"

Quick question: When I create settings in sphinx.yml and reindex, do these
get put into the generated 'development.sphinx.conf' file?

Over on the Sphinx forum I have been asking about this and they say adding
comma to the charset_table will have unfortunate side effects. Their
suggestion is to perform a proximity search by setting:

phrase_boundary=,
phrase_boundary_step=100


And then search for my phrase like so: *S**trong.search "\"dear love\"~50",
:match_mode => :extended2*

Unfortunately, that hasn't provided results either :(


On Sat, Oct 18, 2008 at 11:45 PM, Pat Allan <[EMAIL PROTECTED]>wrote:

>
> What's your setting for charset_table in config/sphinx.yml?
> If it was just to keep commas, I would have thought the following
> would do the trick:
>
> charset_table: U+2C
>
> Although, you'd probably want to keep the defaults for utf-8 as well:
>
> charset_table: 0..9, A..Z->a..z, _, a..z, U+410..U+42F->U+430..U+44F, U
> +430..U+44F, U+2C
>
> Or is that what you've tried, and it's not working?
>
> --
> Pat
>
> On 19/10/2008, at 4:49 AM, sant0sk1 wrote:
>
> >
> > I just gave this a go and seems to not have given predicted results:
> >
> > Now any search I perform that doesn't include a comma returns 0
> > matches and any search I perform that includes a comma resturns
> > exactly 1956 results. Every time. I posted a similar question over on
> > the Sphinx forum awhile back and got the following reply:
> >
> >    Use phrase_boundary and phrase_boundary_step features to make
> > commas increase the word
> >    position, and then perform a proximity search.
> >
> >    Eg. set phrase_boundary=, and phrase_boundary_step=100 and then
> > search for "dear love"~50
> >
> > I tried that as well (by setting those in sphinx.yml) but every search
> > I run that ends in "~50 returns 0 matches.
> >
> > Any other ideas?
> >
> >
> >
> > On Oct 8, 8:13 pm, James Healy <[EMAIL PROTECTED]> wrote:
> >> sant0sk1 wrote:
> >>> All three of these searches returned a match on a record with "dear,
> >>> love" in the text field. Any other ideas how to get Sphinx to take
> >>> notice of the commas in my indexed fields and not return matches on
> >>> them?
> >>
> >> I suspect the Sphinx treats a comma as whitespace when indexing. To
> >> change the default behaviour, you'll want to specify charset_table in
> >> your sphinx.yml file.
> >>
> >> I have a blog post on tweaking charset_table:
> http://yob.id.au/blog/2008/05/08/thinking_sphinx_and_unicode/
> >> . It
> >> doesn't explicitly mention commas, but you can just add a comma to
> >> the
> >> list, then rebuild your indexes.
> >>
> >> Anything that isn't specified in the charset_table list is ignored
> >> when
> >> sphinx builds its indexes. At the very least, you'll probably want
> >> something like:
> >>
> >> charset_table: "0..9, a..z, _, A..Z->a..z, U+002C"
> >>
> >> -- James Healy <jimmy-at-deefa-dot-com>  Thu, 09 Oct 2008 12:06:57
> >> +1100
> > >
>
>
> >
>

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups 
"Thinking Sphinx" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to [EMAIL PROTECTED]
For more options, visit this group at 
http://groups.google.com/group/thinking-sphinx?hl=en
-~----------~----~----~----~------~----~------~--~---

Reply via email to