I'm working on a site that has articles in multiple languages. We're using sphinx/thinking sphinx for our search. If possible, we'd like to provide stemming support for as many languages as possible. I noticed in Pat's latest blog post[1] that TS now supports having multiple index on the same model. Ideally, I'd like to be able to define a separate index for each language (or at least as many languages as we have stemmers for--we'll try the snowball stemmers), with each index configured with the appropriate stemmer and a where clause that limits it to articles in the given language. When searching, we would specify the appropriate index for the current user's language, and it should (theoretically) all work.
The part I can't figure out is how to specify a different stemmer for different indexes. As far as I can tell, TS currently only supports configuring the stemmer globally, for all indexes, in your config/ sphinx.yml file. Is there a way to specify the stemmer for each index? If not, is this feature planned? If it's not too difficult to implement, and someone can point me in the right direction, I'd be willing to take a stab at implementing it. Barring that, I'm thinking I may be able to monkey patch the TS rake task that generates the sphinx conf file to allow this. Any suggestions or direction anyone can provide would me much appreciated. Thanks, Myron [1] http://freelancing-gods.com/posts/a_month_in_the_life_of_thinking_sphinx -- You received this message because you are subscribed to the Google Groups "Thinking Sphinx" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/thinking-sphinx?hl=en.
