Re: Multi language search help

Grant Ingersoll Thu, 18 Dec 2008 20:12:04 -0800


On Dec 18, 2008, at 6:25 AM, Sujatha Arun wrote:

Hi,
I am prototyping language search using Solr 1.3. I have 3 fields in the
schema: id, content and language.
I am indexing 3 PDF files; the languages are foroyo, Chinese and Japanese.
I use xpdf to convert the content of each PDF to text and push the text to Solr
in the content field.

Which analyzer do I need to use for the above?
Using the default text analyzer and posting this content to Solr, I am
not getting any results.

Does Solr support stemming for the above languages?

I'm not familiar with Foroyo, but there should be tokenizers/analysis available for Chinese and Japanese. Are you putting all three languages into the same field? If that is the case, you will need some type of language detection piece that can choose the correct analyzer.

How are your users searching? That is, do you know the language they want to search in? If so, then you can have a field for each language.
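
As a rough illustration of the field-per-language idea, here is a minimal Python sketch that guesses the language from the Unicode script of the text and puts the content into a language-specific field before it is posted. Everything in it is an assumption for illustration: the function names, the content_ja / content_zh / content_other field names, and the script heuristic itself, which is not a real language detector (Japanese text written only in kanji would be labelled Chinese, and it says nothing useful about Foroyo).

```python
def guess_language(text):
    """Very rough script-based guess: returns 'ja', 'zh', or 'other'."""
    has_han = False
    for ch in text:
        cp = ord(ch)
        # Hiragana (U+3040-U+309F) and Katakana (U+30A0-U+30FF) imply Japanese.
        if 0x3040 <= cp <= 0x30FF:
            return "ja"
        # CJK Unified Ideographs are shared by Chinese and Japanese,
        # so only remember that we saw one and keep scanning for kana.
        if 0x4E00 <= cp <= 0x9FFF:
            has_han = True
    return "zh" if has_han else "other"


def to_solr_doc(doc_id, text):
    """Build a document dict with one content field per language.

    The field names (content_ja, content_zh, content_other) are
    hypothetical; each would need its own field type and analyzer
    defined in schema.xml.
    """
    lang = guess_language(text)
    return {"id": doc_id, "language": lang, "content_" + lang: text}
```

At query time, if the user's language is known, the search would go against the matching field only (for example content_ja), so the query text is analyzed the same way as the indexed text.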


-Grant
