Yeah, Thai and Arabic have the stuff in Solr 1.4
For Chinese, if you want to do CJK bigram indexing, this is there too.
If you want to do word-based "smart" indexing, you need to add an additional
jar file to your classpath.

we can add a wiki page with examples of how to use these maybe to make it
easier?

we could also add notes to new ones in lucene (hindi, czech, bulgarian,
etc), as it might be easier to copy some code around and get them working
with solr 1.4 than to write your own!

separately, would you be interesting in helping with Bengali and Marathi?

On Thu, Feb 25, 2010 at 10:48 AM, Gora Mohanty <g...@srijan.in> wrote:

> On Thu, 25 Feb 2010 07:54:06 -0500
> Robert Muir <rcm...@gmail.com> wrote:
>
> > Gora, I wonder perhaps if there is a documentation issue.
> >
> > e.g. Thai, Arabic, Chinese were mentioned here previously, these
> > are all supported, too.
> >
> > Let me know if you have any ideas!
>
> Sorry, are you saying that these are available in Solr 1.4?
> If so, I have definitely fallen down on my reading.
>
> If they are available in Lucene, but not in Solr, that is still
> helpful, but it will take me a little while before I can devote
> enough time to set up access to Lucene directly.
>
> In any case, getting Indian languages (and, also down the road
> other languages) working is an area that I am definitely interested
> in.
>
> Regards,
> Gora
>



-- 
Robert Muir
rcm...@gmail.com

Reply via email to