Re: Lucene for Indian Languages

Praveen Peddi Mon, 23 Aug 2004 10:47:34 -0700

In fact, CJKAnalyzer also works well with Indian languages. Since CJKAnalyzer
treats multibyte characters as a special case, it works with most Asian
multibyte scripts. I introduced CJKAnalyzer for Japanese text search, and we
also tested it with Hindi and Telugu. All our search test cases passed.
Give CJKAnalyzer a try. You will find it a better analyzer than
StandardAnalyzer (for any Asian language).
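
For anyone wondering why an analyzer written for CJK text would also cope
with Hindi or Telugu, here is a minimal sketch of the idea, assuming (as far
as I understand it) that the CJK tokenizer emits overlapping two-character
tokens for runs of non-Latin characters and leaves Latin words whole. This is
plain Java with no Lucene dependency; BigramSketch and tokenize are
hypothetical names for illustration, not Lucene's API, and the real tokenizer
differs in its details.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Illustration only: a simplified bigram tokenizer, NOT Lucene's CJKTokenizer.
public class BigramSketch {

    // Splits on whitespace, keeps ASCII runs as single lowercased tokens,
    // and breaks non-ASCII runs into overlapping two-character tokens.
    // Works on UTF-16 chars, so it assumes BMP text (true for Devanagari
    // and Telugu). Punctuation handling is deliberately left out.
    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c)) {
                i++;
                continue;
            }
            boolean ascii = c < 128;
            int start = i;
            // Extend the run while characters stay in the same class.
            while (i < text.length()
                    && !Character.isWhitespace(text.charAt(i))
                    && (text.charAt(i) < 128) == ascii) {
                i++;
            }
            String run = text.substring(start, i);
            if (ascii) {
                tokens.add(run.toLowerCase(Locale.ROOT));
            } else if (run.length() == 1) {
                tokens.add(run);
            } else {
                for (int j = 0; j + 1 < run.length(); j++) {
                    tokens.add(run.substring(j, j + 2));
                }
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // "Lucene" followed by the Hindi word for "Hindi", written with escapes.
        System.out.println(tokenize("Lucene \u0939\u093F\u0902\u0926\u0940"));
    }
}
```

Because the bigram step never needs to know where a word starts or ends in
the script, the same code path serves any non-Latin text, as long as the
index and the query are analyzed the same way.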


Praveen

----- Original Message ----- 
From: "Satish Kagathare" <[EMAIL PROTECTED]>
To: "Lucene Users List" <[EMAIL PROTECTED]>
Sent: Monday, August 23, 2004 9:20 AM
Subject: Re: Lucene for Indian Languages


>
> Hi Srinivasa,
>
> Use StandardAnalyzer for indexing and for parsing queries on Indian-language
> docs. It will work. Right now we are searching Hindi and Marathi text,
> but without specific stemmers and filters. We are planning to develop
> a Marathi morphological analyzer.
>
> Thanks,
> Satish.
>
> On Sun, 22 Aug 2004, srinivasa raghavan wrote:
>
> > Hi all,
> >
> > Has the Lucene API been implemented for Indian contexts? I know
> > that Lucene has stemmers and filters for German and
> > Russian. I would like to know whether there
> > are stemmers and filters available or being developed for
> > Indian languages.
> >
> > Thanks,
> > Rahavan.
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > For additional commands, e-mail: [EMAIL PROTECTED]
> >
>
>
>
>


