Re: Lucene for Indian Languages

2004-08-23 Thread Satish Kagathare
Hi,Srinivasa, Use StandardAnaylzer for indexing and parsing query for Indian Lang. docs. It will work. Right now we r searching on Hindi,Marathi but without specific stemmers and filters. We r plannig to develop Marathi Morphological Analyzer. Thanks, Satish. On Sun, 22 Aug 2004, srinivasa r

Devnagari Search?

2004-06-09 Thread Satish Kagathare
Any one have built lucene for Devnagari UNICODE search? PLZ help me wht kind of changes i have to do in lucene. Also if any one have built StandardTokenizer,Analyzer,Stemmer,Indexer ,queryParser for Hindi & Marathi Plz let me know. Thanks, Satish. -

Does Lucene support UNICODE?

2004-06-07 Thread Satish Kagathare
Hello, Does Lucene support UNICODE search and indexing of UNICODE data(especially..Devnagari unicode data)? Does it make any difference between utf-8 & utf-16 unicode docs? Bcoz java strings supports utf-16. Bcoz i tried indexing(using indexFiles & indexHTML from lucene Demo) devnagari uni da