Partial word search with unicode contents

2002-06-04 Thread Harpreet S Walia
Hi, We are using lucene to index and search unicode(utf-8) contents in devnagari(hindi) language . What we have observed is that our query fetches results which have partial word match . i.e if it were english then a query india would relurn words like indian , southindia and so on. Is there a

Re: Partial word search with unicode contents

2002-06-04 Thread Peter Carlson
Lucene will look for exact matches at its base. However, between the query string and actually matching searches there is an analyzer that may manipulate the query. You may have to create an devnagari(hindi) which correctly tokenizes the terms. Not that Lucene saves all terms in unicode and will

Re: Partial word search with unicode contents

2002-06-04 Thread Otis Gospodnetic
Hello, A query for india should not be returning southindia (one word). It sounds like something else is happening in your application. Otis --- Harpreet S Walia [EMAIL PROTECTED] wrote: Hi, We are using lucene to index and search unicode(utf-8) contents in devnagari(hindi) language .