Hi,
We are using lucene to index and search unicode(utf-8) contents in
devnagari(hindi) language .
What we have observed is that our query fetches results which have partial
word match . i.e if it were english then a query india would relurn words
like
indian , southindia and so on.
Is there a
Lucene will look for exact matches at its base. However, between the query
string and actually matching searches there is an analyzer that may
manipulate the query. You may have to create an devnagari(hindi) which
correctly tokenizes the terms.
Not that Lucene saves all terms in unicode and will
Hello,
A query for india should not be returning southindia (one word).
It sounds like something else is happening in your application.
Otis
--- Harpreet S Walia [EMAIL PROTECTED] wrote:
Hi,
We are using lucene to index and search unicode(utf-8) contents in
devnagari(hindi) language .