Hi,

I would suggest testing your analyzers directly, something like:

>>StringReader reader = new StringReader("your khmer text");
>>TokenStream stream = analyzer.tokenStream("content", reader);

Iterate through the TokenStream and check whether the analyzed tokens are correct!

That's how I test my analyzers/tokenization/filtering without the overhead of indexing, searching, etc.
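
For anyone who wants to try the check described above, here is a minimal sketch of the iteration step. It assumes Lucene 5 or later on the classpath and uses the attribute-based TokenStream API (reset / incrementToken / end); the Lucene 2.x releases current when this thread was written looped on stream.next() until it returned null instead. The class name AnalyzerCheck and the helper tokens() are made up for illustration, and StandardAnalyzer is only a stand-in for whichever analyzer you index with.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.lucene.analysis.Analyzer;
import org.apache.lucene.analysis.TokenStream;
import org.apache.lucene.analysis.standard.StandardAnalyzer;
import org.apache.lucene.analysis.tokenattributes.CharTermAttribute;

// Hypothetical helper class, not part of Lucene.
public class AnalyzerCheck {

    /** Runs the text through the analyzer and returns the terms it emits, in order. */
    static List<String> tokens(Analyzer analyzer, String field, String text) throws IOException {
        List<String> terms = new ArrayList<>();
        try (TokenStream stream = analyzer.tokenStream(field, text)) {
            CharTermAttribute term = stream.addAttribute(CharTermAttribute.class);
            stream.reset();                  // must be called before the first incrementToken()
            while (stream.incrementToken()) {
                terms.add(term.toString());  // copy the term; the attribute is reused per token
            }
            stream.end();                    // signals end of stream before close()
        }
        return terms;
    }

    public static void main(String[] args) throws IOException {
        // Replace the sample string with real Khmer text and the analyzer with your own.
        try (Analyzer analyzer = new StandardAnalyzer()) {
            System.out.println(tokens(analyzer, "content", "your khmer text"));
        }
    }
}
```

If the printed list is empty, or the tokens are not the units you expect to search for, the problem lies in the analysis step rather than in the indexing or query code.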

Bests

hannes

Zsolt Czinkos wrote:
Hello

From the API:

"public class StandardAnalyzer
extends Analyzer

Filters StandardTokenizer with StandardFilter, LowerCaseFilter and
StopFilter, using a list of English stop words."


Are you sure that these filters won't filter your Khmer characters out?
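
One way to get a feel for this question without involving Lucene at all is to look at how Java itself classifies the characters in the text. The sketch below uses only the JDK; the class name KhmerCodePoints and its describe() helper are made up for illustration. It shows that Khmer consonants count as letters while dependent vowel signs are combining marks, so any tokenizer that splits on "non-letters" would cut Khmer words apart. Whether StandardTokenizer actually behaves that way depends on its grammar in your Lucene version; this only shows what to look for.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical diagnostic class; uses only the JDK.
public class KhmerCodePoints {

    /** Describes each code point: hex value, Unicode block, and whether Java treats it as a letter. */
    static List<String> describe(String text) {
        List<String> out = new ArrayList<>();
        text.codePoints().forEach(cp -> out.add(String.format(
                "U+%04X %s letter=%b",
                cp, Character.UnicodeBlock.of(cp), Character.isLetter(cp))));
        return out;
    }

    public static void main(String[] args) {
        // U+1797 U+17B6 U+179F U+17B6: a short Khmer word, written with escapes
        // so the source file does not depend on its own encoding.
        String sample = "\u1797\u17B6\u179F\u17B6";
        describe(sample).forEach(System.out::println);
    }
}
```

For the sample string, the consonants U+1797 and U+179F are reported with letter=true and the vowel sign U+17B6 with letter=false, all in the KHMER block.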


Best,

czinkos


On Wed, Jan 24, 2007 at 05:29:03PM +0700, Fournaux Nicolas wrote:
Good morning all (or good afternoon)

I have used Lucene many times before to search text in French or English,
and it all worked fine :-)

But now I have a new challenge: I need to use Lucene with Khmer (Khmer is
the language of Cambodia; its script resembles Thai or Indian scripts).

But it doesn’t work: my code runs without errors but finds no results. My
code is below.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



