Hi,
I would suggest to perform a Test with your Analyzers, something like:
>>StringReader reader = new StringReader(new String("your khmer text"));
>>TokenStream stream = analyzer.tokenStream("content", reader);
Iterate through the TokenStream and check wether the analyzed Tokens are
correct!
Thats the way I test my analyzers/tokenization/filtering without the
overhead of indexing, search etc.
Bests
hannes
Zsolt Czinkos schrieb:
Hello
>From the API:
"public class StandardAnalyzer
extends Analyzer
Filters StandardTokenizer with StandardFilter, LowerCaseFilter and
StopFilter, using a list of English stop words."
Are you sure that these filters won't filter your Khmer characters out?
Best,
czinkos
On Wed, Jan 24, 2007 at 05:29:03PM +0700, Fournaux Nicolas wrote:
Good morning all (or good afternoon)
I used Lucene many times before, to search text in French Or English. All
worked fine :-)
But now I have a new challenge, I need to use Lucene with Khmer (Khmer is
the Cambodia’s language, it looks like Thai or Indian)
But it doesn’t work, my code is well executed but it found no results, I
give you my code below
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]