Re: Lucene with Khmer ? (Language in cambodia)

hannes Wed, 24 Jan 2007 03:17:42 -0800

Hi,

I would suggest to perform a Test with your Analyzers, something like:


>>StringReader reader = new StringReader(new String("your khmer text"));
>>TokenStream stream = analyzer.tokenStream("content", reader);

Iterate through the TokenStream and check wether the analyzed Tokens arecorrect!

Thats the way I test my analyzers/tokenization/filtering without theoverhead of indexing, search etc.


Bests

hannes

Zsolt Czinkos schrieb:

Hello

>From the API:

"public class StandardAnalyzer
extends Analyzer

Filters StandardTokenizer with StandardFilter, LowerCaseFilter and
StopFilter, using a list of English stop words."


Are you sure that these filters won't filter your Khmer characters out?


Best,

czinkos


On Wed, Jan 24, 2007 at 05:29:03PM +0700, Fournaux Nicolas wrote:

Good morning all (or good afternoon)

I used Lucene many times before, to search text in French Or English. All
worked fine :-)

But now I have a new challenge, I need to use Lucene with Khmer (Khmer is
the Cambodia’s language, it looks like Thai or Indian)

But it doesn’t work, my code is well executed but it found no results, I
give you my code below


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Lucene with Khmer ? (Language in cambodia)

Reply via email to