I'm not sure what you mean.

CLucene StandardTokenizer is meant for internal use only, and provides the
calling Analyzer with a stream of identified tokens (it classifies the
tokens, not just tokenizes them).

The ICU tokenizer is a general purpose tokenizer (like Boost's
implementation is), with loads of extra functionality the CLucene one
doesn't have or need.

Itamar.

-----Original Message-----
From: Paul J. Lucas [mailto:p...@lucasmail.org] 
Sent: Tuesday, February 09, 2010 8:24 PM
To: clucene-developers@lists.sourceforge.net
Subject: [CLucene-dev] CLucene tokenizer vs ICU tokenizer

How do they compare?

- Paul

----------------------------------------------------------------------------
--
The Planet: dedicated and managed hosting, cloud storage, colocation Stay
online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
_______________________________________________
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers



------------------------------------------------------------------------------
SOLARIS 10 is the OS for Data Centers - provides features such as DTrace,
Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW
http://p.sf.net/sfu/solaris-dev2dev
_______________________________________________
CLucene-developers mailing list
CLucene-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/clucene-developers

Reply via email to