On Feb 17, 2004, at 6:53 AM, [EMAIL PROTECTED] wrote:
On Monday 16 February 2004 20:56, Erik Hatcher wrote:
On Feb 16, 2004, at 9:50 AM, [EMAIL PROTECTED] wrote:
TokenStream in = new WhitespaceAnalyzer().tokenStream("contents", new
StringReader(doc.getField("contents").stringValue()));

The field is the field name. No built-in analyzers use it, but custom analyzers could key off of it to do field-specific analysis. Look at

If I want to tokenize all Fields I would have to get a tokenStream of each
Field seperately and process them seperately? Or can I get one "master
stream" that compounds all Fields?

You would do them separately. I'm not clear on what you are trying to do. The Analyzer does all this during indexing automatically for you, but it sounds like you are just trying to emulate what an Analyzer already does to extract words from text?


Erik


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to