Multiple fields derived from same source text?

2011-08-18 Thread Graham Sugden
Hi, I am just beginning to implement text indexation for an application I am building and am not quite sure of a few things. The documents indexed will be in various languages, ranging mostly from short notes to ~20 page articles (with the occaisional book length). And so my plan is to have separa

Re: Multiple fields derived from same source text?

2011-08-21 Thread Graham Sugden
Closed! TeeSinkTokenFilter and CachingTokenFilter seem to provide the functionality/code examples I was looking for. Thanks, graham. -- Forwarded message -- From: Graham Sugden Date: Thu, Aug 18, 2011 at 5:23 PM Subject: Multiple fields derived from same source text? To: java

Re: Analysis

2011-08-22 Thread Graham Sugden
Caveat to the below is that I am very new to lucene. (That said though, following the below strategy, after a couple of days work I have a set of per field analyzers for various languages, using various custom filters, caching of initial analysis; and capable of outputting stemmed, reversed, diacri