Hi,
I am just beginning to implement text indexation for an application I am
building and am not quite sure of a few things. The documents indexed will
be in various languages, ranging mostly from short notes to ~20 page
articles (with the occaisional book length). And so my plan is to have
separa
Closed! TeeSinkTokenFilter and CachingTokenFilter seem to provide the
functionality/code examples I was looking for.
Thanks, graham.
-- Forwarded message --
From: Graham Sugden
Date: Thu, Aug 18, 2011 at 5:23 PM
Subject: Multiple fields derived from same source text?
To: java
Caveat to the below is that I am very new to lucene. (That said though,
following the below strategy, after a couple of days work I have a set of
per field analyzers for various languages, using various custom filters,
caching of initial analysis; and capable of outputting stemmed, reversed,
diacri