- Original Message -
From: William Wong
To: Lucene-user
Sent: Friday, October 05, 2001 5:12 PM
Subject: RE: Lucene has moved to Jakarta
How about adding filters for different file types such as
-HTML (there is one in the demo already)
-XML
-PDF
-MsWord/RTF
I'm interested in exploring deeper if Lucene is suitable for one of our projects and
would like to leverage specialist expertise. Is there any forum where specialist
Lucene expertize (evaluation and dev support) can be sought on commercial terms.
not sure if this is the right forum to post
We're having a heck of a time with too many file handles around here. When
we create large indexes, we often get thousands of temporary files in a
given index! Even worse, we just plain run out of file handles--even on
boxes where we've upped the limits as much as we think we can! We've played
From: Scott Ganyo [mailto:[EMAIL PROTECTED]]
We're having a heck of a time with too many file handles
around here. When
we create large indexes, we often get thousands of temporary
files in a given index!
Thousands, eh? That seems high.
The maximum number of segments should be
We are experimenting with XML-aware indexing. The approach we're trying
is to index every element in a given XML document as a separate Lucene
document along with a another Lucene document that captures just the
concatenated text content of the document (to handle searching for
phrases across
Doug wrote:
I'm having trouble getting a clear picture of your indexing scheme.
I've been doing a lot of thinking about this same problem, so I
may be a little more in tune with what Elliot's saying. By the way,
Elliot, I'm very interested in your results. I considered the basic