Interest in Lucene specialists..

2001-10-11 Thread Sunny Kapoor \(SunKap\)
- Original Message - From: William Wong To: Lucene-user Sent: Friday, October 05, 2001 5:12 PM Subject: RE: Lucene has moved to Jakarta How about adding filters for different file types such as -HTML (there is one in the demo already) -XML -PDF -MsWord/RTF

Re: Interest in Lucene specialists..

2001-10-11 Thread Sunny Kapoor \(SunKap\)
I'm interested in exploring deeper if Lucene is suitable for one of our projects and would like to leverage specialist expertise. Is there any forum where specialist Lucene expertize (evaluation and dev support) can be sought on commercial terms. not sure if this is the right forum to post

File Handles issue

2001-10-11 Thread Scott Ganyo
We're having a heck of a time with too many file handles around here. When we create large indexes, we often get thousands of temporary files in a given index! Even worse, we just plain run out of file handles--even on boxes where we've upped the limits as much as we think we can! We've played

RE: File Handles issue

2001-10-11 Thread Doug Cutting
From: Scott Ganyo [mailto:[EMAIL PROTECTED]] We're having a heck of a time with too many file handles around here. When we create large indexes, we often get thousands of temporary files in a given index! Thousands, eh? That seems high. The maximum number of segments should be

Index Optimization: Which is Better?

2001-10-11 Thread W. Eliot Kimber
We are experimenting with XML-aware indexing. The approach we're trying is to index every element in a given XML document as a separate Lucene document along with a another Lucene document that captures just the concatenated text content of the document (to handle searching for phrases across

Re: Index Optimization: Which is Better?

2001-10-11 Thread Steven J. Owens
Doug wrote: I'm having trouble getting a clear picture of your indexing scheme. I've been doing a lot of thinking about this same problem, so I may be a little more in tune with what Elliot's saying. By the way, Elliot, I'm very interested in your results. I considered the basic