Re: Stress/scalability testing Lucene

2002-11-21 Thread Karl Øie
I have a index that is compiled each night that indexes 1,3gb with XML data that results into a 1,4gb index. The index takes about 11 hours to build on a dual 700mhz xeon processor with 768mb of ram. The index contains 4.388.730 documents and 953.632 terms. Mvh karl øie On Thursday, Nov 21, 2

Re: Stress/scalability testing Lucene

2002-11-21 Thread Ian Lea
Otis I could do some tests on a solaris and/or intel linux server that have got about 10 to 15Gb disk space to play with. Neither has huge amounts of RAM (512 and 256Mb respectively) but that is probably enough. Neither server is doing much else most of the time. If that would help would you li

Re: Stress/scalability testing Lucene

2002-11-20 Thread Doug Cutting
writing at the same time? I thought I read this in the FAQ. Roy. -Original Message- From: Doug Cutting [mailto:[EMAIL PROTECTED]] Sent: Wednesday, November 20, 2002 5:04 PM To: Lucene Users List Subject: Re: Stress/scalability testing Lucene * Replies will be sent through Spamex to [EMAIL

RE: Stress/scalability testing Lucene

2002-11-20 Thread Otis Gospodnetic
> thought I read this in the FAQ. > > Roy. > > -Original Message- > From: Doug Cutting [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, November 20, 2002 5:04 PM > To: Lucene Users List > Subject: Re: Stress/scalability testing Lucene > > > * Replies w

RE: Stress/scalability testing Lucene

2002-11-20 Thread roy-lucene-user
PM To: Lucene Users List Subject: Re: Stress/scalability testing Lucene * Replies will be sent through Spamex to [EMAIL PROTECTED] * For additional info click -> http://www.spamex.com/i/?v=886513 Justin Greene wrote: > We created a thread pool to read and parse the email > messages. 1

Re: Stress/scalability testing Lucene

2002-11-20 Thread Doug Cutting
Justin Greene wrote: We created a thread pool to read and parse the email messages. 10 threads seems to be the magic number here for us. We then created a queue of messages to be indexed onto which we push the parsed messages and have a single thread adding messages to the index. IndexWriter.a

RE: Stress/scalability testing Lucene

2002-11-20 Thread Justin Greene
D]' > Subject: RE: Stress/scalability testing Lucene > > We wrote an Lucene based indexer that we are using to index > MailDir email > boxes. Each file is an individual email message and they -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

RE: Stress/scalability testing Lucene

2002-11-20 Thread Justin Greene
aving a second CPU makes a major difference in performance. Justin > -Original Message- > From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, November 20, 2002 12:09 PM > To: [EMAIL PROTECTED] > Cc: [EMAIL PROTECTED] > Subject: Stress/scalability testi

Stress/scalability testing Lucene

2002-11-20 Thread Otis Gospodnetic
Hello, Has anyone tested Lucene for scalability? I know that some peple have indices with 10M+ documents in it, but has anyone tried going beyond there, to 50M, 100M, 500M or more documents? (I know the size of the index and performance of searches depends on documents, number of fields, field typ