Scott
On Feb 10, 2004, at 7:55 AM, Otis Gospodnetic wrote:
--- Leo Galambos <[EMAIL PROTECTED]> wrote:
Otis Gospodnetic wrote:
Without seeing more information/code, I can't tell which part of your system slows down with time, but I can tell you that Lucene's 'add' does not slow over time (i.e. as the index gets larger). Therefore, I would look elsewhere for causes of the slowdown.
Otis, can you point me to some proof that the time of the "insert" operation does not depend on the index size, please? The amortized time of "insert" is O(log(docsIndexed/mergeFac)), I think.
This would imply that Lucene gets slower as it adds more documents to the index. Have you observed this behaviour? I haven't.
Thus I do not know how it could be O(1).
~ O(1) is what I have observed through experiments with indexing of several million documents.
Otis
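The two estimates above can be compared with a toy model. The following is only a sketch under stated assumptions, not Lucene's actual merge code: it assumes every added document starts as its own one-document segment, and that whenever mergeFactor segments of the same size exist they are merged into one larger segment, rewriting every document in it once. The class and method names (MergeCostSketch, totalMergeCopies) are made up for illustration.

```java
class MergeCostSketch {
    // Counts how many document copies merges perform while numDocs documents
    // are added one at a time under the simplified merge rule described above.
    static long totalMergeCopies(int numDocs, int mergeFactor) {
        // segmentsAtLevel.get(k) = number of segments holding mergeFactor^k docs
        java.util.List<Integer> segmentsAtLevel = new java.util.ArrayList<>();
        long copies = 0;
        for (int d = 0; d < numDocs; d++) {
            int level = 0;
            long size = 1; // the new document arrives as a one-doc segment
            while (true) {
                if (segmentsAtLevel.size() <= level) {
                    segmentsAtLevel.add(0);
                }
                int count = segmentsAtLevel.get(level) + 1;
                if (count < mergeFactor) {
                    segmentsAtLevel.set(level, count);
                    break;
                }
                // mergeFactor segments at this level: merge them into one
                // segment at the next level, rewriting each document once.
                segmentsAtLevel.set(level, 0);
                size *= mergeFactor;
                copies += size;
                level++;
            }
        }
        return copies;
    }

    public static void main(String[] args) {
        int mergeFactor = 10;
        for (int n : new int[] {1000, 1000000}) {
            long copies = totalMergeCopies(n, mergeFactor);
            System.out.println(n + " docs: " + (double) copies / n
                    + " merge copies per doc");
        }
    }
}
```

Under this model, a merge factor of 10 gives 3 merge copies per document at 1,000 documents and 6 at 1,000,000. That matches the logarithmic estimate, yet the growth is slow enough that it could plausibly look close to constant in an experiment, especially if per-document parsing dominates the cost.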
AFAIK the issue with PDF files can be caused by the PDF parser (I already encountered this with PDFBox).
The easiest thing to do is add logging to suspicious portions of the code. That will narrow the scope of the code you need to analyze.
Otis
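One way to follow the logging suggestion is to time the parsing step and the index-add step separately, so it becomes visible which one grows as more files are processed. The sketch below assumes Java 8 or later; StepTimer and Step are hypothetical helpers written for this example, not part of Lucene or PDFBox, and the two timed lambdas are stand-ins for the real parse and writer.addDocument(doc) calls.

```java
class StepTimer {
    // Hypothetical functional interface so timed code may throw checked exceptions.
    interface Step<T> {
        T run() throws Exception;
    }

    private final java.util.Map<String, Long> totalNanos = new java.util.LinkedHashMap<>();
    private final java.util.Map<String, Integer> calls = new java.util.LinkedHashMap<>();

    // Runs the step, records its elapsed time under the given label,
    // and returns the step's result.
    <T> T time(String label, Step<T> work) {
        long start = System.nanoTime();
        try {
            return work.run();
        } catch (Exception e) {
            throw new RuntimeException("step '" + label + "' failed", e);
        } finally {
            long elapsed = System.nanoTime() - start;
            totalNanos.merge(label, elapsed, Long::sum);
            calls.merge(label, 1, Integer::sum);
        }
    }

    int calls(String label) {
        return calls.getOrDefault(label, 0);
    }

    // One line per label: number of calls and average time in milliseconds.
    String report() {
        StringBuilder sb = new StringBuilder();
        for (java.util.Map.Entry<String, Long> e : totalNanos.entrySet()) {
            int n = calls.get(e.getKey());
            sb.append(String.format("%s: %d calls, avg %.3f ms%n",
                    e.getKey(), n, e.getValue() / 1e6 / n));
        }
        return sb.toString();
    }

    void reset() {
        totalNanos.clear();
        calls.clear();
    }

    public static void main(String[] args) {
        StepTimer timer = new StepTimer();
        String[] fileNames = {"a.pdf", "b.pdf", "c.pdf"};
        for (int i = 0; i < fileNames.length; i++) {
            String name = fileNames[i];
            // Stand-in for the PDF parsing step.
            String doc = timer.time("parse", () -> name.toUpperCase());
            // Stand-in for writer.addDocument(doc).
            timer.time("add", () -> doc.length());
            // Report per batch so a slowdown over time shows up in the log.
            if ((i + 1) % 100 == 0) {
                System.out.print(timer.report());
                timer.reset();
            }
        }
        System.out.print(timer.report());
    }
}
```

If the average for "parse" climbs from batch to batch while "add" stays flat, the parser rather than the index is the place to look, and vice versa.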
--- [EMAIL PROTECTED] wrote:
Hey Lucene-users,
I'm setting up a Lucene index on 5G of PDF files (full-text search). I've been really happy with Lucene so far, but I'm curious what tips and strategies I can use to optimize my performance at this large size.
So far I am using pretty much all of the defaults (I'm new to Lucene).
I am using PDFBox to add the documents to the index. I can usually add about 800 or so PDF files and then the add loop:
for ( int i = 0; i < fileNames.length; i++ ) {
    Document doc = IndexFile.index(baseDirectory+documentRoot+fileNames[i]);
    writer.addDocument(doc);
}
really starts to slow down. Doesn't seem to be memory related. Thoughts anyone?
Thanks in advance, CK Hill
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
