--- Leo Galambos <[EMAIL PROTECTED]> wrote:
> Otis Gospodnetic wrote:
>
> >Without seeing more information/code, I can't tell which part of your
> >system slows down with time, but I can tell you that Lucene's 'add'
> >does not slow over time (i.e. as the index gets larger). Therefore, I
> >would look elsewhere for causes of the slowdown.
>
> Otis, can you point me to some proof that the time of the "insert"
> operation does not depend on the index size, please? The amortized time
> of "insert" is O(log(docsIndexed/mergeFac)), I think.
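A rough way to see where a logarithmic estimate like the one quoted above could come from is to model segment merging directly. The sketch below is a simplified model, not Lucene's actual merge code: it assumes each added document starts as its own segment, and that whenever mergeFactor segments of equal size exist they are merged into one, rewriting every document in them. The class and method names (MergeCostModel, totalMergeCopies) are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

public class MergeCostModel {

    /**
     * Counts how many times documents are rewritten by merges while
     * numDocs documents are added one at a time (simplified model,
     * mergeFactor must be at least 2).
     */
    static long totalMergeCopies(int numDocs, int mergeFactor) {
        // counts.get(k) = number of live segments holding mergeFactor^k docs
        List<Integer> counts = new ArrayList<>();
        long copies = 0;
        for (int d = 0; d < numDocs; d++) {
            int level = 0;
            long segmentSize = 1;
            bump(counts, level); // the new document is a one-doc segment
            // Cascade: mergeFactor equal-sized segments collapse into one.
            while (counts.get(level) == mergeFactor) {
                counts.set(level, 0);
                segmentSize *= mergeFactor; // size of the merged segment
                copies += segmentSize;      // each doc in it is rewritten once
                level++;
                bump(counts, level);
            }
        }
        return copies;
    }

    private static void bump(List<Integer> counts, int level) {
        if (level == counts.size()) {
            counts.add(0);
        }
        counts.set(level, counts.get(level) + 1);
    }

    public static void main(String[] args) {
        int mergeFactor = 10; // assumed value, for illustration only
        for (int n : new int[] {1_000, 100_000, 1_000_000}) {
            double perDoc = (double) totalMergeCopies(n, mergeFactor) / n;
            System.out.printf("docs=%d avg rewrites per doc=%.2f%n", n, perDoc);
        }
    }
}
```

In this model the average number of rewrites per document grows with the logarithm (base mergeFactor) of the document count, so it rises by only one each time the index grows by a factor of mergeFactor.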

This would imply that Lucene gets slower as it adds more documents to
the index. Have you observed this behaviour? I haven't.

> Thus I do not know how it could be O(1).

~ O(1) is what I have observed through experiments with indexing of
several million documents.

Otis

> AFAIK the issue with PDF files may be caused by the PDF parser (I
> already encountered this with PDFBox).
>
> >The easiest thing to do is add logging to suspicious portions of the
> >code. That will narrow the scope of the code you need to analyze.
> >
> >Otis
> >
> >--- [EMAIL PROTECTED] wrote:
> >
> >>Hey Lucene-users,
> >>
> >>I'm setting up a Lucene index on 5G of PDF files (full-text search).
> >>I've been really happy with Lucene so far but I'm curious what tips
> >>and strategies I can use to optimize my performance at this large size.
> >>
> >>So far I am using pretty much all of the defaults (I'm new to Lucene).
> >>
> >>I am using PDFBox to add the documents to the index.
> >>I can usually add about 800 or so PDF files and then the add loop:
> >>
> >>for ( int i = 0; i < fileNames.length; i++ ) {
> >>    Document doc = IndexFile.index(baseDirectory+documentRoot+fileNames[i]);
> >>    writer.addDocument(doc);
> >>}
> >>
> >>really starts to slow down. Doesn't seem to be memory related.
> >>Thoughts anyone?
> >>
> >>Thanks in advance,
> >>CK Hill
> >>
> >>---------------------------------------------------------------------
> >>To unsubscribe, e-mail: [EMAIL PROTECTED]
> >>For additional commands, e-mail: [EMAIL PROTECTED]
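
One way to act on the logging suggestion in this thread is to time the parsing step and the add step separately, so it becomes visible which of the two grows as more files are processed. The following is only a minimal sketch with stand-in functions: AddLoopTimer, Totals, and timeLoop are hypothetical names, and in a real setup the parse function would wrap the IndexFile.index(...) call and the add function would wrap writer.addDocument(doc) from the original loop, probably with try/catch wrappers if those calls throw checked exceptions.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;
import java.util.function.Function;

public class AddLoopTimer {

    /** Accumulated time spent in each phase, in nanoseconds. */
    static final class Totals {
        long parseNanos;
        long addNanos;
    }

    /**
     * Runs parse then add for every file name, logging the time spent in
     * each phase per batch of reportEvery files.
     */
    static <D> Totals timeLoop(List<String> fileNames,
                               Function<String, D> parse,
                               Consumer<D> add,
                               int reportEvery) {
        Totals totals = new Totals();
        long parseBatch = 0;
        long addBatch = 0;
        for (int i = 0; i < fileNames.size(); i++) {
            long t0 = System.nanoTime();
            D doc = parse.apply(fileNames.get(i)); // e.g. PDF text extraction
            long t1 = System.nanoTime();
            add.accept(doc);                       // e.g. adding to the index
            long t2 = System.nanoTime();
            parseBatch += t1 - t0;
            addBatch += t2 - t1;

            boolean lastFile = (i + 1 == fileNames.size());
            if ((i + 1) % reportEvery == 0 || lastFile) {
                System.out.printf("up to file %d: parse %d ms, add %d ms%n",
                        i + 1, parseBatch / 1_000_000, addBatch / 1_000_000);
                totals.parseNanos += parseBatch;
                totals.addNanos += addBatch;
                parseBatch = 0;
                addBatch = 0;
            }
        }
        return totals;
    }

    public static void main(String[] args) {
        // Stand-ins for the real parser and index writer (assumptions).
        List<String> fileNames = List.of("a.pdf", "b.pdf", "c.pdf");
        List<String> fakeIndex = new ArrayList<>();
        Function<String, String> parse = name -> "parsed:" + name;
        Consumer<String> add = fakeIndex::add;

        Totals totals = timeLoop(fileNames, parse, add, 2);
        System.out.printf("total: parse %d ns, add %d ns%n",
                totals.parseNanos, totals.addNanos);
    }
}
```

If the per-batch parse time climbs while the add time stays flat, the slowdown is more likely in the PDF parsing than in the index; the reverse pattern would point at the indexing side.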