Hi,

Adding new documents to the index is not very costly even when the index is large. The newly added documents result in additional segment(s). If you then optimize then that is extremely costly for a large index. Test your search performance to see if you really need to optimize.

The costly part is re-opening your searchers. There are plans underway to improve the re-open performance so that the cost is more in proportion to what new segments were newly added (unlike today where cost is in proportion to total size of the index), but those are just plans at this point.

Mike

Vinicius Carvalho wrote:
Hello there! Since I've just begun with lucene, some concepts are kinda new for me :). One of the is the whole indexing process. Well, AFAIK, indexing should happen in a batch process right, to maximize the time spent on this
operation. One issue tough, is that our client wants "instants search
results", I mean, after something is added, the next search should have it available. Well, it's almost impossible to convince him (and I believe you guys know how this works) that indexing should not happen online (maybe I'm
just wrong here as well).

Well, my question is, when I have a reasonable amount of indexed data, how expensive is to add a new document? Does it re-builds the entire index? I believe we'll have a small amount of data (when compare to things I've seen
lucene is capable of), probably 50k of html docs, 10k of pdf, and 5--6
million rows of database information. With that scenario, an online index
(plus optimize) would be a good idea?

Best regards

--
"In a world without fences and walls, who needs Gates and Windows?"


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to