: For example, when you are indexing every hour and large document set
: is present, it takes >1 hr to index the documents. Now you are
: already behind indexing for the next hour. How do you design
: something that is robust?
fundementally, this question is really about issues in a producer/con
Don't do it that way ? Is this an actual or theoretical
scenario? And do you reasonably expect it to become actual?
Otherwise, why bother?
And you've got other problems here. If you're indexing that
much data, you'll soon outgrow your disk. Unless you're
replacing most of the documents.
But assu