Has anyone mitigated the potentially large IO impact of doing a backup of a
large collection or just in general?  If the collection is large enough,
there very well could be many shards on one host and it could saturate the
IO.  I wonder if there should be a rate limit mechanism or some other
mechanism.

Not the same but I know that at a segment level, the merges are rate
limited -- ConcurrentMergeScheduler doesn't quite let you set it but
adjusts itself automatically ("ioThrottle" boolean).

~ David Smiley
Apache Lucene/Solr Search Developer
http://www.linkedin.com/in/davidwsmiley

Reply via email to