Re: SolrCloud: Adding a very large collection to a pre-existing cluster

2016-06-22 Thread Erick Erickson
One other option is to index "somewhere else", then use the collections API to "addreplica"s on your prod cluster. Then perhaps delete replica on the nodes that are "somewhere else". Best, Erick On Jun 21, 2016 4:27 PM, "Jeff Wartes" wrote: There’s no official way of

Re: SolrCloud: Adding a very large collection to a pre-existing cluster

2016-06-21 Thread Jeff Wartes
There’s no official way of doing #1, but there are some less official ways: 1. The Backup/Restore API provides some hooks into loading pre-existing data dirs into an existing collection. Lots of caveats. 2. If you don’t have many shards, there’s always rsync/reload. 3. There are some third-party

SolrCloud: Adding a very large collection to a pre-existing cluster

2016-06-21 Thread Kelly, Frank
We have about 200 million documents (~70 GB) we need to keep indexed across 3 collections. Currently 2 of the 3 collections are already indexed (roughly 90m docs). We'd like to create the remaining collection (about 100 m documents) but minimizing the performance impact on the existing