Just barely skimmed the documentation, but it looks like the tool generates
its own shards and pushes them into the collection by manipulating the
configuration of the cluster.
https://www.cloudera.com/documentation/enterprise/5-8-x/topics/search_mapreduceindexertool.html
If that reading is corre
We have solr with the index stored in HDFS. We are running MapReduce jobs
to build the index using the MapReduceIndexerTool from Cloudera with the
go-live option to merge into our live index.
We are seeing an issue where the number of segments in the index never
reduces. It continues to grow until