It looks like the shard split failed and only created one additional shard. I hadn't allocated enough memory for 3x, which the split needs since two additional shards have to be created. I was allocating 20G for each shard, so in order to do the split I needed to give 60G for direct memory access. I've now switched it to 10G and run the split. That works, but I still need to build the directories beforehand, otherwise I get the "cannot find class" problem.
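For reference, here is a rough sketch of the arithmetic behind those numbers. It assumes each block cache block is 8 KiB (the Solr default, which makes a slab of 16384 blocks 128 MiB) and that each core allocates its own block cache, as described above. The function names are made up for illustration and are not part of Solr.

```python
# Rough estimate of HDFS block cache direct memory per core.
# Assumption: block size is 8 KiB, the Solr default; the function
# names below are illustrative only and not part of any Solr API.
BLOCK_SIZE_BYTES = 8 * 1024


def block_cache_bytes(slab_count, blocks_per_bank=16384,
                      block_size=BLOCK_SIZE_BYTES):
    """Direct memory used by one block cache, in bytes."""
    return slab_count * blocks_per_bank * block_size


def to_gib(num_bytes):
    """Convert bytes to GiB."""
    return num_bytes / 1024 ** 3


# slab.count = 160 was the earlier setting, 80 is the current one.
before_gib = to_gib(block_cache_bytes(160))
now_gib = to_gib(block_cache_bytes(80))

# During the split the parent shard plus two sub-shards are live,
# so roughly three caches are needed on the node at once.
cores_during_split = 3
print(f"before: {before_gib:.0f}G per core, "
      f"{cores_during_split * before_gib:.0f}G during split")
print(f"now:    {now_gib:.0f}G per core, "
      f"{cores_during_split * now_gib:.0f}G during split")
```

With these assumptions, slab.count=160 comes out at 20G per core (60G during the split) and slab.count=80 at 10G per core (30G during the split), which matches what I saw.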

Here are my HDFS parameters:

<directoryFactory name="DirectoryFactory" class="solr.HdfsDirectoryFactory">
  <bool name="solr.hdfs.blockcache.enabled">true</bool>
  <int name="solr.hdfs.blockcache.slab.count">80</int>
  <bool name="solr.hdfs.blockcache.direct.memory.allocation">true</bool>
  <int name="solr.hdfs.blockcache.blocksperbank">16384</int>
  <bool name="solr.hdfs.blockcache.read.enabled">true</bool>
  <bool name="solr.hdfs.blockcache.write.enabled">false</bool>
  <bool name="solr.hdfs.nrtcachingdirectory.enable">true</bool>
  <int name="solr.hdfs.nrtcachingdirectory.maxmergesizemb">64</int>
  <int name="solr.hdfs.nrtcachingdirectory.maxcachedmb">512</int>
  <str name="solr.hdfs.home">hdfs://nameservice1:8020/solr6</str>
  <str name="solr.hdfs.confdir">/etc/hadoop/conf.cloudera.hdfs1</str>
</directoryFactory>

I did have the slab.count set to 160 before, but just didn't have the RAM to try the split with it. The split is now running, and I can see the amount of space used by the new shards increasing. It looks like it's going to be overnight before it completes.

-Joe

On Mon, Nov 17, 2014 at 5:57 PM, Erick Erickson <erickerick...@gmail.com> wrote:
> Tell us more about your HDFS stuff. Specifically, how
> do you have your HDFSDirectoryFactory specified in
> solrconfig.xml?
>
> Cause you shouldn't have to do things like create the
> directory ahead of time I don't think.
>
> Best,
> Erick
>
> On Mon, Nov 17, 2014 at 12:17 PM, Joseph Obernberger
> <joseph.obernber...@gmail.com> wrote:
> > Originally I had two shards on two machines - shard1 and shard2.
> > I did a SHARDSPLIT on shard1.
> > Now have shard1, shard2, and shard1_0
> > If I select the core (COLLECT_shard1_0_replica1) and execute a query, I get
> > all the docs OK, but if I specify &distrib=false, I get 0 documents.
> >
> > Under HDFS - when/how will the new core start to get data?
> > Thank you!
> >
> > -Joe