Anyone have any thoughts on this? In general, am I expected to be able to go-live from an unrelated cluster of Hadoop machines to a SolrCloud that isn't running off of HDFS?
intput: HDFS output: HDFS go-live cluster: SolrCloud cluster on different machines running on plain MMapDirectory I'm back to looking at the code but holy hell is debugging Hadoop hard. :) On Thu, Apr 17, 2014 at 12:33 PM, Brett Hoerner <br...@bretthoerner.com>wrote: > https://gist.github.com/bretthoerner/0dc6bfdbf45a18328d4b > > > On Thu, Apr 17, 2014 at 11:31 AM, Mark Miller <markrmil...@gmail.com>wrote: > >> Odd - might be helpful if you can share your sorlconfig.xml being used. >> >> -- >> Mark Miller >> about.me/markrmiller >> >> On April 17, 2014 at 12:18:37 PM, Brett Hoerner (br...@bretthoerner.com) >> wrote: >> >> I'm doing HDFS input and output in my job, with the following: >> >> hadoop jar /mnt/faas-solr.jar \ >> -D mapreduce.job.map.class=com.massrel.faassolr.SolrMapper \ >> --update-conflict-resolver com.massrel.faassolr.SolrConflictResolver >> \ >> --morphline-file /mnt/morphline-ignore.conf \ >> --zk-host $ZKHOST \ >> --output-dir hdfs://$MASTERIP:9000/output/ \ >> --collection $COLLECTION \ >> --go-live \ >> --verbose \ >> hdfs://$MASTERIP:9000/input/ >> >> Index creation works, >> >> $ hadoop fs -ls -R hdfs://$MASTERIP:9000/output/results/part-00000 >> drwxr-xr-x - hadoop supergroup 0 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data >> drwxr-xr-x - hadoop supergroup 0 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index >> -rwxr-xr-x 1 hadoop supergroup 61 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0.fdt >> -rwxr-xr-x 1 hadoop supergroup 45 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0.fdx >> -rwxr-xr-x 1 hadoop supergroup 1681 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0.fnm >> -rwxr-xr-x 1 hadoop supergroup 396 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0.si >> -rwxr-xr-x 1 hadoop supergroup 67 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.doc >> -rwxr-xr-x 1 hadoop supergroup 37 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.pos >> -rwxr-xr-x 1 hadoop supergroup 508 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.tim >> -rwxr-xr-x 1 hadoop supergroup 305 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.tip >> -rwxr-xr-x 1 hadoop supergroup 120 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene45_0.dvd >> -rwxr-xr-x 1 hadoop supergroup 351 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene45_0.dvm >> -rwxr-xr-x 1 hadoop supergroup 45 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/segments_1 >> -rwxr-xr-x 1 hadoop supergroup 110 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/index/segments_2 >> drwxr-xr-x - hadoop supergroup 0 2014-04-17 16:00 hdfs:// >> 10.98.33.114:9000/output/results/part-00000/data/tlog >> -rw-r--r-- 1 hadoop supergroup 333 2014-04-17 16:00 hdfs:// >> >> 10.98.33.114:9000/output/results/part-00000/data/tlog/tlog.0000000000000000000 >> >> But the go-live step fails, it's trying to use the HDFS path as the remote >> index path? >> >> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merging of output shards into >> Solr cluster... >> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merge hdfs:// >> 10.98.33.114:9000/output/results/part-00000 into >> http://discover8-test-1d.i.massrel.com:8983/solr >> 14/04/17 16:00:31 ERROR hadoop.GoLive: Error sending live merge command >> java.util.concurrent.ExecutionException: >> org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: >> directory '/mnt/solr_8983/home/hdfs:/ >> 10.98.33.114:9000/output/results/part-00000/data/index' does not exist >> at java.util.concurrent.FutureTask.report(FutureTask.java:122) >> at java.util.concurrent.FutureTask.get(FutureTask.java:188) >> at org.apache.solr.hadoop.GoLive.goLive(GoLive.java:126) >> at >> >> org.apache.solr.hadoop.MapReduceIndexerTool.run(MapReduceIndexerTool.java:867) >> at >> >> org.apache.solr.hadoop.MapReduceIndexerTool.run(MapReduceIndexerTool.java:609) >> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) >> at >> >> org.apache.solr.hadoop.MapReduceIndexerTool.main(MapReduceIndexerTool.java:596) >> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >> at >> >> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) >> at >> >> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) >> at java.lang.reflect.Method.invoke(Method.java:606) >> at org.apache.hadoop.util.RunJar.main(RunJar.java:212) >> Caused by: >> org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: >> directory '/mnt/solr_8983/home/hdfs:/ >> 10.98.33.114:9000/output/results/part-00000/data/index' does not exist >> at >> >> org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:495) >> at >> >> org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:199) >> at >> >> org.apache.solr.client.solrj.request.CoreAdminRequest.process(CoreAdminRequest.java:493) >> at org.apache.solr.hadoop.GoLive$1.call(GoLive.java:100) >> at org.apache.solr.hadoop.GoLive$1.call(GoLive.java:89) >> at java.util.concurrent.FutureTask.run(FutureTask.java:262) >> at >> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) >> at java.util.concurrent.FutureTask.run(FutureTask.java:262) >> at >> >> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) >> at >> >> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) >> at java.lang.Thread.run(Thread.java:744) >> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merging of index shards into >> Solr cluster took 2.31269488E8 secs >> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merging failed >> >> I'm digging into the code now, but wanted to send this out as a sanity >> check. >> >> Thanks, >> Brett >> > >