https://gist.github.com/bretthoerner/0dc6bfdbf45a18328d4b


On Thu, Apr 17, 2014 at 11:31 AM, Mark Miller <markrmil...@gmail.com> wrote:

> Odd - might be helpful if you can share the solrconfig.xml being used.
>
> --
> Mark Miller
> about.me/markrmiller
>
> On April 17, 2014 at 12:18:37 PM, Brett Hoerner (br...@bretthoerner.com) wrote:
>
> I'm doing HDFS input and output in my job, with the following:
>
> hadoop jar /mnt/faas-solr.jar \
> -D mapreduce.job.map.class=com.massrel.faassolr.SolrMapper \
> --update-conflict-resolver com.massrel.faassolr.SolrConflictResolver \
> --morphline-file /mnt/morphline-ignore.conf \
> --zk-host $ZKHOST \
> --output-dir hdfs://$MASTERIP:9000/output/ \
> --collection $COLLECTION \
> --go-live \
> --verbose \
> hdfs://$MASTERIP:9000/input/
>
> Index creation works,
>
> $ hadoop fs -ls -R hdfs://$MASTERIP:9000/output/results/part-00000
> drwxr-xr-x - hadoop supergroup 0 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data
> drwxr-xr-x - hadoop supergroup 0 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index
> -rwxr-xr-x 1 hadoop supergroup 61 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0.fdt
> -rwxr-xr-x 1 hadoop supergroup 45 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0.fdx
> -rwxr-xr-x 1 hadoop supergroup 1681 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0.fnm
> -rwxr-xr-x 1 hadoop supergroup 396 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0.si
> -rwxr-xr-x 1 hadoop supergroup 67 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.doc
> -rwxr-xr-x 1 hadoop supergroup 37 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.pos
> -rwxr-xr-x 1 hadoop supergroup 508 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.tim
> -rwxr-xr-x 1 hadoop supergroup 305 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene41_0.tip
> -rwxr-xr-x 1 hadoop supergroup 120 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene45_0.dvd
> -rwxr-xr-x 1 hadoop supergroup 351 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/_0_Lucene45_0.dvm
> -rwxr-xr-x 1 hadoop supergroup 45 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/segments_1
> -rwxr-xr-x 1 hadoop supergroup 110 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/index/segments_2
> drwxr-xr-x - hadoop supergroup 0 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/tlog
> -rw-r--r-- 1 hadoop supergroup 333 2014-04-17 16:00 hdfs://10.98.33.114:9000/output/results/part-00000/data/tlog/tlog.0000000000000000000
>
>
> But the go-live step fails. Is it trying to use the HDFS path as the remote
> index path?
>
> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merging of output shards into Solr cluster...
> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merge hdfs://10.98.33.114:9000/output/results/part-00000 into http://discover8-test-1d.i.massrel.com:8983/solr
> 14/04/17 16:00:31 ERROR hadoop.GoLive: Error sending live merge command
> java.util.concurrent.ExecutionException: org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: directory '/mnt/solr_8983/home/hdfs:/10.98.33.114:9000/output/results/part-00000/data/index' does not exist
> at java.util.concurrent.FutureTask.report(FutureTask.java:122)
> at java.util.concurrent.FutureTask.get(FutureTask.java:188)
> at org.apache.solr.hadoop.GoLive.goLive(GoLive.java:126)
> at org.apache.solr.hadoop.MapReduceIndexerTool.run(MapReduceIndexerTool.java:867)
> at org.apache.solr.hadoop.MapReduceIndexerTool.run(MapReduceIndexerTool.java:609)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
> at org.apache.solr.hadoop.MapReduceIndexerTool.main(MapReduceIndexerTool.java:596)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> at java.lang.reflect.Method.invoke(Method.java:606)
> at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
> Caused by: org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: directory '/mnt/solr_8983/home/hdfs:/10.98.33.114:9000/output/results/part-00000/data/index' does not exist
> at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:495)
> at org.apache.solr.client.solrj.impl.HttpSolrServer.request(HttpSolrServer.java:199)
> at org.apache.solr.client.solrj.request.CoreAdminRequest.process(CoreAdminRequest.java:493)
> at org.apache.solr.hadoop.GoLive$1.call(GoLive.java:100)
> at org.apache.solr.hadoop.GoLive$1.call(GoLive.java:89)
> at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
> at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:744)
> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merging of index shards into Solr cluster took 2.31269488E8 secs
> 14/04/17 16:00:31 INFO hadoop.GoLive: Live merging failed
>
> I'm digging into the code now, but wanted to send this out as a sanity
> check.
>
> Thanks,
> Brett
>
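
One possible reading of the "does not exist" message quoted above, offered as an assumption rather than something confirmed in this thread: if the receiving Solr node treats the directory name as a plain local file path, the hdfs:// URI is not absolute, so it gets resolved under the Solr home directory and the double slash is collapsed. The sketch below reproduces only that string behaviour with java.io.File on a Unix-like system. The class HdfsPathDemo and the helper resolveAgainstHome are hypothetical names for illustration and are not Solr code.

```java
import java.io.File;

public class HdfsPathDemo {

    // Hypothetical helper (not part of Solr): resolve a directory name the way
    // a purely local-filesystem code path might, treating it as a file path.
    public static String resolveAgainstHome(String solrHome, String indexDir) {
        File dir = new File(indexDir);
        if (dir.isAbsolute()) {
            // A real absolute local path is used unchanged.
            return dir.getPath();
        }
        // Anything else is resolved relative to the base directory. On Unix,
        // java.io.File also collapses "//" into "/" while normalizing.
        return new File(solrHome, indexDir).getPath();
    }

    public static void main(String[] args) {
        // Values copied from the log output in the quoted message.
        String solrHome = "/mnt/solr_8983/home";
        String indexDir =
            "hdfs://10.98.33.114:9000/output/results/part-00000/data/index";

        // An hdfs:// URI does not start with "/", so it is not an absolute path.
        System.out.println(new File(indexDir).isAbsolute());
        System.out.println(resolveAgainstHome(solrHome, indexDir));
    }
}
```

On a Unix-like JVM this prints false, followed by /mnt/solr_8983/home/hdfs:/10.98.33.114:9000/output/results/part-00000/data/index, which has the same shape as the path in the RemoteSolrException. If that reading is right, which directory factory the target core is configured with in solrconfig.xml would be the first thing to check.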
