I'm getting this error on crawl at indexing. 

Error log attached. 

Nutch 1.12 into Solr 5.4.1. 

Solr is set up to use schema.xml instead of managed_schema... as attached. 

Nutch is set up to use the metadata plugin to extract three metadata fields. 

Can I get some bread crumbs toward a solution? 

Thx, 

Kris 


--snipped previous content---

2016-09-07 13:19:44,372 INFO  indexer.IndexingJob - Indexer: starting at 2016-09-07 13:19:44
2016-09-07 13:19:44,375 INFO  indexer.IndexingJob - Indexer: deleting gone documents: false
2016-09-07 13:19:44,375 INFO  indexer.IndexingJob - Indexer: URL filtering: false
2016-09-07 13:19:44,375 INFO  indexer.IndexingJob - Indexer: URL normalizing: false
2016-09-07 13:19:44,502 INFO  indexer.IndexWriters - Adding org.apache.nutch.indexwriter.solr.SolrIndexWriter
2016-09-07 13:19:44,502 INFO  indexer.IndexingJob - Active IndexWriters :
SOLRIndexWriter
	solr.server.url : URL of the SOLR instance
	solr.zookeeper.hosts : URL of the Zookeeper quorum
	solr.commit.size : buffer size when sending to SOLR (default 1000)
	solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml)
	solr.auth : use authentication (default false)
	solr.auth.username : username for authentication
	solr.auth.password : password for authentication


2016-09-07 13:19:44,504 INFO  indexer.IndexerMapReduce - IndexerMapReduce: crawldb: crawl/crawldb
2016-09-07 13:19:44,504 INFO  indexer.IndexerMapReduce - IndexerMapReduce: linkdb: crawl/linkdb
2016-09-07 13:19:44,504 INFO  indexer.IndexerMapReduce - IndexerMapReduces: adding segment: crawl/segments/20160907131416
2016-09-07 13:19:44,967 WARN  conf.Configuration - file:/tmp/hadoop-musshorn/mapred/staging/musshorn600793053/.staging/job_local600793053_0001/job.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.retry.interval;  Ignoring.
2016-09-07 13:19:44,970 WARN  conf.Configuration - file:/tmp/hadoop-musshorn/mapred/staging/musshorn600793053/.staging/job_local600793053_0001/job.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.attempts;  Ignoring.
2016-09-07 13:19:45,061 WARN  conf.Configuration - file:/tmp/hadoop-musshorn/mapred/local/localRunner/musshorn/job_local600793053_0001/job_local600793053_0001.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.retry.interval;  Ignoring.
2016-09-07 13:19:45,064 WARN  conf.Configuration - file:/tmp/hadoop-musshorn/mapred/local/localRunner/musshorn/job_local600793053_0001/job_local600793053_0001.xml:an attempt to override final parameter: mapreduce.job.end-notification.max.attempts;  Ignoring.
2016-09-07 13:19:45,251 INFO  anchor.AnchorIndexingFilter - Anchor deduplication is: off
2016-09-07 13:19:46,599 INFO  indexer.IndexWriters - Adding org.apache.nutch.indexwriter.solr.SolrIndexWriter
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: content dest: content
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: title dest: title
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: host dest: host
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: segment dest: segment
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: boost dest: boost
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: digest dest: digest
2016-09-07 13:19:46,739 INFO  solr.SolrMappingReader - source: tstamp dest: tstamp
2016-09-07 13:19:46,984 INFO  solr.SolrIndexWriter - Indexing 250/250 documents
2016-09-07 13:19:46,984 INFO  solr.SolrIndexWriter - Deleting 0 documents
2016-09-07 13:19:47,232 INFO  solr.SolrIndexWriter - Indexing 250/250 documents
2016-09-07 13:19:47,232 INFO  solr.SolrIndexWriter - Deleting 0 documents
2016-09-07 13:19:47,356 WARN  mapred.LocalJobRunner - job_local600793053_0001
java.lang.Exception: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/TEST_CORE: This IndexSchema is not mutable.
	at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:529)
Caused by: org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException: Error from server at http://localhost:8983/solr/TEST_CORE: This IndexSchema is not mutable.
	at org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:575)
	at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:241)
	at org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:230)
	at org.apache.solr.client.solrj.SolrClient.request(SolrClient.java:1220)
	at org.apache.nutch.indexwriter.solr.SolrIndexWriter.push(SolrIndexWriter.java:209)
	at org.apache.nutch.indexwriter.solr.SolrIndexWriter.write(SolrIndexWriter.java:173)
	at org.apache.nutch.indexer.IndexWriters.write(IndexWriters.java:85)
	at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:50)
	at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:41)
	at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.write(ReduceTask.java:493)
	at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:422)
	at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:367)
	at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:56)
	at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:444)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:392)
	at org.apache.hadoop.mapred.LocalJobRunner$Job$ReduceTaskRunnable.run(LocalJobRunner.java:319)
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
	at java.lang.Thread.run(Thread.java:745)
2016-09-07 13:19:48,099 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed!
	at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:836)
	at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:145)
	at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:228)
	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
	at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:237)

Attachment: nutch-site.xml
Description: XML document

Attachment: solrconfig.xml
Description: XML document

Attachment: schema.xml
Description: XML document

Reply via email to