Ferdy,

I was not able to use the sol4-schema.xml that comes with Nutch 2.1 because it threw the error Lewis pointed out (https://issues.apache.org/jira/browse/NUTCH-1486).
Therefore, I used Solr 4 with the original schema.xml. Maybe this is why I am facing the issue. Lewis suggested trying the patch, which I will do now.

The previous setup, Nutch 1.5 with Solr 3.4.x, worked without any issue.

Thanks for your comment.

Erol Akarsu

On Tue, Nov 13, 2012 at 3:57 AM, Ferdy Galema <[email protected]> wrote:

> I'm not a regular Solr user, but here are some pointers: Somehow, you have
> added multiple values for the 'id' field. What did you change from the
> default indexing behaviour? Perhaps some custom IndexingFilters? What schema
> are you using? (Or as a last resort, perhaps you could give Elasticsearch a
> try. This one works out of the box, schema-less).
>
> On Mon, Nov 12, 2012 at 7:29 PM, Erol Akarsu <[email protected]> wrote:
>
> > I am sorry, I forgot to mention the versions of Nutch and SOLR I used
> > when I got the above error.
> >
> > I used SOLR 4.0 and Nutch 2.1.
> >
> > I am hoping somebody can help me resolve this issue.
> >
> > Erol Akarsu
> >
> > On Mon, Nov 12, 2012 at 1:02 PM, Erol Akarsu <[email protected]> wrote:
> >
> > > I am trying to crawl with Nutch and index on SOLR.
> > > Crawling went fine.
> > >
> > > But when I try to index with SOLR, I get this error in my Tomcat
> > > log file: "SEVERE: org.apache.solr.common.SolrException: Document contains
> > > multiple values for uniqueKey field:
> > > id=[fi.foofactory.blog:http/2007/03/twice-speed-half-size.html,
> > > http://blog.foofactory.fi/2007/03/twice-speed-half-size.html,
> > > 0000000000000000]"
> > >
> > > bin/nutch crawl urls/ -depth 2
> > >
> > > eakarsu@ubuntu:~/apache-nutch-2.1/runtime/local$ bin/nutch solrindex http://localhost:8983/solr40/ -reindex
> > > SolrIndexerJob: starting
> > > Adding 31 documents
> > > SolrIndexerJob: java.lang.RuntimeException: job failed: name=solr-index, jobid=job_local_0001
> > > at org.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)
> > > at org.apache.nutch.indexer.solr.SolrIndexerJob.run(SolrIndexerJob.java:46)
> > > at org.apache.nutch.indexer.solr.SolrIndexerJob.indexSolr(SolrIndexerJob.java:54)
> > > at org.apache.nutch.indexer.solr.SolrIndexerJob.run(SolrIndexerJob.java:75)
> > > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> > > at org.apache.nutch.indexer.solr.SolrIndexerJob.main(SolrIndexerJob.java:84)
> > >
> > > Nov 12, 2012 10:42:59 AM org.apache.solr.common.SolrException log
> > > SEVERE: org.apache.solr.common.SolrException: Document contains multiple
> > > values for uniqueKey field:
> > > id=[fi.foofactory.blog:http/2007/03/twice-speed-half-size.html,
> > > http://blog.foofactory.fi/2007/03/twice-speed-half-size.html,
> > > 0000000000000000]
> > > at org.apache.solr.update.AddUpdateCommand.getIndexedId(AddUpdateCommand.java:91)
> > > at org.apache.solr.update.processor.DistributedUpdateProcessor.versionAdd(DistributedUpdateProcessor.java:445)
> > > at org.apache.solr.update.processor.DistributedUpdateProcessor.processAdd(DistributedUpdateProcessor.java:325)
> > > at org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessorFactory.java:100)
> > > at org.apache.solr.update.processor.SignatureUpdateProcessorFactory$SignatureUpdateProcessor.processAdd(SignatureUpdateProcessorFactory.java:181)
> > > at org.apache.solr.handler.loader.XMLLoader.processUpdate(XMLLoader.java:230)
> > > at org.apache.solr.handler.loader.XMLLoader.load(XMLLoader.java:157)
> > > at org.apache.solr.handler.UpdateRequestHandler$1.load(UpdateRequestHandler.java:92)
> > > at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:74)
> > > at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
> > > at org.apache.solr.core.SolrCore.execute(SolrCore.java:1699)
> > > at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:455)
> > > at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:276)
> > > at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
> > > at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
> > > at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
> > > at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
> > > at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
> > > at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
> > > at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
> > > at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:293)
> > > at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:859)
> > > at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:602)
> > > at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
> > > at java.lang.Thread.run(Thread.java:662)
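For reference, the SEVERE message quoted above says that the document arriving at Solr carried three values in the 'id' field, while a uniqueKey field has to hold exactly one. Below is a minimal Python sketch of that kind of check. It is only an illustration, not Solr's or Nutch's actual code: the function names and the dict-based document are hypothetical, and the sample values are copied from the log above.

```python
from typing import Any, Dict, List


def unique_key_values(doc: Dict[str, Any], unique_key: str = "id") -> List[Any]:
    """Return every value the document carries for the unique key field."""
    value = doc.get(unique_key)
    if value is None:
        return []
    if isinstance(value, (list, tuple)):
        return list(value)
    return [value]


def require_single_unique_key(doc: Dict[str, Any], unique_key: str = "id") -> Any:
    """Return the single unique key value, or raise if there are 0 or 2+ values.

    Hypothetical helper that mimics the rule behind the logged error;
    it is an assumption for illustration, not the server-side check itself.
    """
    values = unique_key_values(doc, unique_key)
    if len(values) != 1:
        raise ValueError(
            f"Document contains {len(values)} values for uniqueKey field "
            f"{unique_key}: {values}"
        )
    return values[0]


# The three 'id' values reported in the Tomcat log above.
failing_doc = {
    "id": [
        "fi.foofactory.blog:http/2007/03/twice-speed-half-size.html",
        "http://blog.foofactory.fi/2007/03/twice-speed-half-size.html",
        "0000000000000000",
    ]
}

if __name__ == "__main__":
    try:
        require_single_unique_key(failing_doc)
    except ValueError as err:
        print(err)
```

Running a check like this over the documents before they are posted would show which ones carry more than one 'id' value, which may help narrow down whether the schema mismatch is the cause.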

