Add the schema.xml from nutch/conf to your Solr core.

Btw: be careful about posting your host and port on the mailing lists. If it's open...
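
To confirm that a schema mismatch is what is failing, here is a rough sketch that lists which field names a given schema.xml is missing. The function names are made up for illustration and are not part of Nutch or Solr. It assumes the schema declares fields as plain <field name="..."> elements and it ignores dynamicField patterns, so treat the result as a hint rather than a definitive answer.

```python
import xml.etree.ElementTree as ET


def declared_fields(schema_xml):
    """Return the set of names of all <field> elements in a schema.xml string.

    Only explicit <field> declarations are counted; <dynamicField> patterns
    are deliberately ignored in this sketch.
    """
    root = ET.fromstring(schema_xml)
    return {f.get("name") for f in root.iter("field") if f.get("name")}


def missing_fields(schema_xml, needed):
    """Return the entries of `needed` that the schema does not declare, in order."""
    declared = declared_fields(schema_xml)
    return [name for name in needed if name not in declared]
```

Reading the schema.xml of your Solr core into a string and calling missing_fields(text, ["content"]) should return ["content"] if the core is still running a schema without that field, which would match the "unknown field 'content'" error quoted below.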
On Wednesday 26 October 2011 15:07:56 Fred Zimmerman wrote:
> that's it.
>
> org.apache.solr.common.SolrException: ERROR:unknown field 'content'
>
> *ERROR:unknown field 'content'*
>
> request: http://url/solr/update?wt=javabin&version=2
>     at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:436)
>     at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:245)
>     at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
>     at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
>     at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:82)
>     at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48)
>     at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474)
>     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
>     at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
> 2011-10-26 12:58:20,596 ERROR solr.SolrIndexer - java.io.IOException: Job failed!
>
> On Wed, Oct 26, 2011 at 9:03 AM, Markus Jelsma <[email protected]> wrote:
> > Check your hadoop.log and Solr log. If that happens there's usually a
> > field mismatch when indexing.
> >
> > On Wednesday 26 October 2011 14:59:02 Fred Zimmerman wrote:
> > > OK, I've fixed the problem with the parameters giving incorrect paths
> > > to the files. Now I get this:
> > >
> > > $ bin/nutch solrindex http://search.zimzaz.com:8983/solr crawl/crawldb
> > > crawl/linkdb crawl/segments/*
> > > SolrIndexer: starting at 2011-10-26 12:57:57
> > > java.io.IOException: Job failed!
> >
> > --
> > Markus Jelsma - CTO - Openindex
> > http://www.linkedin.com/in/markus17
> > 050-8536620 / 06-50258350

--
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350

