This is what I get in Nutch's hadoop.log:
2011-09-27 16:08:38,279 INFO indexer.IndexingFilters - Adding org.apache.nutch.indexer.more.MoreIndexingFilter
2011-09-27 16:08:38,401 INFO solr.SolrMappingReader - source: content dest: content
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: site dest: site
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: title dest: title
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: host dest: host
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: segment dest: segment
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: boost dest: boost
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: digest dest: digest
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: tstamp dest: tstamp
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: url dest: id
2011-09-27 16:08:38,402 INFO solr.SolrMappingReader - source: url dest: url
2011-09-27 16:09:38,009 WARN more.MoreIndexingFilter - http://www.americanprogress.org/experts/GudeKen.html/repository/capportrait/item760555734: can't parse erroneous date: 2008-03-05T16:19:54
2011-09-27 16:09:38,012 WARN more.MoreIndexingFilter - http://www.americanprogress.org/experts/KorbLawrence.html/repository/capportrait/item808695475: can't parse erroneous date: 2008-03-05T16:05:25
2011-09-27 16:11:42,559 WARN mapred.LocalJobRunner - job_local_0001
org.apache.solr.common.SolrException: For input string: "17456 "
java.lang.NumberFormatException: For input string: "17456 "
    at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
    at java.lang.Long.parseLong(Long.java:419)
    at java.lang.Long.parseLong(Long.java:468)
    at org.apache.solr.schema.TrieField.createField(TrieField.java:381)
    at org.apache.solr.schema.SchemaField.createField(SchemaField.java:104)
    at org.apache.solr.update.DocumentBuilder.addField(DocumentBuilder.java:203)
    at org.apache.solr.update.DocumentBuilder.toDocument(DocumentBuilder.java:276)
    at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:60)
    at org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessorFactory.java:115)
    at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:158)
    at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:79)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:67)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1368)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:356)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:252)
    at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
    at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
    at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
    at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
    at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
    at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
    at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
    at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
    at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerW
For input string: "17456 "
java.lang.NumberFormatException: For input string: "17456 "
    at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48)
    at java.lang.Long.parseLong(Long.java:419)
    at java.lang.Long.parseLong(Long.java:468)
    at org.apache.solr.schema.TrieField.createField(TrieField.java:381)
    at org.apache.solr.schema.SchemaField.createField(SchemaField.java:104)
    at org.apache.solr.update.DocumentBuilder.addField(DocumentBuilder.java:203)
    at org.apache.solr.update.DocumentBuilder.toDocument(DocumentBuilder.java:276)
    at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:60)
    at org.apache.solr.update.processor.LogUpdateProcessor.processAdd(LogUpdateProcessorFactory.java:115)
    at org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:158)
    at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:79)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:67)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:129)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1368)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:356)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:252)
    at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
    at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
    at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
    at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
    at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
    at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
    at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
    at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
    at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerW
request: http://127.0.0.1:8983/solr/update?wt=javabin&version=2
    at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:436)
    at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:245)
    at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
    at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49)
    at org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:71)
    at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:54)
    at org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat.java:44)
    at org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:440)
    at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:159)
    at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:50)
    at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:463)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216)
2011-09-27 16:11:43,161 ERROR solr.SolrIndexer - java.io.IOException: Job failed!
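
The call that fails in the trace is Long.parseLong, which rejects any character that is not a digit or a leading sign, including the trailing space in "17456 ". Below is a minimal sketch of that behavior in plain Java. The class and method names (TrimLongDemo, parsesStrictly, parseLenient) are made up for illustration and are not part of Nutch or Solr. Trimming the value before it is sent is only one possible workaround, not a confirmed fix, and the log above does not say which field carries the value.

```java
// Hypothetical demo class; not part of Nutch or Solr.
class TrimLongDemo {

    // Strict parse: the same JDK call that appears in the stack trace.
    // Returns false when Long.parseLong throws NumberFormatException.
    static boolean parsesStrictly(String raw) {
        try {
            Long.parseLong(raw);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    // Lenient parse (assumed workaround): strip surrounding whitespace first.
    static long parseLenient(String raw) {
        return Long.parseLong(raw.trim());
    }

    public static void main(String[] args) {
        String raw = "17456 "; // value taken from the log, trailing space included
        System.out.println(parsesStrictly(raw)); // prints false
        System.out.println(parseLenient(raw));   // prints 17456
    }
}
```

If the value really does come from a single Nutch field, the same trim could in principle be applied wherever that field is populated, but that location is an assumption until the Solr log names the field.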
On Wed, Sep 28, 2011 at 2:44 PM, Markus Jelsma <[email protected]> wrote:
> Check Solr's log, it's there.
>
> > I don't know. It doesn't say as far as I can tell. Is there a way to look
> > at the logs or data to determine?
> >
> > On Tue, Sep 27, 2011 at 4:55 PM, Markus Jelsma <[email protected]> wrote:
> > > For which field do you get this issue? At least one such issue is fixed
> > > in 1.4-dev.
> > >
> > > > So I added the Nutch fields to my Solr schema and reran solrindex.
> > > > However, now I'm getting a NumberFormatException. Nutch is apparently
> > > > sending "123456 " to Solr to be parsed as a TrieLong.
> > > >
> > > > Any ideas what would cause this and where to look to fix it?
> > > >
> > > > Thanks.
>