Great to hear. Thanks
On Wed, Jan 18, 2012 at 1:26 PM, remi tassing <[email protected]> wrote: > Hi, > > Lewis is right! > > > 1. I copied the jar files "apache-solr-core-3.5.0" and > "apache-solr-solrj-3.5.0" from solr-installation-dir/dist to > nutch-installation-dir/lib > 2. I rebuilt Nutch > 3. It works > > Thanks Lewis! > > Remi > > On Wed, Dec 14, 2011 at 4:27 PM, Markus Jelsma <[email protected] > > wrote: > >> It's something else >> >> NUTCH-1016 Strip UTF-8 non-character codepoints and add logging for >> SolrWriter >> >> On Wednesday 14 December 2011 15:12:23 Lewis John Mcgibbney wrote: >> > Hi Remi, >> > >> > This is a compatibility issue with conflicting versions of Solrj [1] >> > >> > [1] >> > >> http://lucene.472066.n3.nabble.com/Invalid-version-or-the-data-in-not-in-j >> > avabin-format-td1460495.html >> > >> > On Wed, Dec 14, 2011 at 1:57 PM, remi tassing <[email protected]> >> wrote: >> > > Hello guys, >> > > >> > > After crawling with Nutch I tried pushing the index to Solr but it >> > > doesn't work. >> > > >> > > I'm using Nutch-1.2. Solr-3.4 & 3.5 don't work but 1.4 works well! >> > > >> > > $ bin/nutch solrindex http://127.0.0.1:8983/solr/ crawl/crawldb >> > > crawl/linkdb crawl/segments/* >> > > SolrIndexer: starting at 2011-12-14 15:36:15 >> > > java.io.IOException: Job failed! >> > > >> > > This is my nutch log: >> > > ... >> > > 011-12-14 15:37:36,762 INFO indexer.IndexingFilters - Adding >> > > org.apache.nutch.indexer.anchor.AnchorIndexingFilter >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: content >> > > dest: content >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: site >> dest: >> > > site >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: title >> > > dest: title >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: host >> dest: >> > > host >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: segment >> > > dest: segment >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: boost >> > > dest: boost >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: digest >> > > dest: digest >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: tstamp >> > > dest: tstamp >> > > 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: url >> dest: >> > > id 2011-12-14 15:37:36,810 INFO solr.SolrMappingReader - source: url >> > > dest: url 2011-12-14 15:37:37,454 WARN mapred.LocalJobRunner - >> > > job_local_0001 java.lang.RuntimeException: Invalid version or the data >> > > in not in 'javabin' format >> > > at >> > > >> org.apache.solr.common.util.JavaBinCodec.unmarshal(JavaBinCodec.java:99) >> > > at >> > > >> org.apache.solr.client.solrj.impl.BinaryResponseParser.processResponse(Bi >> > > naryResponseParser.java:39) at >> > > >> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHt >> > > tpSolrServer.java:466) at >> > > >> org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHt >> > > tpSolrServer.java:243) at >> > > >> org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(Abstra >> > > ctUpdateRequest.java:105) at >> > > org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49) at >> > > org.apache.nutch.indexer.solr.SolrWriter.write(SolrWriter.java:64) at >> > > >> org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat. >> > > java:54) at >> > > >> org.apache.nutch.indexer.IndexerOutputFormat$1.write(IndexerOutputFormat. >> > > java:44) at >> > > org.apache.hadoop.mapred.ReduceTask$3.collect(ReduceTask.java:440) at >> > > >> org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:15 >> > > 9) at >> > > >> org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:50 >> > > ) at >> > > org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:463) >> > > at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411) >> > > at >> > > >> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216) >> > > 2011-12-14 15:37:38,160 ERROR solr.SolrIndexer - java.io.IOException: >> Job >> > > failed! >> > > >> > > Remi >> >> -- >> Markus Jelsma - CTO - Openindex >> > > -- *Lewis*

