hi markus,
I want to try apache nutch 1.12, but i got the following error when index the data to apache solr 5.5.2.

   /16/07/04 19:49:53 INFO mapreduce.Job: Task Id :
   attempt_1467576953324_0090_r_000000_0, Status : FAILED//
   //Error: Bad return type//
   //Exception Details://
   //  Location://
   
//org/apache/solr/client/solrj/impl/HttpClientUtil.createClient(Lorg/apache/solr/common/params/SolrParams;Lorg/apache/http/conn/ClientConnectionManager;)Lorg/apache/http/impl/client/CloseableHttpClient;
   @58: areturn//
   //  Reason://
   //    Type 'org/apache/http/impl/client/DefaultHttpClient' (current
   frame, stack[0]) is not assignable to
   'org/apache/http/impl/client/CloseableHttpClient' (from method
   signature)//
   //  Current Frame://
   //    bci: @58//
   //    flags: { }//
   //    locals: { 'org/apache/solr/common/params/SolrParams',
   'org/apache/http/conn/ClientConnectionManager',
   'org/apache/solr/common/params/ModifiableSolrParams',
   'org/apache/http/impl/client/DefaultHttpClient' }//
   //    stack: { 'org/apache/http/impl/client/DefaultHttpClient' }//
   //  Bytecode://
   //    0000000: bb00 0359 2ab7 0004 4db2 0005 b900 0601//
   //    0000010: 0099 001e b200 05bb 0007 59b7 0008 1209//
   //    0000020: b600 0a2c b600 0bb6 000c b900 0d02 002b//
   //    0000030: b800 104e 2d2c b800 0f2d b0 //
   //  Stackmap Table://
   //    append_frame(@47,Object[#143])/


What should I do, please help.
regards

On 04/07/16 16:37, Markus Jelsma wrote:
Hello - there is no Boilerpipe support for 2.x.
Markus

-----Original message-----
From:Nana Pandiawan <[email protected]>
Sent: Monday 4th July 2016 6:16
To: [email protected]
Subject: Re: Remove Header from content

Hi Markus Jelsma,

If Boilerpipe support for Apache Nutch 2.3.1? i have try
https://issues.apache.org/jira/secure/attachment/12708817/nutch-2.x-boilerpipe.patch,
but doesnt work.

regards

On 29/06/16 17:06, Markus Jelsma wrote:
Manish - you're in luck. Nutch 1.12 was released and has Boilerpipe support. 
Check:
https://issues.apache.org/jira/browse/NUTCH-961

Markus

-----Original message-----
From:Manish Verma <[email protected]>
Sent: Tuesday 28th June 2016 23:46
To: [email protected]
Subject: Remove Header from content

Hi,

I don’t want to index header and footer of content , I know we can make changes 
in HtmlParser.java but I don’t want to change nutch core code, is there any 
other way(plugin) to eleminate Header div from content.

Thanks MV




Reply via email to