I've attached a patch to NUTCH-2325 for 1.12. You can use it to patch 1.12 or 
trunk/master and use it before 1.13 is released.
M.

 
 
-----Original message-----
> From:Sujan Suppala <ssupp...@opentext.com>
> Sent: Friday 14th October 2016 13:51
> To: user@nutch.apache.org
> Subject: RE: nutch 1.12 INJECT REST call not honoring db.injector.overwrite
> 
> Thanks.
> 
> https://issues.apache.org/jira/browse/NUTCH-2325
> 
> Hoping this would be fixed in 1.13 .
> 
> Thanks
> Sujan 
> -----Original Message-----
> From: Markus Jelsma [mailto:markus.jel...@openindex.io] 
> Sent: Friday, October 14, 2016 4:43 PM
> To: user@nutch.apache.org
> Subject: RE: nutch 1.12 INJECT REST call not honoring db.injector.overwrite
> 
> REST uses the old method invocation which sets overwrite and update to false, 
> which is wrong.
> 
> https://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_apache_nutch_blob_master_src_java_org_apache_nutch_crawl_Injector.java-23L514&d=DQIFaQ&c=ZgVRmm3mf2P1-XDAyDsu4A&r=TYOvRwySdGnkd8fWW9UKQD84hpS9B0oyD81yyeqf8dE&m=blLsEihQyxklDyosqKUjWAH9t1Jzgm3MwnH0ublHY-I&s=95zGKyM7P7W2lTNPE3KePTHes16i26KQXHYx1xiJlWY&e=
>  
> 
> Please open a ticket.
> 
> M.
> 
>  
>  
> -----Original message-----
> > From:Sujan Suppala <ssupp...@opentext.com>
> > Sent: Friday 14th October 2016 11:44
> > To: user@nutch.apache.org
> > Subject: nutch 1.12 INJECT REST call not honoring db.injector.overwrite
> > 
> > Hi,
> >                 I am using the Nutch 1.12.
> >                 I don't see an option to overwrite an url in the crawl db 
> >via the INJECT REST call, also I have observed that this rest call is not 
> >honoring the config settings for "db.injector.overwrite" , 
> >"db.injector.update" properties when these are set to true.
> > 
> > 
> > POST /job/create
> > 
> > {
> > 
> >     "type":"INJECT",
> > 
> >     "confId":"default",
> > 
> >     "crawlId":"TestCrawl",
> > 
> >     "args": {"url_dir":"c:\\cygwin64\\tmp\\1475752235404-0"}
> > 
> > }
> > 
> > 
> >                 But I could see an option via the inject command:
> >                                 $ bin/nutch inject TestCrawl/crawldb urls 
> >-overwrite
> > 
> > 
> >                 I want to overwrite an url so that its status changes to 
> >UNFETCHED. I have only option to use REST service. Someone help on this?
> > 
> > 
> > Thanks
> > Sujan
> > 

Reply via email to