Re: Config issues with URL filters and normalizers in UpdateCrawlDb

2018-03-19 Thread Semyon Semyonov
Hi Sebastian,
No problems. Here it is: https://issues.apache.org/jira/browse/NUTCH-2539
Semyon.
Sent: Monday, March 19, 2018 at 2:02 PM
From: "Sebastian Nagel"
To: dev@nutch.apache.org
Subject: Re: Config issues with URL filters and normalizers in UpdateCrawlDb
Hi Semyon, sor

Re: Config issues with URL filters and normalizers in UpdateCrawlDb

2018-03-19 Thread Sebastian Nagel
Hi Semyon, sorry for the late answer. Yes, you're right, the naming in nutch-default.xml is wrong. Please open a Jira issue to address this. The description should also mention that the property crawldb.url.filters is "temporary" and is set/overwritten by command-line options. Cf. the overview (s