Markus Jelsma created NUTCH-2231:
------------------------------------
Summary: Jexl support in generator job
Key: NUTCH-2231
URL: https://issues.apache.org/jira/browse/NUTCH-2231
Project: Nutch
Issue Type: Improvement
Affects Versions: 1.11
Reporter: Markus Jelsma
Assignee: Markus Jelsma
Fix For: 1.12
Generator should support Jexl expressions. This would make it much easier to
implement focussing crawlers that rely on information stored in the CrawlDB.
With the HostDB it is possible to restrict the generator to select only
interesting records but it is very cumbersome and involves
domainblacklist-urlfiltering.
With Jexl support, it is no hassle!
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)