[ http://issues.apache.org/jira/browse/NUTCH-87?page=all ]
Matt Kangas updated NUTCH-87:
-
Version: 0.7.2-dev
0.8-dev
Efficient site-specific crawling for a large number of sites
[ http://issues.apache.org/jira/browse/NUTCH-87?page=all ]
Matt Kangas updated NUTCH-87:
-
Attachment: build.xml.patch-0.8
The previous patch file is valid for 0.7. Here is one that works for 0.8-dev
(trunk).
(It's three separate one-line additions, to
[ http://issues.apache.org/jira/browse/NUTCH-87?page=all ]
Matt Kangas updated NUTCH-87:
-
Attachment: build.xml.patch
urlfilter-whitelist.tar.gz
THIS REPLACES THE PREVIOUS TARBALL
SEE THE INCLUDED README.txt FOR USAGE GUIDELINES
Place both