Hi, We want to fetch all sub-folders of a site, excluding external links and pages at higher levels of the site hierarchy. Is there a way to set up Nutch to work in this mode without using the regex filter file (regex-urlfilter.txt)?
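To make the requirement concrete, here is a small Python sketch of the rule we have in mind. It is not Nutch code and does not use any Nutch API; the function name in_subtree and the example.com URLs are made up for illustration only.

```python
from urllib.parse import urlsplit


def in_subtree(url: str, root: str) -> bool:
    """Return True if url is on the same host as root and at or below root's folder.

    Hypothetical helper, only meant to illustrate the desired crawl scope.
    """
    u, r = urlsplit(url), urlsplit(root)
    # Reject outside links: scheme and host must match the start URL.
    if (u.scheme, u.netloc.lower()) != (r.scheme, r.netloc.lower()):
        return False
    # Treat the root as a folder, so "/docs" and "/docs/" behave the same.
    base = r.path if r.path.endswith("/") else r.path + "/"
    path = u.path or "/"
    # Accept the folder itself and anything beneath it; reject upper levels.
    return path == base.rstrip("/") or path.startswith(base)


if __name__ == "__main__":
    root = "http://example.com/docs/"
    for candidate in (
        "http://example.com/docs/a/b.html",   # sub-folder page: wanted
        "http://example.com/index.html",      # upper level: not wanted
        "http://other.example.org/docs/a",    # outside link: not wanted
    ):
        print(candidate, in_subtree(candidate, root))
```

In other words, only URLs for which this check is true should be fetched, given the start URL as the root.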
Thank you.
