I have some newbie questions. - There are two filters crawl-urlfilter.txt and regex-urlfilter.txt. Which one should be configured in which condition?
- Is it possible to see howmuch bandwidth Nutch crawl consumes? - Can the Nutch bot do NTLM authentication for websites in a domain? - Is there any benchmarking tool to compare the performance of Nutch bot with different number of threads? - Howmany threads are recommended? Cheers B. Hugh