I have some newbie questions.

- There are two filter files, crawl-urlfilter.txt and regex-urlfilter.txt.
Which one should be configured under which conditions?

- Is it possible to see how much bandwidth a Nutch crawl consumes?

- Can the Nutch bot do NTLM authentication for websites in a domain?

- Is there any benchmarking tool to compare the performance of the Nutch
bot with different numbers of threads?

- How many threads are recommended?

Cheers
B. Hugh
