Just want to ask if anyone else has noticed that the index and segments
under the searcher dir are causing a hot spot on the hard drive under
heavy transactional search use.
I am on Windows, Nutch 7.1, Tomcat 5.15, and have tuned the system for some
decent performance. Modified both Tomcat and

Hi all, I have started Nutch to crawl my school's web site. It has been 2 days
now, and I want to stop Nutch crawling the web site in such a way that, before
stopping, it does its housekeeping activities. I mean, if I just close the
window will it be fine? Because I feel that if I do it that Nutch

Hi,
I've been using Nutch 7 for a few months, and recently started
working with 8. I'm testing everything right now on a single server,
using the local file system. I generated 10 segments with 100k URLs in
each, and fetched the content. Then I did the updatedb, but it looks like
the

Hi there,
I have a webdb with over 60,000 pages (using the nutch/admin
dumptxt command) and the refetch interval is set to 1
day:
<property>
  <name>db.default.fetch.interval</name>
  <value>1</value>
  <description>The default number of days between
  re-fetches of a page.
  </description>
</property>
But, when I do