Nutch search and hard drive hot spots

2006-04-09 Thread Dan Morrill
Just want to ask if anyone else has noticed that the index and segments under the searcher dir are causing a hot spot on the hard drive under heavy search transaction load. I am on Windows, Nutch 7.1, Tomcat 5.15, and have tuned the system for some decent performance. Modified both Tomcat and

Strange question!! but i want to know how to stop Nutch successfully

2006-04-09 Thread sapan euf
Hi all, I have started Nutch to crawl my school's web site. It has been 2 days now, and I want to stop Nutch crawling the web site in such a way that it does its housekeeping activities before stopping. I mean, if I just close the window, will it be fine? Because I feel that if I do that, Nutch

Question about crawldb and segments

2006-04-09 Thread Jason Camp
Hi, I've been using Nutch 7 for a few months, and recently started working with 8. I'm testing everything right now on a single server, using the local file system. I generated 10 segments with 100k urls in each, and fetched the content. Then I do the updatedb, but it looks like the

refetching interval

2006-04-09 Thread Michael Ji
Hi there, I have a webdb with over 60,000 pages (using nutch/admin dumptxt command) and the refetching interval is set to 1 day: <property> <name>db.default.fetch.interval</name> <value>1</value> <description>The default number of days between re-fetches of a page.</description> </property> But, when I do