Hi. I have a few questions about recrawling. The script works fine. I have about a 1000 seedings, but the don't seem to "grow". I'm recrawling now, more urls will be injected soon. How often should the re-crawl script run? And if i run it very often- say, every 3 days, won't the computer eventually run out of memory? If so, how can that be prevented, and if it happens, how can it be fixed?
Sorry for the many questions, but i also need to know how to "maintain" nutch. I would really apreciate all and any help. Thanks in advance. -- View this message in context: http://www.nabble.com/Re-crawl-frequency-memory-problem--please-help-tp18048873p18048873.html Sent from the Nutch - User mailing list archive at Nabble.com.
