I think you need run several runs. The first run just crawling the homepage of the site.
I use the screen output as the log information. Do sure whatelse logs are. Michael Ji, --- AJ Chen <[EMAIL PROTECTED]> wrote: > I'm testing nutch whole-web crawling with juts one > url in a text file. > But, after generate/fetch/updatedb/index, there is > only one document in > the index. Questions: > 1. What needs to be set in order to fetch all > available web pages on one > site? > 2. Where is the log file that I can check what's > going on? > Thanks, > > -AJ > > > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
