Fabrice,

Personally I am tailing the crawl log to find that out. About every 100
pages it gives out the amount of pages in total and pages per second and
line speed. 

Hope that helps. 
r/d

-----Original Message-----
From: Fabrice Estiévenart [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, April 05, 2006 2:14 AM
To: [email protected]
Subject: Crawl status

Hello,

One week ago, I launched a crawl on a list of domains with a depth of 
10. My crawler is still running now. How can I have the status of the 
crawl process ? (number of fetched/indexed pages, current depth of 
crawl, percentage of tasks realised...and any other useful information).

"bin/nutch readdb -stats" gives me some tips but I have some 
difficulties to interpret them and they do not chage very often

Thanks for this great list,

Fabrice



-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to