I found this script: http://wiki.apache.org/nutch/MonitoringNutchCrawls. But I'm not sure how to use it when Hadoop is running on a farm of 15 machines. Maybe I should somehow use the Hadoop TaskTracker instead of this script?

caezar wrote:
>
> Hi All,
>
> Is there a way to retrieve the Nutch crawling status at runtime? Let me
> describe what I mean. For instance, if a fetch job is currently running, I
> want to find out that the fetch is running, how many URLs have already been
> fetched, and how many errors have occurred. A Hadoop farm is used.
>
> Thanks for any ideas.
>

--
View this message in context: http://www.nabble.com/Nutch-crawling-status-tp24681707p24681949.html
Sent from the Nutch - User mailing list archive at Nabble.com.
