hello,

I can finally run Nutch (+Solr) with JAVA, my only question left is, how can
I make sure if a particular domain has been crawled?

Let's say I have 300 sites to crawl and index.
So far my work-around was to execute a simple Solr query for each domain
URL, and see if the indexing timestamp in the Solr DB is greater then the
Nutch crawling start date-time. It works, but I'm curious if there is a
better way to do this. 

thanks,
--iM



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-status-info-on-each-domain-individually-tp4042815.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to