hello, I can finally run Nutch (+Solr) with JAVA, my only question left is, how can I make sure if a particular domain has been crawled?
Let's say I have 300 sites to crawl and index. So far my work-around was to execute a simple Solr query for each domain URL, and see if the indexing timestamp in the Solr DB is greater then the Nutch crawling start date-time. It works, but I'm curious if there is a better way to do this. thanks, --iM -- View this message in context: http://lucene.472066.n3.nabble.com/Nutch-status-info-on-each-domain-individually-tp4042815.html Sent from the Nutch - User mailing list archive at Nabble.com.