Hi,

We run Nutch in a data center where we can't access the internet, but the
outside URLs are reachable there under different hostnames/IP addresses.

We feed the data crawled by Nutch to Solr.

Is there any way for us to change the crawled URLs (which are accessible only
from the intranet) to URLs that are accessible on the internet?
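
To illustrate the kind of rewrite we mean, here is a rough sketch in plain
Java. It is not a Nutch or Solr API: the class name, the method name, and the
hostnames are all made up for illustration, and where such a step would hook
into indexing is exactly what we are asking about. It only swaps the host of a
URL when that host appears in a lookup table and leaves everything else alone.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.Map;

// Hypothetical helper, not part of Nutch: maps intranet hosts to public hosts.
class UrlHostRewriter {

    // hostMap: intranet hostname -> internet hostname (assumed to be maintained by us).
    static String rewrite(String url, Map<String, String> hostMap) {
        try {
            URI u = new URI(url);
            String host = u.getHost();
            String publicHost = (host == null) ? null : hostMap.get(host);
            if (publicHost == null) {
                return url; // no mapping known: keep the crawled URL unchanged
            }
            // Rebuild the URL with the same scheme, port, path, query and fragment.
            return new URI(u.getScheme(), u.getUserInfo(), publicHost, u.getPort(),
                           u.getPath(), u.getQuery(), u.getFragment()).toString();
        } catch (URISyntaxException e) {
            return url; // unparsable URL: leave it as crawled
        }
    }
}
```

For example, with a table that maps intranet.example.local to
www.example.com, the crawled URL
http://intranet.example.local/docs/page.html?id=1 would be indexed as
http://www.example.com/docs/page.html?id=1.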


--
View this message in context: 
http://lucene.472066.n3.nabble.com/crawl-url-replacement-during-indexing-tp3377801p3377801.html
Sent from the Nutch - User mailing list archive at Nabble.com.
