Hi,

How can duplicate content be avoided?

1. Mirror sites: one website, two domains.
2. Confusing the bot: dynamic URLs. As robots find dynamic content, the site may return a different URL with the same content…
3. Print-friendly pages?
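To make the cases above concrete, here is a minimal sketch of one common approach: fingerprint each page's extracted text and group the URLs that share a fingerprint. It is not necessarily what Nutch does; the function names, the normalization rule, and the example URLs are all assumptions made up for illustration.

```python
import hashlib
from collections import defaultdict


def content_fingerprint(text: str) -> str:
    """Hash page text after collapsing whitespace and lowercasing (assumed rule)."""
    normalized = " ".join(text.split()).lower()
    return hashlib.md5(normalized.encode("utf-8")).hexdigest()


def group_duplicates(pages: dict) -> list:
    """Return sorted groups of URLs whose content fingerprints are identical."""
    by_hash = defaultdict(list)
    for url, text in pages.items():
        by_hash[content_fingerprint(text)].append(url)
    # Only fingerprints shared by more than one URL indicate duplicates.
    return [sorted(urls) for urls in by_hash.values() if len(urls) > 1]


# Hypothetical crawl results: a mirror domain and a session-id URL
# both serve the same text as the original page.
pages = {
    "http://example.com/article?id=7": "Keep  discovering\nnew things.",
    "http://mirror.example.org/article?id=7": "Keep discovering new things.",
    "http://example.com/article?id=7&sessionid=abc": "Keep discovering new things.",
    "http://example.com/about": "About this site.",
}
duplicate_groups = group_duplicates(pages)
```

An exact hash like this only catches the mirror and dynamic-URL cases when the extracted text is the same. A print-friendly page usually drops navigation or adds a header, so it would probably need a fuzzier comparison than this sketch provides.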
Will Nutch enhance the dedup code?

/Jack

--
Keep Discovering ... ...
http://www.jroller.com/page/jmars
