Hi

There are several posts about the difference between
regex-urlfilter.txt and crawl-urlfilter.txt, e.g.
http://www.mail-archive.com/nutch-user@lucene.apache.org/msg06318.html
or
http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200503.mbox/[EMAIL PROTECTED]

It might be a stupid question, but what do you mean by intranet and
internet crawling? In the end both of them are just URLs ... right?

It seems to me I completely misunderstand something.

Thanks for a hint

Michi

-- 
Michael Wechner
Wyona - Open Source Content Management - Apache Lenya
http://www.wyona.com http://lenya.apache.org
[EMAIL PROTECTED] [EMAIL PROTECTED]
+41 44 272 91 61

_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers