Hi

There are are several posts about the difference between

regex-urlfilter.txt crawl-urlfilter.txt

e.g.http://www.mail-archive.com/nutch-user@lucene.apache.org/msg06318.html

or 
http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200503.mbox/[EMAIL 
PROTECTED]

but it might stupid, but  what do you mean by intranet and internet 
crawling?

In the end both of them are just URLs ... right? It seems to me I 
completely misunderstand something.

Thanks for a hint

Michi

-- 
Michael Wechner
Wyona      -   Open Source Content Management   -    Apache Lenya
http://www.wyona.com                      http://lenya.apache.org
[EMAIL PROTECTED]                        [EMAIL PROTECTED]
+41 44 272 91 61


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-developers mailing list
Nutch-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to