Hi

There are several posts about the difference between

regex-urlfilter.txt and crawl-urlfilter.txt

e.g. http://www.mail-archive.com/nutch-user@lucene.apache.org/msg06318.html

or http://mail-archives.apache.org/mod_mbox/lucene-nutch-user/200503.mbox/[EMAIL PROTECTED]

This might be a stupid question, but what do you mean by intranet and internet crawling?

In the end both of them are just URLs ... right? It seems to me I completely misunderstand something.

Thanks for any hints

Michi

--
Michael Wechner
Wyona      -   Open Source Content Management   -    Apache Lenya
http://www.wyona.com                      http://lenya.apache.org
[EMAIL PROTECTED]                        [EMAIL PROTECTED]
+41 44 272 91 61
