No problem. Check out the Intranet configuration section of the tutorial (http://lucene.apache.org/nutch/tutorial.html):
Edit the file conf/crawl-urlfilter.txt and replace MY.DOMAIN.NAME with the name of the domain you wish to crawl. For example, if you wished to limit the crawl to the apache.org domain, the line should read: +^http://([a-z0-9]*\.)*apache.org/ This will include any url in the domain apache.org. ________________________________ From: Rajpaul Cheenath [mailto:[EMAIL PROTECTED] Sent: Tuesday, February 14, 2006 6:21 AM To: [email protected] Subject: Nutch search engine can be used to search only on specific domain? Hi, I like to know that Nutch search engine can be used to search only on specific domain. My requirement allows me to search only on specific web sites (it include the same web site where we search). Thanks Raj Paul
