No problem.  Check out the Intranet configuration section of
the tutorial (http://lucene.apache.org/nutch/tutorial.html):

 

Edit the file conf/crawl-urlfilter.txt and replace MY.DOMAIN.NAME with
the name of the domain you wish to crawl. For example, if you wished to
limit the crawl to the apache.org domain, the line should read:

+^http://([a-z0-9]*\.)*apache.org/

This will include any url in the domain apache.org.

 

________________________________

From: Rajpaul Cheenath [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, February 14, 2006 6:21 AM
To: [email protected]
Subject: Nutch search engine can be used to search only on specific
domain?

 

Hi,

 

I like to know that Nutch search engine can be used to search only on
specific domain.

My requirement allows me to search only on specific web sites (it
include the same web site where we search).

 

Thanks

Raj Paul

 

Reply via email to