Yes, Nutch can crawl web pages, and you can restrict the crawler
to a set of hosts.
Try the intranet crawl tutorial to get an idea.
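To give a rough idea of what limiting the crawler to a set of hosts means, here is a minimal sketch in Python. It is not Nutch code and does not use any Nutch API. As far as I recall, the intranet tutorial does the equivalent with a regular-expression line in conf/crawl-urlfilter.txt. The domain list and the function names below are hypothetical, chosen only for illustration.

```python
import re

# Hypothetical allow-list; replace with the hosts you want crawled.
ALLOWED_DOMAINS = ["apache.org", "example.com"]


def build_host_filter(domains):
    """Compile one regex that accepts http(s) URLs whose host is one of
    the given domains or a subdomain of one of them."""
    alternatives = "|".join(re.escape(d) for d in domains)
    pattern = r"^https?://([a-z0-9-]+\.)*(" + alternatives + r")(:\d+)?(/|$)"
    return re.compile(pattern, re.IGNORECASE)


HOST_FILTER = build_host_filter(ALLOWED_DOMAINS)


def is_allowed(url):
    """Return True if the crawler should follow this URL."""
    return HOST_FILTER.match(url) is not None
```

A crawler would call is_allowed() on every outgoing link and drop the ones that fail, so the crawl never leaves the listed hosts. The check after the domain (optional port, then "/" or end of string) is there so that a host such as apache.org.evil.net is not accepted by accident.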
Stefan
On 10.11.2005 at 10:04, Arun Kumar Sharma wrote:
Hi All,
I want to know how Nutch fits my requirements and how I can best
exploit its features.
Requirement:
Nutch is designed to crawl information systems on the internet
and on intranets. My requirement is that it crawl information present
anywhere. Is Nutch suitable for me? What do I need to extend or update
so that it fits my requirement?
Response awaited.
Thanks in advance for your early response.
Regards,
Arun Kumar Sharma (Tech Lead -Java/J2EE)
Mob: +91.981.529.5761
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers