You can setup a proxy in the nutch-default.xml configuration file.
Am 09.11.2005 um 13:34 schrieb Aled Jones:
Hi all,
Just started playing around with nutch, have got it crawling the
intranet no problem.
My question is how do you get it to go through a web proxy so I can
crawl pages on the internet from my work machine?
Regards
Aled
========================================
Aled Rhys Jones
Software Developer
Innovations
Comtec (Europe) Ltd
========================================
t: +44 (0)1633627500 ext 1426
e: [EMAIL PROTECTED]
w: http://www.comtec-europe.co.uk
6th Floor Gwent House, Gwent Square,
Cwmbran, South Wales. NP44 1PL
========================================
Queens Award Winner for Innovation 2004
========================================
**********************************************************************
**
This e-mail and any attachments are strictly confidential and
intended solely for the addressee. They may contain information
which is covered by legal, professional or other privilege. If you
are not the intended addressee, you must not copy the e-mail or the
attachments, or use them for any purpose or disclose their contents
to any other person. To do so may be unlawful. If you have received
this transmission in error, please notify us as soon as possible
and delete the message and attachments from all places in your
computer where they are stored.
Although we have scanned this e-mail and any attachments for
viruses, it is your responsibility to ensure that they are actually
virus free.