Aled, I guess the other question is what are you trying to do, for example, if you need to automate the crawl you can make a shell script and cron it (well ok, I am using task manager). If you want to watch the logs on the screen in a terminal window, you can tail -f crawl.log it (I am using wintail), I am more than happy to help if you want to automate your nutch jobs.
I automated as much as I could on those processes that I wanted nutch to do, and it sits quietly in the corner doing all the work, merging, indexing, rebuilding, stopping and starting tomcat, so it is possible to automate nutch so that it is 90% stand alone by scripting. Although, its all windows scripting, I am not running on linux, I have no linux scripts. r/d -----Original Message----- From: Aled Jones [mailto:[EMAIL PROTECTED] Sent: Friday, April 28, 2006 6:14 AM To: [email protected] Subject: ATB: Heritrix Thanks for your replies guys. I hadn't realised that the admin gui was already in development. We should be able to cope till it gets released ;-) Thanks again Aled > -----Neges Wreiddiol-----/-----Original Message----- > Oddi wrth/From: Dan Morrill [mailto:[EMAIL PROTECTED] > Anfonwyd/Sent: 28 April 2006 14:07 > At/To: [email protected] > Pwnc/Subject: RE: Heritrix > > Aled, > > I used heritrix before going over to nutch, while it is an > excellent program, with lots of good things to offer, it > didn't quite meet my need, and when designing the > architecture had too many dependencies for me to be comfortable with. > > If you want to run an internet archive though, heritrix can > not be beat, if you want to run a search engine, nutch is a > good choice. > > My personal opinion. > r/d > > -----Original Message----- > From: Aled Jones [mailto:[EMAIL PROTECTED] > Sent: Friday, April 28, 2006 1:59 AM > To: [email protected] > Subject: Heritrix > > Hi > > Anyone used Heritrix (http://crawler.archive.org/) as a > crawler? How does it compare with the Nutch crawler? Can > Nutch serve its crawled > results? Main reason I'm interested is that it has a WUI interface > that might make maintenance for the IT guys easier, although > I know that some of you guys are working on an interface. > > Cheers > Aled > > > ########################################### > > This message has been scanned by F-Secure Anti-Virus for > Microsoft Exchange. > For more information, connect to http://www.f-secure.com/ > ************************************************************** > ********** > This e-mail and any attachments are strictly confidential and > intended solely for the addressee. They may contain > information which is covered by legal, professional or other > privilege. If you are not the intended addressee, you must > not copy the e-mail or the attachments, or use them for any > purpose or disclose their contents to any other person. To do > so may be unlawful. If you have received this transmission in > error, please notify us as soon as possible and delete the > message and attachments from all places in your computer > where they are stored. > > Although we have scanned this e-mail and any attachments for > viruses, it is your responsibility to ensure that they are > actually virus free. > > > > ########################################### This message has been scanned by F-Secure Anti-Virus for Microsoft Exchange. For more information, connect to http://www.f-secure.com/ ************************************************************************ This e-mail and any attachments are strictly confidential and intended solely for the addressee. They may contain information which is covered by legal, professional or other privilege. If you are not the intended addressee, you must not copy the e-mail or the attachments, or use them for any purpose or disclose their contents to any other person. To do so may be unlawful. If you have received this transmission in error, please notify us as soon as possible and delete the message and attachments from all places in your computer where they are stored. Although we have scanned this e-mail and any attachments for viruses, it is your responsibility to ensure that they are actually virus free. =
