Re: log4j problem

2007-01-31 Thread chee wu
Setting the two Java arguments -Dhadoop.log.file and -Dhadoop.log.dir should fix your problem. By the way, please avoid putting many Chinese characters in your mail. - Original Message - From: kauu [EMAIL PROTECTED] To: nutch-dev@lucene.apache.org Sent: Wednesday, January 31, 2007 1:45 PM Subject: log4j
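A minimal sketch of what the two arguments do, assuming the log4j configuration builds its log file path from the hadoop.log.dir and hadoop.log.file system properties. The class name LogPathCheck and the method resolveLogPath are hypothetical, introduced only for illustration; they are not part of Nutch or Hadoop.

```java
import java.io.File;

// Hypothetical helper, not part of Nutch or Hadoop: it only shows how values
// passed as -Dhadoop.log.dir=... and -Dhadoop.log.file=... become visible to
// the JVM as system properties and combine into a single log file path.
public class LogPathCheck {

    // Joins the directory and file name; fails loudly if either is unset,
    // which is the situation the two -D arguments are meant to avoid.
    static String resolveLogPath(String dir, String file) {
        if (dir == null || file == null) {
            throw new IllegalStateException(
                "hadoop.log.dir and hadoop.log.file must both be set");
        }
        return new File(dir, file).getPath();
    }

    public static void main(String[] args) {
        // Values supplied on the command line with -D are read here.
        String dir = System.getProperty("hadoop.log.dir");
        String file = System.getProperty("hadoop.log.file");
        System.out.println(resolveLogPath(dir, file));
    }
}
```

Running it as `java -Dhadoop.log.dir=logs -Dhadoop.log.file=hadoop.log LogPathCheck` prints the combined path; leaving either argument out raises the exception instead.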

Re: Modified date in crawldb

2007-01-25 Thread chee wu
: +44 (788) 695 0483 http://blog.idna-solutions.com - Original Message - From: chee wu [mailto:[EMAIL PROTECTED] Sent: 25 January 2007 13:44 To: nutch-dev@lucene.apache.org Subject: Re: Modified date in crawldb I also had this question a few days ago, and I am using Nutch 0.8.1

Re: Fetcher2

2007-01-24 Thread chee wu
- Armel T. Nene iDNA Solutions Tel: +44 (207) 257 6124 Mobile: +44 (788) 695 0483 http://blog.idna-solutions.com -Original Message- From: chee wu [mailto:[EMAIL PROTECTED] Sent: 24 January 2007 03:59 To: nutch-dev@lucene.apache.org

Re: Fetcher2

2007-01-23 Thread chee wu
Thanks! I successfully ported Fetcher2 to Nutch 0.8.1; it's pretty easy. I can share the code if anyone wants to use it. - Original Message - From: Andrzej Bialecki [EMAIL PROTECTED] To: nutch-dev@lucene.apache.org Sent: Tuesday, January 23, 2007 12:09 AM Subject: Re: Fetcher2 chee wu

Re: Fetcher2

2007-01-22 Thread chee wu
Fetcher2 should be a great help for me, but it seems it can't be integrated with Nutch 0.8.1. Any advice on how to use it with 0.8.1? - Original Message - From: Andrzej Bialecki [EMAIL PROTECTED] To: nutch-dev@lucene.apache.org Sent: Thursday, January 18, 2007 5:18 AM Subject: Fetcher2 Hi all,

nutch81 pages seems were not kept but no error message found

2007-01-03 Thread Chee Wu
Hi all, I am using the crawl tool in Nutch 0.8.1 under Cygwin, trying to retrieve pages from about two thousand websites, and the crawl process has been running for nearly 20 hours. But during the past 10 hours, the fetch status has remained the same as below: TOTAL urls: 165212 retry 0: