RE: problem with nutch

2006-08-25 Thread anton
I tried starting the job tracker without Tomcat. -Original Message- From: Chris Stephens [mailto:[EMAIL PROTECTED] Sent: Wednesday, August 23, 2006 6:16 PM To: nutch-dev@lucene.apache.org Subject: Re: problem with nutch Importance: High This is probably a better question for the user list.

RE: problem with nutch

2006-08-25 Thread anton
To be exact: when I started the job tracker on the given server, only the namenode was loaded. None of the ports from hadoop-default.xml are in use. -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] Sent: Friday, August 25, 2006 10:48 AM To: nutch-dev@lucene.apache.org Subject: RE:

RE: problem with nutch

2006-08-25 Thread anton
In addition, please take a look at the following part of the log: 06/08/25 05:07:59 WARN servlet.WebApplicationContext: Web application not found /spider_kakle_mapred/spider/conf:/spider_ 06/08/25 05:07:59 WARN servlet.WebApplicationContext: Configuration error on

Checking if crawl dir exists ...

2006-08-25 Thread Michael Wechner
Hi, I think it would be very useful if the NutchBean checked whether the crawl dir exists and gave at least a warning in case it doesn't: Index: nutch-0.8/src/java/org/apache/nutch/searcher/NutchBean.java === ---
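
For illustration only: the patch from this message is cut off in the listing, so the following is not that patch. It is a minimal sketch, in plain Java, of the kind of check being proposed. The class name `CrawlDirCheck` and the method `warnIfMissing` are hypothetical, and the code uses only the standard library rather than any Nutch API.

```java
import java.io.File;
import java.util.logging.Logger;

// Hypothetical helper, not part of Nutch: warns when a crawl dir is missing.
public class CrawlDirCheck {
    private static final Logger LOG = Logger.getLogger(CrawlDirCheck.class.getName());

    /**
     * Returns true if crawlDir exists and is a directory.
     * Otherwise logs a warning with the absolute path and returns false.
     */
    public static boolean warnIfMissing(File crawlDir) {
        if (crawlDir.isDirectory()) {
            return true;
        }
        LOG.warning("Crawl dir does not exist or is not a directory: "
                + crawlDir.getAbsolutePath());
        return false;
    }

    public static void main(String[] args) {
        // "crawl" is only an assumed default for this demo.
        File dir = new File(args.length > 0 ? args[0] : "crawl");
        System.out.println(dir + " present: " + warnIfMissing(dir));
    }
}
```

Logging the absolute path is the useful part here: a relative crawl dir resolves against the working directory of the process, which is easy to get wrong when the searcher runs inside a servlet container.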

reading crawl dir from nutch-default.xml

2006-08-25 Thread David Podunavac
Hi, I think this patch will make it much easier to configure Nutch: the crawl dir will be read from nutch-default.xml instead of a path relative to where Nutch was executed. So nutch-default.xml will have its property <name>searcher.dir</name> <value>PATH_TO_CRAWL_DIR</value> <description> and this
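
As a rough sketch of the idea (not the patch itself, which is cut off in the listing), the lookup could work like this: use the configured `searcher.dir` value when it is set, and fall back to a relative default otherwise. `java.util.Properties` stands in for Nutch's own configuration class here, and the class name `SearcherDirLookup` and the fallback value are assumptions made for the example.

```java
import java.io.File;
import java.util.Properties;

// Hypothetical helper, not part of Nutch: resolves the crawl dir from config.
public class SearcherDirLookup {

    /**
     * Returns the directory named by the "searcher.dir" property,
     * or the given fallback path if the property is unset or blank.
     */
    public static File resolve(Properties conf, String fallback) {
        String configured = conf.getProperty("searcher.dir");
        boolean unset = configured == null || configured.trim().isEmpty();
        return new File(unset ? fallback : configured.trim());
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // PATH_TO_CRAWL_DIR is the placeholder used in the message above.
        conf.setProperty("searcher.dir", "PATH_TO_CRAWL_DIR");
        System.out.println(resolve(conf, "crawl").getPath());
    }
}
```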

Re: Checking if crawl dir exists ...

2006-08-25 Thread Stefan Groschupf
Hi Michi, what is your motivation for that? Stefan On 25.08.2006 at 06:52, Michael Wechner wrote: Hi I think it would be very useful if the NutchBean would check if the crawl dir exists and throw at least a warning in case it doesn't: Index:

nutch/lucene question...

2006-08-25 Thread bruce
hi... if it's ok, i've got some basic research questions. can someone tell me if there's a limit to the number of simultaneous websites that nutch/lucene can return...? i'm assuming that nutch/lucene writes the text information from the crawl back to a db. can someone tell me if there's a limit

Re: nutch/lucene question...

2006-08-25 Thread Dennis Kubes
bruce wrote: hi... if it's ok, i've got some basic research questions. can someone tell me if there's a limit to the number of simultaneous websites that nutch/lucene can return...? I assume you are asking about its indexing capacity. If that is the case, it is billions; it is pretty much