I tried start job tracker without tomcat.
-Original Message-
From: Chris Stephens [mailto:[EMAIL PROTECTED]
Sent: Wednesday, August 23, 2006 6:16 PM
To: nutch-dev@lucene.apache.org
Subject: Re: problem with nutch
Importance: High
This is probably a better question for the user list.
If be exacеt. When I started job tracker on given server was loaded only
namenode. All ports from hadoop-default.xml not used.
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Friday, August 25, 2006 10:48 AM
To: nutch-dev@lucene.apache.org
Subject: RE:
In Addition please draw attention on next part of log:
06/08/25 05:07:59 WARN servlet.WebApplicationContext: Web application not
found /spider_kakle_mapred/spider/conf:/spider_
06/08/25 05:07:59 WARN servlet.WebApplicationContext: Configuration error on
Hi
I think it would be very useful if the NutchBean would check if the
crawl dir exists and throw at least a warning
in case it doesn't:
Index: nutch-0.8/src/java/org/apache/nutch/searcher/NutchBean.java
===
---
Hi
i think this patch will make it way easier to configure nutch, crawl dir
will be read from
nutch-default.xml instead of a relative path from where it has been executed
So nutch-default.xml will have its
property
namesearcher.dir/name
valuePATH_TO_CRAWL_DIR/value
description
and this
Hi Michi,
what is your motivation for that?
Stefan
Am 25.08.2006 um 06:52 schrieb Michael Wechner:
Hi
I think it would be very useful if the NutchBean would check if the
crawl dir exists and throw at least a warning
in case it doesn't:
Index:
hi...
if it's ok, i've got some basic research questions.
can someone tell me if there's a limit to the number of simultaneous
websites that nutch/lucence can return...?
i'm assuming the nutch/lucene writes the text information from the crawl
back to a db. can someone tell me if there's a limit
bruce wrote:
hi...
if it's ok, i've got some basic research questions.
can someone tell me if there's a limit to the number of simultaneous
websites that nutch/lucence can return...?
I assume you are asking its indexing capacity. If that is the case it
is billions, it is pretty much