Hi,

I found the solution to the original problem at the beginning of this
mail thread. I am not sure if anybody is still interested in it,
anyway, here it comes:

The problem is very simple. The current nutch-trunk version requires
initial url list to be stored in folder. In other words when usign
crawl command (like the follwoing example "bin/nutch crawl urls -dir
some_dir -depth n") then that urls MUST stands for folder and not for
flat file.

This is one of (minor) changes made to nutch when it matured from
nutch-0.7.x to nutch-0.8. I didn't originally notice this. Again, this
is simple issue but if anybody is still interested....

Regards,
Lukas

On 12/23/05, Stefan Groschupf <[EMAIL PROTECTED]> wrote:
> Hi
>
> > I have been struggling the same problem two days ago. I posted problem
> > to nutch-dev maillist under the following sublect: "nutch-0.8-dev
> > *mapred.input.subdir* problem
>
> As soon I found some time over the next days I will try to reproduce
> the problem.
> Meanwhile it would be good to know if you guys note that problem with
> the nightly
> build and if this also occurs when using a build form the latest
> sources in subversion.
>
> Stefan
>


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_idv37&alloc_id865&op=click
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to