Hello All,

I have been following the Nutch tutorial on
http://wiki.apache.org/nutch/NutchTutorial
which has the following command to be executed

bin/nutch bin/crawl urls -dir crawl -depth 3 -topN 5

which throws me Error: Could not find or load main class bin.crawl

A quick glance through the nutch script in the bin folder reveals that it
matches the second parameter passed to nutch (bin/crawl in this case) and
compares it various nutch options like inject, generate, etc. If no such
match is found, the script executes the parameter as a java class.

This would have worked prior to this git commit:
https://github.com/apache/nutch/commit/d3f2dd1bbad7c9d69a38ef9e6e756003a45da9e7


As a result, the script throws an error when bin/crawl is passed as a
second parameter. I think there is a discrepancy in the tutorials and the
code.

I may be completely off, but any one else facing the same issue?

Thank you.


ᐧ

Reply via email to