I'm upgrading from Nutch 1.4 to Nutch 1.12. I limit this crawl to my seeds, so 
my 1.4 command was:
bin/nutch crawl phfaws -dir crawl -depth 1 -topN 50000

My understanding is that the "crawl" command is deprecated, "-depth" went with 
it, and I need to install the scoring-depth plugin. I'm new to adding plugins. 
The instructions at https://wiki.apache.org/nutch/AboutPlugins give a sample 
command, but I don't know what the official PluginRepository for this plugin is 
and the sample link for the HtmlParser plugin is dead.

I'll appreciate any help. Thank you!

Chip Calhoun
Digital Archivist
Niels Bohr Library & Archives
American Institute of Physics
One Physics Ellipse
College Park, MD  20740
301-209-3180
https://www.aip.org/history-programs/niels-bohr-library

Reply via email to