Hi Arcondo,

The nekohtml jar should be version 0.9.5 and should reside in build/plugins/lib-nekohtml once you build Nutch from source. When you use the default 'runtime' target, the corresponding plugin folders should be copied into runtime/local/plugins. Can you check that the jar has been copied to this directory before attempting to parse the URLs in your segment(s), if you are using 1.x? I'm also assuming that you have parse-html included in the plugin.includes property within nutch-site.xml before building the source.
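
If it helps, the two checks above could be scripted roughly as follows. This is only a sketch, not part of Nutch: the function names are made up for illustration, the jar is matched with the pattern nekohtml*.jar rather than an exact file name, and the location runtime/local/conf/nutch-site.xml is an assumption about where the built config ends up. It also assumes that the plugin.includes value is a regular expression matched against the whole plugin id, and that Python's regex syntax is close enough to Java's for typical values.

```python
import re
import xml.etree.ElementTree as ET
from pathlib import Path


def find_nekohtml_jars(plugins_dir):
    """Return any nekohtml jars found under <plugins_dir>/lib-nekohtml."""
    # Hypothetical helper: the glob pattern is an assumption, not an exact jar name.
    return sorted(Path(plugins_dir, "lib-nekohtml").glob("nekohtml*.jar"))


def plugin_included(site_xml, plugin_id="parse-html"):
    """Return True if plugin.includes in the given XML file matches plugin_id."""
    root = ET.parse(str(site_xml)).getroot()
    for prop in root.iter("property"):
        if prop.findtext("name", "").strip() == "plugin.includes":
            pattern = prop.findtext("value", "").strip()
            # Assumption: the value is a regex applied to the full plugin id.
            return re.fullmatch(pattern, plugin_id) is not None
    # The property is not set in this file, so this file does not include the plugin.
    return False


if __name__ == "__main__":
    # Paths are relative to the Nutch source root; adjust as needed.
    plugins = Path("runtime/local/plugins")
    site = Path("runtime/local/conf/nutch-site.xml")  # assumed location
    print("nekohtml jars:", find_nekohtml_jars(plugins))
    if site.is_file():
        print("parse-html included:", plugin_included(site))
```

An empty list from the first check would suggest the jar was never copied into runtime/local/plugins. A False from the second only means the property is not overridden in that file or does not match parse-html.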

Lewis

On Thu, Jan 3, 2013 at 9:11 PM, Arcondo Dasilva <[email protected]> wrote:
> Thanks for the explanation. I'm more a functional guy with no solid
> background in Java.
> Could you give some details on how to enforce it manually ?
>
> Thanks in advance, Arcondo
>
> On Thu, Jan 3, 2013 at 2:49 PM, Lewis John Mcgibbney <
> [email protected]> wrote:
>
> > the jar is not on the classpath

--
*Lewis*

