Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "ErrorMessagesInNutch2" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/ErrorMessagesInNutch2?action=diff&rev1=4&rev2=5

  
  == Missing plugins whilst running Nutch 2.0 on Cloudera's CDH3 ==
  
- Cloudera's CDH3 is Cloudera's distribution including Apache Hadoop. More 
information can be found 
[[https://ccp.cloudera.com/display/CDHDOC/CDH3+Quick+Start+Guide|here]]. This 
common error results due to a bug in MAPREDUCE-967 which modifies the way 
MapReduce unpacks the job's jar. The old way was to unpack the whole of it, now 
only classes/ and lib/ are unpacked. This way Nutch is missing the plugins/ 
directory. A workaround is to force unpacking of the plugin/ directory. This 
can be done by adding the following properties to nutch-site.xml
+ CDH3 is Cloudera's distribution including Apache Hadoop. More 
information can be found 
[[https://ccp.cloudera.com/display/CDHDOC/CDH3+Quick+Start+Guide|here]]. This 
common error is caused by MAPREDUCE-967, which modifies the way MapReduce 
unpacks the job's jar. Previously the whole jar was unpacked; now only 
classes/ and lib/ are unpacked, so Nutch is missing the plugins/ directory. If 
you install only the CDH3 distro, you are OK. The problem appears when you add 
the Hue distros or try to use a Hadoop installed with the Cloudera SCM 
products. A workaround is to force unpacking of the plugins/ directory by 
adding the following properties to nutch-site.xml:
  {{{
  <property>
  <name>mapreduce.job.jar.unpack.pattern</name>
