Hi, I would like to point out that the Nutch 2.x version is not under active maintenance/development and has been retired. By saying that, you can follow the below steps to upgrade the 2.x to run on Hadoop 3.x:
1. Navigate to the root directory of the Nutch 2.x (local cloned directory). If you have a binary release, clone the repository using https://github.com/apache/nutch/tree/2.x and navigate to the directory. 2. Open ivy/ivy.xml using the editor of you choice. 3. After the line 49, you have <!-- Hadoop Dependencies --> change the hadoop dependencies to 3.x(version which you want). Just to mention here the 1.x has been updated to 3.1.3 and works well there. If you need to check the changes, you can refer to pull request to see the changes which was made to port Hadoop to 3.1.3 in 1.x branch here <https://github.com/apache/nutch/pull/507/files> 4. Make sure to change all the hadoop dependencies in the ivy.xml file. 5. Then go back to the root directory and build the project using "ant runtime". 6. You can find the runnable nutch script in ./runtime/local/bin/nutch and more information on building project from source here <https://cwiki.apache.org/confluence/display/NUTCH/Nutch2Tutorial> This should make the changes for Nutch to use Hadoop 3.x. *Regards* Shashanka Balakuntala Srinivasa On Tue, Jun 23, 2020 at 12:08 PM Gajalakshmi G <[email protected]> wrote: > Hi all, > > > > I am using Nutch 2.3.1 with gora-core.jar of the version 0.6(tried upto > 0.9 version of gora jars). With this versions I am not able to successfully > crawl a site with Hadoop 3.x version. Is the latest Nutch 2.4 version run > on Hadoop 3.x versions? Do we need to do any specific changes to make the > Nutch 2.4 to run on Hadoop 3.x ? > > > > Thanks & Regards, > > Gajalakshmi.G > > Assistant Consultant > > Tata Consultancy Services > Mailto: [email protected]< > https://mail.tcs.com/owa/redir.aspx?C=15cf4bf65eff4bdab465e0a2dd682f11&URL=mailto%3agajalakshmi.g%40tcs.com > > > =====-----=====-----===== > Notice: The information contained in this e-mail > message and/or attachments to it may contain > confidential or privileged information. If you are > not the intended recipient, any dissemination, use, > review, distribution, printing or copying of the > information contained in this e-mail message > and/or attachments to it are strictly prohibited. If > you have received this communication in error, > please notify us by reply e-mail or telephone and > immediately and permanently delete the message > and any attachments. Thank you > > >

