Hi Zhiyuan,
Thanks for the guidance. I'm making progress, but I am still battling initial 
configuration management issues. 
I'm running HDFS and YARN v3.1.4 in pseudo-distributed mode.
My tez-site.xml contains the following:

<configuration>
<property>
  <name>tez.lib.uris</name>
  
<value>${fs.defaultFS}/apps/tez-0.10.1-SNAPSHOT,${fs.defaultFS}/apps/tez-0.10.1-SNAPSHOT/lib,${fs.defaultFS}/apps/nutch</value>
</property>
</configuration>

N.B. When I attempted to use the compressed Tez tar.gz, I ran into classpath 
issues, which are largely covered in the installation documentation you 
pointed me to. I overcame these issues by simply uploading the minimal 
directory. All seems fine at this stage, as I can run all of the Tez examples. 

I run into trouble when I try to run any job from the Nutch application. For 
example, when I run the Injector, one of the Nutch plugin extension points (x 
point org.apache.nutch.net.URLNormalizer) cannot be found. The relevant log 
can be seen at https://paste.apache.org/4whoe.
I should note that the entire Nutch .job is available on HDFS at the URI 
defined in the tez-site.xml above.

The output of jar -tf on the nutch.job artifact can be seen at 
https://paste.apache.org/hl8tk.
Am I required to somehow describe the structural hierarchy of this artifact in 
the tez.lib.uris.classpath configuration property?
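
In case a condensed view of that hierarchy is easier to reason about than the 
full jar -tf listing, here is a rough sketch that counts the entries under each 
top-level directory of the artifact. It assumes the .job file is an ordinary 
zip-format archive (which the jar -tf output suggests); the function name 
summarize_archive and the local path nutch.job are only illustrative.

```python
import zipfile
from collections import Counter


def summarize_archive(path):
    """Count the entries under each top-level name of a zip-format archive."""
    counts = Counter()
    with zipfile.ZipFile(path) as archive:
        for name in archive.namelist():
            # Zip entry names use '/' separators; keep the first component only.
            top = name.split("/", 1)[0]
            counts[top] += 1
    return dict(counts)


# Hypothetical usage, assuming a local copy of the artifact:
#   for top, n in sorted(summarize_archive("nutch.job").items()):
#       print(n, top)
```

This only reports how the archive is laid out; it says nothing about how Tez 
resolves those entries on the classpath.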

Thank you again for any guidance.

lewismc

On 2020/12/14 03:23:48, Zhiyuan Yang <zhiyu...@apache.org> wrote: 
> Hi Lewis,
> 
> If there is no incompatibility, your existing job will run well on Tez
> without code change. You can just follow this guide
> <https://tez.apache.org/install.html> (especially step 4) to try it out.
> 
> Thanks,
> Zhiyuan
> 
> On Mon, Dec 14, 2020 at 9:04 AM Lewis John McGibbney <lewi...@apache.org>
> wrote:
> 
