Hi all,  I am the one trying to run Apache Beam example on cluster:  I used the 
following command with my given input in a folder named 
"harrypotter":#!/bin/bash
HADOOP_CLASSPATH="$HADOOP_CLASSPATH:/tmp/beam/jars/*" hadoop jar 
/tmp/beam/jars/beam-examples-java-2.1.0-SNAPSHOT.jar 
org.apache.beam.examples.complete.TfIdf --runner=ApexRunner 
--embeddedExecution=false --output=apexrunnertfidf 
--input=/tmp/beam/harrypotter/
java -cp 
/homes/org.apache.beam.examples.complete.TfIdf--------------------------------------------------------------------------
However, some configuration seems to go wrong:
Exception in thread "main" java.lang.RuntimeException: Failed to launch the 
application on YARN. at 
org.apache.beam.runners.apex.ApexRunner.run(ApexRunner.java:204) at 
org.apache.beam.runners.apex.ApexRunner.run(ApexRunner.java:82) at 
org.apache.beam.sdk.Pipeline.run(Pipeline.java:297) at 
org.apache.beam.sdk.Pipeline.run(Pipeline.java:283) at 
org.apache.beam.examples.complete.TfIdf.main(TfIdf.java:442) at 
sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) 
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:498) at 
org.apache.hadoop.util.RunJar.run(RunJar.java:234) at 
org.apache.hadoop.util.RunJar.main(RunJar.java:148)Caused by: 
java.io.FileNotFoundException:hadoop/client/dfs.include (No such file or 
directory) at java.io.FileInputStream.open0(Native Method) at 
java.io.FileInputStream.open(FileInputStream.java:195) at 
java.io.FileInputStream.<init>(FileInputStream.java:138) at 
org.apache.commons.io.FileUtils.copyFile(FileUtils.java:1112) at 
org.apache.beam.runners.apex.ApexYarnLauncher$2.visitFile(ApexYarnLauncher.java:277)
 at 
org.apache.beam.runners.apex.ApexYarnLauncher$2.visitFile(ApexYarnLauncher.java:253)
 at java.nio.file.Files.walkFileTree(Files.java:2670) at 
java.nio.file.Files.walkFileTree(Files.java:2742) at 
org.apache.beam.runners.apex.ApexYarnLauncher.createJar(ApexYarnLauncher.java:253)
 at 
org.apache.beam.runners.apex.ApexYarnLauncher.launchApp(ApexYarnLauncher.java:90)
 at org.apache.beam.runners.apex.ApexRunner.run(ApexRunner.java:201)
I checked the :hadoop/client/ folder and found that the dfs.include actually 
exists.May any of you give solution to this?
Claire

Reply via email to