Re: Running Tez with Tachyon
Thank you Hitesh. I will give that a try. Best, -- Jiří Šimša On Mon, Nov 16, 2015 at 11:36 AM, Hitesh Shah wrote: > I am not sure whether the fat-jar is creating an issue. Can you try the > following: > >- convert the fat-jar ( jar with dependencies ) into a directory of > separate jars i.e. a dir with all jars within it ( no nested dirs - just > one top level dir with all jars in it ) >- upload this dir to HDFS >- add this dir to tez.aux.uris ( and remove the fat-jar ) > > Beyond this, if you are familiar with YARN, a few things you can do to > capture more info which can help us help you: >- set "yarn.nodemanager.delete.debug-delay-sec” to say 600 or 900 in > yarn-site.xml ( requires node manager restart ). This retains the container > data after the application completes >- find a launch_container.sh file for any container ( of the > application that is failing ) within local-dirs configured for YARN in your > yarn-site.xml > > The launch_container.sh will tell you what files are being added to the > container’s env and how the classpath is being setup. > > thanks > — Hitesh > > > On Nov 16, 2015, at 8:41 AM, Jiří Šimša wrote: > > > Hi Bikas and Hitesh, > > > > Any suggestion as to how to get tachyon.hadoop.TFS recognized by Tez's > cluster? Thanks. > > > > Best, > > > > -- > > Jiří Šimša > > > > On Thu, Nov 12, 2015 at 10:42 AM, Jiří Šimša > wrote: > > Thank you Bikas and Hitesh for your responses. > > > > I believe the problem is in the cluster. Here is the relevant > information: > > > > 1) My HADOOP_CLASSPATH: > > > > $ hadoop classpath > > > /usr/local/Cellar/hadoop/2.7.1/libexec/etc/hadoop:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/*:/Users/jsimsa/Projects/tez:/Users/jsimsa/Projects/tez/jars/*:/Users/jsimsa/Projects/tez/jars/lib/*:/contrib/capacity-scheduler/*.jar > > > > 2) The contents of /Users/jsimsa/Projects/tez/tez-site.xml: > > > > > > > > > > > > > > > > > > > > > > > > tez.lib.uris > > > > ${fs.defaultFS}/apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz > > > > > > tez.aux.uris > > > > ${fs.defaultFS}/apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > > > > > > > > > > 3) The contents of the /apps HDFS folder: > > > > $ ./bin/hdfs dfs -lsr /apps > > lsr: DEPRECATED: Please use 'ls -R' instead. > > SLF4J: Class path contains multiple SLF4J bindings. > > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class] > > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an > explanation. > > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] > > 15/11/12 10:39:52 WARN util.NativeCodeLoader: Unable to load > native-hadoop library for your platform... using builtin-java classes where > applicable > > drwxr-xr-x - jsimsa supergroup 0 2015-11-11 18:43 > /apps/tachyon-0.8.2-SNAPSHOT > > -rw-r--r-- 1 jsimsa supergroup 43809325 2015-11-11 18:43 > /apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > > drwxr-xr-x - jsimsa supergroup 0 2015-11-11 18:44 > /apps/tez-0.8.2-SNAPSHOT > > -rw-r--r-- 1 jsimsa supergroup 43884378 2015-11-11 18:44 > /apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz > > > > > > 4) Finally, the command I am running and its output: > > > > $ > HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > hadoop jar ./tez-examples/target/tez-examples-0.8.2-SNAPSHOT.jar > orderedwordcount tachyon://localhost:19998/input.txt > tachyon://localhost:19998/output.txt > > SLF4J: Class path contains multiple SLF4J bindings. > > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class] > > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > > SLF4J: Found binding in > [jar:file:/Users/jsimsa/Projects/tez/jars/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinde
Re: Running Tez with Tachyon
I am not sure whether the fat-jar is creating an issue. Can you try the following: - convert the fat-jar ( jar with dependencies ) into a directory of separate jars i.e. a dir with all jars within it ( no nested dirs - just one top level dir with all jars in it ) - upload this dir to HDFS - add this dir to tez.aux.uris ( and remove the fat-jar ) Beyond this, if you are familiar with YARN, a few things you can do to capture more info which can help us help you: - set "yarn.nodemanager.delete.debug-delay-sec” to say 600 or 900 in yarn-site.xml ( requires node manager restart ). This retains the container data after the application completes - find a launch_container.sh file for any container ( of the application that is failing ) within local-dirs configured for YARN in your yarn-site.xml The launch_container.sh will tell you what files are being added to the container’s env and how the classpath is being setup. thanks — Hitesh On Nov 16, 2015, at 8:41 AM, Jiří Šimša wrote: > Hi Bikas and Hitesh, > > Any suggestion as to how to get tachyon.hadoop.TFS recognized by Tez's > cluster? Thanks. > > Best, > > -- > Jiří Šimša > > On Thu, Nov 12, 2015 at 10:42 AM, Jiří Šimša wrote: > Thank you Bikas and Hitesh for your responses. > > I believe the problem is in the cluster. Here is the relevant information: > > 1) My HADOOP_CLASSPATH: > > $ hadoop classpath > /usr/local/Cellar/hadoop/2.7.1/libexec/etc/hadoop:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/*:/Users/jsimsa/Projects/tez:/Users/jsimsa/Projects/tez/jars/*:/Users/jsimsa/Projects/tez/jars/lib/*:/contrib/capacity-scheduler/*.jar > > 2) The contents of /Users/jsimsa/Projects/tez/tez-site.xml: > > > > > > > > > > > > tez.lib.uris > > ${fs.defaultFS}/apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz > > > > tez.aux.uris > > ${fs.defaultFS}/apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > > > > > > 3) The contents of the /apps HDFS folder: > > $ ./bin/hdfs dfs -lsr /apps > lsr: DEPRECATED: Please use 'ls -R' instead. > SLF4J: Class path contains multiple SLF4J bindings. > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an > explanation. > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] > 15/11/12 10:39:52 WARN util.NativeCodeLoader: Unable to load native-hadoop > library for your platform... using builtin-java classes where applicable > drwxr-xr-x - jsimsa supergroup 0 2015-11-11 18:43 > /apps/tachyon-0.8.2-SNAPSHOT > -rw-r--r-- 1 jsimsa supergroup 43809325 2015-11-11 18:43 > /apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > drwxr-xr-x - jsimsa supergroup 0 2015-11-11 18:44 > /apps/tez-0.8.2-SNAPSHOT > -rw-r--r-- 1 jsimsa supergroup 43884378 2015-11-11 18:44 > /apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz > > > 4) Finally, the command I am running and its output: > > $ > HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > hadoop jar ./tez-examples/target/tez-examples-0.8.2-SNAPSHOT.jar > orderedwordcount tachyon://localhost:19998/input.txt > tachyon://localhost:19998/output.txt > SLF4J: Class path contains multiple SLF4J bindings. > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/Users/jsimsa/Projects/tez/jars/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: See http://www.slf4j.org/codes.html
Re: Running Tez with Tachyon
Hi Bikas and Hitesh, Any suggestion as to how to get tachyon.hadoop.TFS recognized by Tez's cluster? Thanks. Best, -- Jiří Šimša On Thu, Nov 12, 2015 at 10:42 AM, Jiří Šimša wrote: > Thank you Bikas and Hitesh for your responses. > > I believe the problem is in the cluster. Here is the relevant information: > > *1) My HADOOP_CLASSPATH:* > > $ hadoop classpath > > /usr/local/Cellar/hadoop/2.7.1/libexec/etc/hadoop:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/*:/Users/jsimsa/Projects/tez:/Users/jsimsa/Projects/tez/jars/*:/Users/jsimsa/Projects/tez/jars/lib/*:/contrib/capacity-scheduler/*.jar > > *2) The contents of /Users/jsimsa/Projects/tez/tez-site.xml:* > > > > > > > > > > > > tez.lib.uris > > ${fs.defaultFS}/apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz > > > tez.aux.uris > > ${fs.defaultFS}/apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > > > > > *3) The contents of the /apps HDFS folder:* > > $ ./bin/hdfs dfs -lsr /apps > lsr: DEPRECATED: Please use 'ls -R' instead. > SLF4J: Class path contains multiple SLF4J bindings. > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an > explanation. > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] > 15/11/12 10:39:52 WARN util.NativeCodeLoader: Unable to load native-hadoop > library for your platform... using builtin-java classes where applicable > drwxr-xr-x - jsimsa supergroup 0 2015-11-11 18:43 > /apps/tachyon-0.8.2-SNAPSHOT > -rw-r--r-- 1 jsimsa supergroup 43809325 2015-11-11 18:43 > /apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > drwxr-xr-x - jsimsa supergroup 0 2015-11-11 18:44 > /apps/tez-0.8.2-SNAPSHOT > -rw-r--r-- 1 jsimsa supergroup 43884378 2015-11-11 18:44 > /apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz > > > *4) Finally, the command I am running and its output:* > > $ > HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar > hadoop jar ./tez-examples/target/tez-examples-0.8.2-SNAPSHOT.jar > orderedwordcount tachyon://localhost:19998/input.txt > tachyon://localhost:19998/output.txt > SLF4J: Class path contains multiple SLF4J bindings. > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/Users/jsimsa/Projects/tez/jars/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: Found binding in > [jar:file:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class] > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an > explanation. > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory] > 15/11/12 10:37:29 WARN util.NativeCodeLoader: Unable to load native-hadoop > library for your platform... using builtin-java classes where applicable > 15/11/12 10:37:29 INFO client.TezClient: Tez Client Version: [ > component=tez-api, version=0.8.2-SNAPSHOT, > revision=6562a9d882fc455f511dd9d93af1d159d3e3e71b, SCM-URL=scm:git: > https://git-wip-us.apache.org/repos/asf/tez.git, > buildTime=2015-11-11T19:44:28Z ] > 15/11/12 10:37:29 INFO client.RMProxy: Connecting to ResourceManager at / > 0.0.0.0:8032 > 15/11/12 10:37:30 INFO : initialize(tachyon://localhost:19998/input.txt, > Configuration: core-default.xml, core-site.xml, mapred-default.xml, > mapred-site.xml, yarn-default.xml, yarn-site.xml, hdfs-default.xml, > hdfs-site.xml, tez-site.xml). Connecting to Tachyon: > tachyon://localhost:19998/input.txt > 15/11/12 10:37:30 INFO : Loading Tachyon properties from Hadoop > configuration: {} > 15/11/12 10:37:30 INFO : Tachyon client (version 0.8.2-SNAPSHO
Re: Overview documentation for application developers?
That is entirely up to you. If you want to go natively agains Tez, then you can use backend as an example, but I would suggest to just simply use samples from Tez project. But if you want to simply build an App that runs in Tez and not worry about Tez internals you can just use DStream backed by Tez. Oleg On Nov 16, 2015, at 10:32 AM, Marshall Bockrath-Vandegrift mailto:llas...@gmail.com>> wrote: Are you suggesting I should check out DStream's Tez backend as an example, or that I should consider building on top of DStream instead? Thanks, On Mon, Nov 16, 2015 at 8:48 AM Oleg Zhurakousky mailto:ozhurakou...@hortonworks.com>> wrote: If you are using Java 8 you can try this https://github.com/hortonworks/dstream Cheers Oleg On Nov 16, 2015, at 8:26 AM, Marshall Bockrath-Vandegrift mailto:llas...@gmail.com>> wrote: Hi, I'm interested in implementing a new Tez application (re-writing my Parkour Clojure library in terms Tez), but I'm having difficulty finding documentation which links the Tez high-level concepts to the implementation details necessary to actually develop a new Tez application. Am I missing something, or are e.g. the examples the closest to what I'm looking for? Thanks! -Marshall
Re: Overview documentation for application developers?
Are you suggesting I should check out DStream's Tez backend as an example, or that I should consider building on top of DStream instead? Thanks, On Mon, Nov 16, 2015 at 8:48 AM Oleg Zhurakousky < ozhurakou...@hortonworks.com> wrote: > If you are using Java 8 you can try this > https://github.com/hortonworks/dstream > > > Cheers > Oleg > > On Nov 16, 2015, at 8:26 AM, Marshall Bockrath-Vandegrift < > llas...@gmail.com> wrote: > > Hi, > > I'm interested in implementing a new Tez application (re-writing my > Parkour Clojure library in terms Tez), but I'm having difficulty > finding documentation which links the Tez high-level concepts to the > implementation details necessary to actually develop a new Tez > application. Am I missing something, or are e.g. the examples the > closest to what I'm looking for? > > Thanks! > > -Marshall > > >
Re: Overview documentation for application developers?
If you are using Java 8 you can try this https://github.com/hortonworks/dstream Cheers Oleg On Nov 16, 2015, at 8:26 AM, Marshall Bockrath-Vandegrift mailto:llas...@gmail.com>> wrote: Hi, I'm interested in implementing a new Tez application (re-writing my Parkour Clojure library in terms Tez), but I'm having difficulty finding documentation which links the Tez high-level concepts to the implementation details necessary to actually develop a new Tez application. Am I missing something, or are e.g. the examples the closest to what I'm looking for? Thanks! -Marshall
Overview documentation for application developers?
Hi, I'm interested in implementing a new Tez application (re-writing my Parkour Clojure library in terms Tez), but I'm having difficulty finding documentation which links the Tez high-level concepts to the implementation details necessary to actually develop a new Tez application. Am I missing something, or are e.g. the examples the closest to what I'm looking for? Thanks! -Marshall