Re: Running Tez with Tachyon

2015-11-16 Thread Jiří Šimša
Thank you Hitesh. I will give that a try.

Best,

--
Jiří Šimša

On Mon, Nov 16, 2015 at 11:36 AM, Hitesh Shah wrote:

> I am not sure whether the fat-jar is creating an issue. Can you try the
> following:
>
>- convert the fat-jar ( jar with dependencies ) into a directory of
> separate jars i.e. a dir with all jars within it ( no nested dirs - just
> one top level dir with all jars in it )
>- upload this dir to HDFS
>- add this dir to tez.aux.uris ( and remove the fat-jar )
>
> Beyond this, if you are familiar with YARN, a few things you can do to
> capture more info which can help us help you:
>- set "yarn.nodemanager.delete.debug-delay-sec" to, say, 600 or 900 in
> yarn-site.xml ( requires node manager restart ). This retains the container
> data after the application completes
>- find a launch_container.sh file for any container ( of the
> application that is failing ) within local-dirs configured for YARN in your
> yarn-site.xml
>
> The launch_container.sh will tell you what files are being added to the
> container’s env and how the classpath is being set up.
>
> thanks
> — Hitesh
>
>
> On Nov 16, 2015, at 8:41 AM, Jiří Šimša wrote:
>
> > Hi Bikas and Hitesh,
> >
> > Any suggestion as to how to get tachyon.hadoop.TFS recognized by Tez's
> cluster? Thanks.
> >
> > Best,
> >
> > --
> > Jiří Šimša
> >
> > On Thu, Nov 12, 2015 at 10:42 AM, Jiří Šimša 
> wrote:
> > Thank you Bikas and Hitesh for your responses.
> >
> > I believe the problem is in the cluster. Here is the relevant
> information:
> >
> > 1) My HADOOP_CLASSPATH:
> >
> > $ hadoop classpath
> >
> /usr/local/Cellar/hadoop/2.7.1/libexec/etc/hadoop:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/*:/Users/jsimsa/Projects/tez:/Users/jsimsa/Projects/tez/jars/*:/Users/jsimsa/Projects/tez/jars/lib/*:/contrib/capacity-scheduler/*.jar
> >
> > 2) The contents of /Users/jsimsa/Projects/tez/tez-site.xml:
> >
> > <configuration>
> >   <property>
> >     <name>tez.lib.uris</name>
> >     <value>${fs.defaultFS}/apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz</value>
> >   </property>
> >   <property>
> >     <name>tez.aux.uris</name>
> >     <value>${fs.defaultFS}/apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar</value>
> >   </property>
> > </configuration>
> >
> > 3) The contents of the /apps HDFS folder:
> >
> > $ ./bin/hdfs dfs -lsr /apps
> > lsr: DEPRECATED: Please use 'ls -R' instead.
> > SLF4J: Class path contains multiple SLF4J bindings.
> > SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> > SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> > SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
> explanation.
> > SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
> > 15/11/12 10:39:52 WARN util.NativeCodeLoader: Unable to load
> native-hadoop library for your platform... using builtin-java classes where
> applicable
> > drwxr-xr-x   - jsimsa supergroup  0 2015-11-11 18:43
> /apps/tachyon-0.8.2-SNAPSHOT
> > -rw-r--r--   1 jsimsa supergroup   43809325 2015-11-11 18:43
> /apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar
> > drwxr-xr-x   - jsimsa supergroup  0 2015-11-11 18:44
> /apps/tez-0.8.2-SNAPSHOT
> > -rw-r--r--   1 jsimsa supergroup   43884378 2015-11-11 18:44
> /apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz
> >
> >
> > 4) Finally, the command I am running and its output:
> >
> > $
> HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar
> hadoop jar ./tez-examples/target/tez-examples-0.8.2-SNAPSHOT.jar
> orderedwordcount tachyon://localhost:19998/input.txt
> tachyon://localhost:19998/output.txt
> > SLF4J: Class path contains multiple SLF4J bindings.
> > SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> > SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> > SLF4J: Found binding in
> [jar:file:/Users/jsimsa/Projects/tez/jars/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinde

Re: Running Tez with Tachyon

2015-11-16 Thread Hitesh Shah
I am not sure whether the fat-jar is creating an issue. Can you try the 
following: 

   - convert the fat-jar ( jar with dependencies ) into a directory of separate 
jars i.e. a dir with all jars within it ( no nested dirs - just one top level 
dir with all jars in it )
   - upload this dir to HDFS
   - add this dir to tez.aux.uris ( and remove the fat-jar )
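
The three steps above could be sketched roughly as follows. This is only an illustration under assumptions: the function names, the example paths, and the "lib" directory name are hypothetical, and it assumes the individual dependency jars are already available somewhere on local disk (for example from a Maven build; the thread does not say how they are produced).

```shell
#!/bin/sh
# Sketch only: function names, paths and the "lib" dir name are assumptions.

# Copy every *.jar found anywhere under $1 into the single flat directory $2
# (one top-level dir, no nested dirs). Same-named jars overwrite each other.
# $2 should not live inside $1.
flatten_jars() {
  src_dir=$1
  dest_dir=$2
  mkdir -p "$dest_dir"
  find "$src_dir" -type f -name '*.jar' -exec cp {} "$dest_dir"/ \;
}

# Upload the jars in the flat directory to HDFS.
# Requires a working `hdfs` CLI; not called automatically by this sketch.
upload_jars() {
  local_dir=$1
  hdfs_dir=$2
  hdfs dfs -mkdir -p "$hdfs_dir"
  hdfs dfs -put -f "$local_dir"/*.jar "$hdfs_dir"/
}

# Print a tez-site.xml property that points tez.aux.uris at the HDFS dir.
# Single quotes keep ${fs.defaultFS} literal so Hadoop expands it, not sh.
aux_uris_property() {
  hdfs_dir=$1
  printf '<property>\n  <name>tez.aux.uris</name>\n  <value>${fs.defaultFS}%s</value>\n</property>\n' "$hdfs_dir"
}

# Example (hypothetical paths):
#   flatten_jars ./target/dependency ./tachyon-jars
#   upload_jars ./tachyon-jars /apps/tachyon-0.8.2-SNAPSHOT/lib
#   aux_uris_property /apps/tachyon-0.8.2-SNAPSHOT/lib
```

The printed property would replace the existing tez.aux.uris entry that points at the fat-jar.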

Beyond this, if you are familiar with YARN, a few things you can do to capture 
more info which can help us help you: 
   - set "yarn.nodemanager.delete.debug-delay-sec" to, say, 600 or 900 in
yarn-site.xml ( requires node manager restart ). This retains the container 
data after the application completes
   - find a launch_container.sh file for any container ( of the application 
that is failing ) within local-dirs configured for YARN in your yarn-site.xml 

The launch_container.sh will tell you what files are being added to the 
container’s env and how the classpath is being set up.
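
A rough sketch of these debugging steps is below. The function names and the example local dir are hypothetical; the directory to search is whatever yarn.nodemanager.local-dirs is set to in your yarn-site.xml.

```shell
#!/bin/sh
# Sketch only: function names and example paths are illustrative assumptions.

# Print the yarn-site.xml property that delays cleanup of container data.
# The NodeManager has to be restarted after adding it.
debug_delay_property() {
  seconds=${1:-600}
  printf '<property>\n  <name>yarn.nodemanager.delete.debug-delay-sec</name>\n  <value>%s</value>\n</property>\n' "$seconds"
}

# List launch_container.sh files under one YARN local dir, optionally
# restricted to paths that contain the given application id.
find_launch_scripts() {
  local_dir=$1
  app_id=${2:-}
  find "$local_dir" -type f -name launch_container.sh -path "*${app_id}*"
}

# Show the lines of a launch script that mention CLASSPATH, with line numbers.
show_classpath() {
  grep -n 'CLASSPATH' "$1"
}

# Example (hypothetical local dir and application id):
#   debug_delay_property 900
#   find_launch_scripts /tmp/hadoop-yarn/nm-local-dir application_1447350000000_0001
#   show_classpath /path/to/one/launch_container.sh
```

Checking whether the Tachyon client jar shows up in the CLASSPATH lines of the failing application's launch script should indicate whether tez.aux.uris is being picked up.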

thanks
— Hitesh 


On Nov 16, 2015, at 8:41 AM, Jiří Šimša wrote:

> Hi Bikas and Hitesh,
> 
> Any suggestion as to how to get tachyon.hadoop.TFS recognized by Tez's 
> cluster? Thanks.
> 
> Best,
> 
> --
> Jiří Šimša
> 
> On Thu, Nov 12, 2015 at 10:42 AM, Jiří Šimša wrote:
> Thank you Bikas and Hitesh for your responses.
> 
> I believe the problem is in the cluster. Here is the relevant information:
> 
> 1) My HADOOP_CLASSPATH:
> 
> $ hadoop classpath
> /usr/local/Cellar/hadoop/2.7.1/libexec/etc/hadoop:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/*:/Users/jsimsa/Projects/tez:/Users/jsimsa/Projects/tez/jars/*:/Users/jsimsa/Projects/tez/jars/lib/*:/contrib/capacity-scheduler/*.jar
> 
> 2) The contents of /Users/jsimsa/Projects/tez/tez-site.xml:
> 
> <configuration>
>   <property>
>     <name>tez.lib.uris</name>
>     <value>${fs.defaultFS}/apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz</value>
>   </property>
>   <property>
>     <name>tez.aux.uris</name>
>     <value>${fs.defaultFS}/apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar</value>
>   </property>
> </configuration>
> 
> 3) The contents of the /apps HDFS folder:
> 
> $ ./bin/hdfs dfs -lsr /apps
> lsr: DEPRECATED: Please use 'ls -R' instead.
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in 
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in 
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an 
> explanation.
> SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
> 15/11/12 10:39:52 WARN util.NativeCodeLoader: Unable to load native-hadoop 
> library for your platform... using builtin-java classes where applicable
> drwxr-xr-x   - jsimsa supergroup  0 2015-11-11 18:43 
> /apps/tachyon-0.8.2-SNAPSHOT
> -rw-r--r--   1 jsimsa supergroup   43809325 2015-11-11 18:43 
> /apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar
> drwxr-xr-x   - jsimsa supergroup  0 2015-11-11 18:44 
> /apps/tez-0.8.2-SNAPSHOT
> -rw-r--r--   1 jsimsa supergroup   43884378 2015-11-11 18:44 
> /apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz
> 
> 
> 4) Finally, the command I am running and its output:
> 
> $ 
> HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar
>  hadoop jar ./tez-examples/target/tez-examples-0.8.2-SNAPSHOT.jar 
> orderedwordcount tachyon://localhost:19998/input.txt 
> tachyon://localhost:19998/output.txt
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in 
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in 
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in 
> [jar:file:/Users/jsimsa/Projects/tez/jars/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in 
> [jar:file:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html

Re: Running Tez with Tachyon

2015-11-16 Thread Jiří Šimša
Hi Bikas and Hitesh,

Any suggestion as to how to get tachyon.hadoop.TFS recognized by Tez's
cluster? Thanks.

Best,

--
Jiří Šimša

On Thu, Nov 12, 2015 at 10:42 AM, Jiří Šimša wrote:

> Thank you Bikas and Hitesh for your responses.
>
> I believe the problem is in the cluster. Here is the relevant information:
>
> *1) My HADOOP_CLASSPATH:*
>
> $ hadoop classpath
>
> /usr/local/Cellar/hadoop/2.7.1/libexec/etc/hadoop:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/hdfs/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/yarn/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/lib/*:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/mapreduce/*:/Users/jsimsa/Projects/tez:/Users/jsimsa/Projects/tez/jars/*:/Users/jsimsa/Projects/tez/jars/lib/*:/contrib/capacity-scheduler/*.jar
>
> *2) The contents of /Users/jsimsa/Projects/tez/tez-site.xml:*
>
> <configuration>
>   <property>
>     <name>tez.lib.uris</name>
>     <value>${fs.defaultFS}/apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz</value>
>   </property>
>   <property>
>     <name>tez.aux.uris</name>
>     <value>${fs.defaultFS}/apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar</value>
>   </property>
> </configuration>
>
> *3) The contents of the /apps HDFS folder:*
>
> $ ./bin/hdfs dfs -lsr /apps
> lsr: DEPRECATED: Please use 'ls -R' instead.
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
> explanation.
> SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
> 15/11/12 10:39:52 WARN util.NativeCodeLoader: Unable to load native-hadoop
> library for your platform... using builtin-java classes where applicable
> drwxr-xr-x   - jsimsa supergroup  0 2015-11-11 18:43
> /apps/tachyon-0.8.2-SNAPSHOT
> -rw-r--r--   1 jsimsa supergroup   43809325 2015-11-11 18:43
> /apps/tachyon-0.8.2-SNAPSHOT/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar
> drwxr-xr-x   - jsimsa supergroup  0 2015-11-11 18:44
> /apps/tez-0.8.2-SNAPSHOT
> -rw-r--r--   1 jsimsa supergroup   43884378 2015-11-11 18:44
> /apps/tez-0.8.2-SNAPSHOT/tez-0.8.2-SNAPSHOT.tar.gz
>
>
> *4) Finally, the command I am running and its output:*
>
> $
> HADOOP_CLASSPATH=${HADOOP_CLASSPATH}:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar
> hadoop jar ./tez-examples/target/tez-examples-0.8.2-SNAPSHOT.jar
> orderedwordcount tachyon://localhost:19998/input.txt
> tachyon://localhost:19998/output.txt
> SLF4J: Class path contains multiple SLF4J bindings.
> SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in
> [jar:file:/usr/local/Cellar/hadoop/2.7.1/libexec/share/hadoop/common/lib/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in
> [jar:file:/Users/jsimsa/Projects/tez/jars/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: Found binding in
> [jar:file:/Users/jsimsa/Projects/tachyon-amplab/clients/client/target/tachyon-client-0.8.2-SNAPSHOT-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
> SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an
> explanation.
> SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
> 15/11/12 10:37:29 WARN util.NativeCodeLoader: Unable to load native-hadoop
> library for your platform... using builtin-java classes where applicable
> 15/11/12 10:37:29 INFO client.TezClient: Tez Client Version: [
> component=tez-api, version=0.8.2-SNAPSHOT,
> revision=6562a9d882fc455f511dd9d93af1d159d3e3e71b, SCM-URL=scm:git:
> https://git-wip-us.apache.org/repos/asf/tez.git,
> buildTime=2015-11-11T19:44:28Z ]
> 15/11/12 10:37:29 INFO client.RMProxy: Connecting to ResourceManager at /
> 0.0.0.0:8032
> 15/11/12 10:37:30 INFO : initialize(tachyon://localhost:19998/input.txt,
> Configuration: core-default.xml, core-site.xml, mapred-default.xml,
> mapred-site.xml, yarn-default.xml, yarn-site.xml, hdfs-default.xml,
> hdfs-site.xml, tez-site.xml). Connecting to Tachyon:
> tachyon://localhost:19998/input.txt
> 15/11/12 10:37:30 INFO : Loading Tachyon properties from Hadoop
> configuration: {}
> 15/11/12 10:37:30 INFO : Tachyon client (version 0.8.2-SNAPSHO

Re: Overview documentation for application developers?

2015-11-16 Thread Oleg Zhurakousky
That is entirely up to you. If you want to program natively against Tez, you
can use the DStream Tez backend as an example, but I would suggest simply using
the samples from the Tez project.
If instead you want to build an app that runs on Tez without worrying about Tez
internals, you can just use DStream backed by Tez.

Oleg

On Nov 16, 2015, at 10:32 AM, Marshall Bockrath-Vandegrift
<llas...@gmail.com> wrote:

Are you suggesting I should check out DStream's Tez backend as an example, or 
that I should consider building on top of DStream instead?

Thanks,

On Mon, Nov 16, 2015 at 8:48 AM Oleg Zhurakousky
<ozhurakou...@hortonworks.com> wrote:
If you are using Java 8 you can try this
https://github.com/hortonworks/dstream


Cheers
Oleg

On Nov 16, 2015, at 8:26 AM, Marshall Bockrath-Vandegrift
<llas...@gmail.com> wrote:

Hi,

I'm interested in implementing a new Tez application (re-writing my
Parkour Clojure library in terms of Tez), but I'm having difficulty
finding documentation which links the Tez high-level concepts to the
implementation details necessary to actually develop a new Tez
application.  Am I missing something, or are e.g. the examples the
closest to what I'm looking for?

Thanks!

-Marshall





Re: Overview documentation for application developers?

2015-11-16 Thread Marshall Bockrath-Vandegrift
Are you suggesting I should check out DStream's Tez backend as an example,
or that I should consider building on top of DStream instead?

Thanks,

On Mon, Nov 16, 2015 at 8:48 AM Oleg Zhurakousky <
ozhurakou...@hortonworks.com> wrote:

> If you are using Java 8 you can try this
> https://github.com/hortonworks/dstream
>
>
> Cheers
> Oleg
>
> On Nov 16, 2015, at 8:26 AM, Marshall Bockrath-Vandegrift <
> llas...@gmail.com> wrote:
>
> Hi,
>
> I'm interested in implementing a new Tez application (re-writing my
> Parkour Clojure library in terms of Tez), but I'm having difficulty
> finding documentation which links the Tez high-level concepts to the
> implementation details necessary to actually develop a new Tez
> application.  Am I missing something, or are e.g. the examples the
> closest to what I'm looking for?
>
> Thanks!
>
> -Marshall
>
>
>


Re: Overview documentation for application developers?

2015-11-16 Thread Oleg Zhurakousky
If you are using Java 8 you can try this
https://github.com/hortonworks/dstream


Cheers
Oleg

On Nov 16, 2015, at 8:26 AM, Marshall Bockrath-Vandegrift
<llas...@gmail.com> wrote:

Hi,

I'm interested in implementing a new Tez application (re-writing my
Parkour Clojure library in terms of Tez), but I'm having difficulty
finding documentation which links the Tez high-level concepts to the
implementation details necessary to actually develop a new Tez
application.  Am I missing something, or are e.g. the examples the
closest to what I'm looking for?

Thanks!

-Marshall




Overview documentation for application developers?

2015-11-16 Thread Marshall Bockrath-Vandegrift
Hi,

I'm interested in implementing a new Tez application (re-writing my
Parkour Clojure library in terms of Tez), but I'm having difficulty
finding documentation which links the Tez high-level concepts to the
implementation details necessary to actually develop a new Tez
application.  Am I missing something, or are e.g. the examples the
closest to what I'm looking for?

Thanks!

-Marshall