Hi Oliver,

I tested at a cluster with gpfs filesystem, so the files should be
available on every node at the same paths. On every node a "java -version"
shows the same: java version "1.8.0_60". I have APGAS not in a jar, but the
normal sources in my project. That is not a problem, right? So I tried to
start my program with:
    java -cp .:../lib/hazelcast-3.5.2.jar
-Dapgas.launcher=apgas.impl.SshLauncher -Dapgas.hostfile=hostfile
-Dapgas.places=2 examples.ResilientHelloWorld


This is the output:
[APGAS] Spawning new place: ssh -t -t its-cs194.its.uni-kassel.de java
-Xbootclasspath:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/resources.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/rt.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/sunrsasign.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/jsse.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/jce.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/charsets.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/jfr.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/classes
-cp .:../lib/hazelcast-3.5.2.jar -Dapgas.hostfile=hostfile
-Dapgas.serialization.exception=true -Dapgas.places=2
-Dapgas.launcher.verbose=true -Dapgas.resilient=true
-Dapgas.launcher=apgas.impl.SshLauncher -Dapgas.my.master=
192.168.205.103:5701 apgas.GlobalRuntime
tcgetattr: Invalid argument
Error: Could not find or load main class apgas.GlobalRuntime
Connection to its-cs194.its.uni-kassel.de closed.


The hostfile only contains: "its-cs194.its.uni-kassel.de"
Where I started my programm "pwd" shows:
"/home/users/0019/uk000618/APGAS/bin"


With a simple "java -cp .:../lib/hazelcast-3.5.2.jar -Dapgas.places=2
examples.ResilientHelloWorld" it starts normal and as expected with output:
[APGAS] Spawning new place: java
-Xbootclasspath:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/resources.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/rt.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/sunrsasign.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/jsse.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/jce.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/charsets.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/lib/jfr.jar:/gpfs/home08/0019/uk000618/jdk1.8.0_60/jre/classes
-cp .:../lib/hazelcast-3.5.2.jar -Dapgas.serialization.exception=true
-Dapgas.places=2 -Dapgas.launcher.verbose=true -Dapgas.resilient=true
-Dapgas.my.master=192.168.205.103:5701 apgas.GlobalRuntime
Running main at place(0) of 2 places
0: There are 2 places
1: There are 2 places
....



Many greetings
Jonas

2015-09-29 3:12 GMT+02:00 Olivier Tardieu <tard...@us.ibm.com>:

> Jonas,
>
> I did not explain the requirements...
>
> The ssh launcher expects the file system to look the same across the nodes.
> The compiled classes and the jars (apgas, hazelcast) should be available
> at the same paths.
> This is typically achieved by using a distributed file system.
> Pushing the files using rsync, scp... before launching is also an option.
>
> BTW, "java" is also expected to resolve to a compatible Java 8 jvm on all
> the nodes.
> But an explicit path can be provided using property
> -Dapgas.java=/path/to/java
>
> You can specify "-Dapgas.launcher.verbose=true" to debug the invoked
> command line.
>
> Olivier
>
>
> Jonas Posner <jonas.pos...@student.uni-kassel.de> wrote on 09/28/2015
> 02:45:48 PM:
>
> > From: Jonas Posner <jonas.pos...@student.uni-kassel.de>
> > To: Mailing list for users of the X10 programming language <x10-
> > us...@lists.sourceforge.net>
> > Date: 09/28/2015 02:46 PM
> > Subject: Re: [X10-users] APGAS lib: Adding Places across different
> mashines
> >
> > Hi Olivier,
> >
> > thank you a lot for your effort and replies.
> >
> > Until last week I used an older version from your git. After your
> > first answer I updatet it. The options "-Dapgas.my.ip=" and "-
> > Dapgas.my.master=" fixed my problems with the ResilientHelloWorld
> > example. It works with two different mashines. After this simple
> > test I tried something to let the first place wait for additional
> > manually started places. Your mail today helps me a lot. The option
> > "apgas.impl.NoLauncher" does exactly what you described. It works
> > local and at the cluster, that I use. My "real" program runs very
> > good with the NoLauncher, too.
> >
> > Currently the ssh launcher does not work for me. I created a
> > hostfile with one node. Then I started the program with:
> >     java -cp .:../lib/hazelcast-3.5.2.jar -
> > Dapgas.launcher=apgas.impl.SshLauncher -Dapgas.hostfile=hostfile -
> > Dapgas.places=2 examples.ResilientHelloWorld
> > The ssh connection to the node in the hostfile is successfull, but
> > then there is an error:
> >     Error: Could not find or load main class apgas.GlobalRuntime
> >     Connection to xxx closed.
> > Can you help me there?
> >
> > Many greetings
> > Jonas
> >
> > 2015-09-28 16:55 GMT+02:00 Olivier Tardieu <tard...@us.ibm.com>:
> > Jonas,
> >
> > I pushed new launchers to git.
> >
> > I added a new ssh launcher.
> > java -Dapgas.launcher=apgas.impl.SshLauncher
> > -Dapgas.hostfile=/home/tardieu/hostfile -Dapgas.places=4 ...
> > It uses ssh to run on remote nodes. Place 0 always run on localhost. The
> > hostfile should contain the hosts for the remaining places. 1 per line.
> >
> > I added a new dummy launcher.
> > java -Dapgas.launcher=apgas.impl.NoLauncher -Dapgas.places=4 ...
> > This launcher does not do anything! It makes it possible for the user to
> > launch places explicitly instead of relying on a launcher.
> > Place 0 will wait for 3 other places to connect before running main.
> >
> > I added the -Dapgas.laucher.verbose property.
> > If set, the launcher prints the command executed to launch additional
> > places.
> > Use in combination with apgas.impl.NoLauncher to know the command to
> > invoke.
> >
> > Let me know if you still have issues running on multiple nodes.
> >
> > Olivier
> >
> >
> > Jonas Posner <jonas.pos...@student.uni-kassel.de> wrote on 09/23/2015
> > 09:30:58 AM:
> >
> > > From: Jonas Posner <jonas.pos...@student.uni-kassel.de>
> > > To: x10-users@lists.sourceforge.net
> > > Date: 09/23/2015 09:31 AM
> > > Subject: [X10-users] APGAS lib: Adding Places across different mashines
> > >
> > > Hi all,
> > >
> > > I'm playing around with the APGAS library for Java. I want to add
> > > some places to a running application using the ip:port adress. I use
> > > for testing the ResilientWorld Class in examples. Local works it
> > > fine with the commands:
> > > 1.application: java -cp .:../lib/hazelcast-3.4.jar -Dapgas.places=1
> > > examples.ResilientHelloWorld
> > > 2.application: java -cp .:../lib/hazelcast-3.4.jar -Dapgas.places=1
> > > -Dapgas.master=localhost examples.ResilientHelloWorld
> > >
> > > The second application connects to the first and the output shows
> > > two places as expected. Application two can crash and app one
> > > catches a DeadPlaceException. Everything fine and as expected.
> > >
> > > But now i want to start the applications at tow different machines.
> > > Instead of "localhost" i wrote the ip of the first application.
> > > Additionally I added the port :5701. But application two doesn't
> > > connect. Both applications run alone. I also switched the port in
> > > the Transport Class of APGAS, without  success. I don't know what to
> > > do now. What requirements must be met?
> > >
> > > Thanks and many greetings
> > > Jonas Posner
> > >
> >
> ------------------------------------------------------------------------------
> > > Monitor Your Dynamic Infrastructure at Any Scale With Datadog!
> > > Get real-time metrics from all of your servers, apps and tools
> > > in one place.
> > > SourceForge users - Click here to start your Free Trial of Datadog now!
> > > http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140
> > > _______________________________________________
> > > X10-users mailing list
> > > X10-users@lists.sourceforge.net
> > > https://lists.sourceforge.net/lists/listinfo/x10-users
> >
> >
> >
> >
> ------------------------------------------------------------------------------
> > _______________________________________________
> > X10-users mailing list
> > X10-users@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/x10-users
> >
> ------------------------------------------------------------------------------
> > _______________________________________________
> > X10-users mailing list
> > X10-users@lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/x10-users
>
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> X10-users mailing list
> X10-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/x10-users
>
>
------------------------------------------------------------------------------
_______________________________________________
X10-users mailing list
X10-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/x10-users

Reply via email to