Hi Howard,

> Could you double check that on the linux box you are using an ompi install
> which has java support?

Yes, I have a script file that I call with the Open MPI version that I want
to build so that I can't forget to use an empty directory, to remove the
last installation before installing the new one, and so on. The strange thing
is that I cannot reproduce the error today. I've no idea why it didn't work
two days ago. Nevertheless I'm happy that it works now. Thank you very much
for your help which forced me to try again.

linpc1 java 110 ls -l /usr/local/openmpi-1.9.0_64_gcc/bin/mpijavac
lrwxrwxrwx 1 root root 11 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/bin/mpijavac -> mpijavac.pl
linpc1 java 111 ls -l /usr/local/openmpi-1.9.0_64_gcc/lib64/*java*
-rwxr-xr-x 1 root root   1170 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/libmpi_java.la
lrwxrwxrwx 1 root root     20 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/libmpi_java.so -> libmpi_java.so.0.0.0
lrwxrwxrwx 1 root root     20 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/libmpi_java.so.0 -> libmpi_java.so.0.0.0
-rwxr-xr-x 1 root root 538243 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/libmpi_java.so.0.0.0
-rwxr-xr-x 1 root root   1239 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/liboshmem_java.la
lrwxrwxrwx 1 root root     23 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/liboshmem_java.so -> 
liboshmem_java.so.0.0.0
lrwxrwxrwx 1 root root     23 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/liboshmem_java.so.0 -> 
liboshmem_java.so.0.0.0
-rwxr-xr-x 1 root root 169198 Apr 21 07:52 
/usr/local/openmpi-1.9.0_64_gcc/lib64/liboshmem_java.so.0.0.0
linpc1 java 112 


tyr fd1026 104 mpiexec -np 6 -host tyr,linpc1,sunpc1 java 
MatMultWithAnyProc2DarrayIn1DarrayMain

You have started 6 processes but I need at most 4 processes.
I build a new worker group with 4 processes. The processes with
the following ranks in the basic group belong to the new group:
  2   3   4   5   
...


Kind regards

Siegmar




> Howard
> On Apr 21, 2015 10:11 AM, "Siegmar Gross" <
> siegmar.gr...@informatik.hs-fulda.de> wrote:
> 
> > Hi,
> >
> > today I installed openmpi-dev-1567-g11e8c20 on my machines
> > (Solaris 10 Sparc, Solaris 10 x86_64, and openSUSE Linux 12.1
> > x86_64) with gcc-4.9.2. I used the following configure command
> > for all platforms.
> >
> > ../openmpi-dev-1567-g11e8c20/configure \
> >   --prefix=/usr/local/openmpi-1.9.0_64_gcc \
> >   --libdir=/usr/local/openmpi-1.9.0_64_gcc/lib64 \
> >   --with-jdk-bindir=/usr/local/jdk1.8.0/bin \
> >   --with-jdk-headers=/usr/local/jdk1.8.0/include \
> >   JAVA_HOME=/usr/local/jdk1.8.0 \
> >   LDFLAGS="-m64" CC="gcc" CXX="g++" FC="gfortran" \
> >   CFLAGS="-m64" CXXFLAGS="-m64" FCFLAGS="-m64" \
> >   CPP="cpp" CXXCPP="cpp" \
> >   CPPFLAGS="" CXXCPPFLAGS="" \
> >   --enable-mpi-cxx \
> >   --enable-cxx-exceptions \
> >   --enable-mpi-java \
> >   --enable-heterogeneous \
> >   --enable-mpi-thread-multiple \
> >   --with-hwloc=internal \
> >   --without-verbs \
> >   --with-wrapper-cflags="-std=c11 -m64" \
> >   --with-wrapper-cxxflags="-m64" \
> >   --with-wrapper-fcflags="-m64" \
> >   --enable-debug \
> >   |& tee log.configure.$SYSTEM_ENV.$MACHINE_ENV.64_gcc
> >
> > I can run a small program on both Solaris machines without problems,
> > but get an error on Linux.
> >
> > tyr java 123 mpiexec -np 6 --host sunpc1 java
> > MatMultWithAnyProc2DarrayIn1DarrayMain
> > You have started 6 processes but I need at most 4 processes.
> > I build a new worker group with 4 processes. The processes with
> > the following ranks in the basic group belong to the new group:
> >   2   3   4   5
> >
> > Group "groupOther" contains 2 processes which have
> > nothing to do.
> >
> > Worker process 0 of 4 running on sunpc1.
> > Worker process 1 of 4 running on sunpc1.
> > Worker process 2 of 4 running on sunpc1.
> > Worker process 3 of 4 running on sunpc1.
> >
> > (4,6)-matrix a:
> >
> >       1.00      2.00      3.00      4.00      5.00      6.00
> >       7.00      8.00      9.00     10.00     11.00     12.00
> >      13.00     14.00     15.00     16.00     17.00     18.00
> >      19.00     20.00     21.00     22.00     23.00     24.00
> > ...
> >
> >
> > I get the following error on my Linux machine.
> >
> > tyr java 127 mpiexec -np 6 --host linpc1 java
> > MatMultWithAnyProc2DarrayIn1DarrayMain
> > Exception in thread "main" java.lang.NoClassDefFoundError: mpi/MPIException
> >         at java.lang.Class.getDeclaredMethods0(Native Method)
> >         at java.lang.Class.privateGetDeclaredMethods(Class.java:2688)
> >         at java.lang.Class.getMethod0(Class.java:2937)
> >         at java.lang.Class.getMethod(Class.java:1771)
> >         at
> > sun.launcher.LauncherHelper.validateMainClass(LauncherHelper.java:544)
> >         at
> > sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:526)
> > Caused by: java.lang.ClassNotFoundException: mpi.MPIException
> >         at java.net.URLClassLoader$1.run(URLClassLoader.java:372)
> >         at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
> >         at java.security.AccessController.doPrivileged(Native Method)
> >         at java.net.URLClassLoader.findClass(URLClassLoader.java:360)
> >         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)Exception
> > in thread "main"
> >         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
> >         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
> >         ... 6 more
> > java.lang.NoClassDefFoundError: mpi/MPIException
> >         at java.lang.Class.getDeclaredMethods0(Native Method)
> >         at java.lang.Class.privateGetDeclaredMethods(Class.java:2688)
> >         at java.lang.Class.getMethod0(Class.java:2937)
> >         at java.lang.Class.getMethod(Class.java:1771)
> >         at
> > sun.launcher.LauncherHelper.validateMainClass(LauncherHelper.java:544)
> >         at
> > sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:526)
> > Caused by: java.lang.ClassNotFoundException: mpi.MPIException
> >         at java.net.URLClassLoader$1.run(URLClassLoader.java:372)
> >         at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
> >         at java.security.AccessController.doPrivileged(Native Method)
> >         at java.net.URLClassLoader.findClass(URLClassLoader.java:360)
> >         at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
> >         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
> > -------------------------------------------------------
> > Primary job  terminated normally, but 1 process returned
> > a non-zero exit code.. Per user-direction, the job has been aborted.
> > -------------------------------------------------------
> >         at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
> >         ... 6 more
> > --------------------------------------------------------------------------
> > mpiexec detected that one or more processes exited with non-zero status,
> > thus causing
> > the job to be terminated. The first process to do so was:
> >
> >   Process name: [[57804,1],0]
> >   Exit code:    1
> > --------------------------------------------------------------------------
> > tyr java 128
> >
> >
> >
> > I would be grateful if somebody can fix the problem. Thank
> > you very much for any help in advance.
> >
> >
> > Kind regards
> >
> > Siegmar
> >
> > _______________________________________________
> > users mailing list
> > us...@open-mpi.org
> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> > Link to this post:
> > http://www.open-mpi.org/community/lists/users/2015/04/26760.php
> >

Reply via email to