Hi,

today I installed openmpi-dev-1567-g11e8c20 on my machines
(Solaris 10 Sparc, Solaris 10 x86_64, and openSUSE Linux 12.1
x86_64) with gcc-4.9.2. I used the following configure command
for all platforms.

../openmpi-dev-1567-g11e8c20/configure \
  --prefix=/usr/local/openmpi-1.9.0_64_gcc \
  --libdir=/usr/local/openmpi-1.9.0_64_gcc/lib64 \
  --with-jdk-bindir=/usr/local/jdk1.8.0/bin \
  --with-jdk-headers=/usr/local/jdk1.8.0/include \
  JAVA_HOME=/usr/local/jdk1.8.0 \
  LDFLAGS="-m64" CC="gcc" CXX="g++" FC="gfortran" \
  CFLAGS="-m64" CXXFLAGS="-m64" FCFLAGS="-m64" \
  CPP="cpp" CXXCPP="cpp" \
  CPPFLAGS="" CXXCPPFLAGS="" \
  --enable-mpi-cxx \
  --enable-cxx-exceptions \
  --enable-mpi-java \
  --enable-heterogeneous \
  --enable-mpi-thread-multiple \
  --with-hwloc=internal \
  --without-verbs \
  --with-wrapper-cflags="-std=c11 -m64" \
  --with-wrapper-cxxflags="-m64" \
  --with-wrapper-fcflags="-m64" \
  --enable-debug \
  |& tee log.configure.$SYSTEM_ENV.$MACHINE_ENV.64_gcc

I can run a small program on both Solaris machines without problems,
but get an error on Linux.

tyr java 123 mpiexec -np 6 --host sunpc1 java 
MatMultWithAnyProc2DarrayIn1DarrayMain
You have started 6 processes but I need at most 4 processes.
I build a new worker group with 4 processes. The processes with
the following ranks in the basic group belong to the new group:
  2   3   4   5   

Group "groupOther" contains 2 processes which have
nothing to do.

Worker process 0 of 4 running on sunpc1.
Worker process 1 of 4 running on sunpc1.
Worker process 2 of 4 running on sunpc1.
Worker process 3 of 4 running on sunpc1.

(4,6)-matrix a:

      1.00      2.00      3.00      4.00      5.00      6.00
      7.00      8.00      9.00     10.00     11.00     12.00
     13.00     14.00     15.00     16.00     17.00     18.00
     19.00     20.00     21.00     22.00     23.00     24.00
...


I get the following error on my Linux machine.

tyr java 127 mpiexec -np 6 --host linpc1 java 
MatMultWithAnyProc2DarrayIn1DarrayMain
Exception in thread "main" java.lang.NoClassDefFoundError: mpi/MPIException
        at java.lang.Class.getDeclaredMethods0(Native Method)
        at java.lang.Class.privateGetDeclaredMethods(Class.java:2688)
        at java.lang.Class.getMethod0(Class.java:2937)
        at java.lang.Class.getMethod(Class.java:1771)
        at 
sun.launcher.LauncherHelper.validateMainClass(LauncherHelper.java:544)
        at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:526)
Caused by: java.lang.ClassNotFoundException: mpi.MPIException
        at java.net.URLClassLoader$1.run(URLClassLoader.java:372)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:360)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:424)Exception in 
thread "main" 
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
        ... 6 more
java.lang.NoClassDefFoundError: mpi/MPIException
        at java.lang.Class.getDeclaredMethods0(Native Method)
        at java.lang.Class.privateGetDeclaredMethods(Class.java:2688)
        at java.lang.Class.getMethod0(Class.java:2937)
        at java.lang.Class.getMethod(Class.java:1771)
        at 
sun.launcher.LauncherHelper.validateMainClass(LauncherHelper.java:544)
        at sun.launcher.LauncherHelper.checkAndLoadMain(LauncherHelper.java:526)
Caused by: java.lang.ClassNotFoundException: mpi.MPIException
        at java.net.URLClassLoader$1.run(URLClassLoader.java:372)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:360)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:424)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
-------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------
        at java.lang.ClassLoader.loadClass(ClassLoader.java:357)
        ... 6 more
--------------------------------------------------------------------------
mpiexec detected that one or more processes exited with non-zero status, thus 
causing
the job to be terminated. The first process to do so was:

  Process name: [[57804,1],0]
  Exit code:    1
--------------------------------------------------------------------------
tyr java 128 



I would be grateful if somebody can fix the problem. Thank
you very much for any help in advance.


Kind regards

Siegmar

Reply via email to