Adrian, On 13 August 2007 at 22:28, Adrian Knoth wrote: | On Thu, Aug 02, 2007 at 10:51:13AM +0200, Adrian Knoth wrote: | | > > We (as in the Debian maintainer for Open MPI) got this bug report from | > > Uwe who sees mpi apps segfault on Debian systems with the FreeBSD | > > kernel. | > > Any input would be greatly appreciated! | > I'll follow the QEMU instructions on your website and investigate on | > my own ;) | | I was able to get OMPI running on kfreebsd-amd64. I used a nightly | snapshot from the trunk, so the problem is "more or less fixed by | upstream" ;) | | adi@debian:~$ ./ompi/bin/mpirun -np 2 ring | 0: sending message (0) to 1 | 0: sent message | 1: waiting for message | 1: got message (1) from 0, sending to 0 | 0: got message (1) from 1 | | adi@debian:~$ ./ompi/bin/ompi_info | Open MPI: 1.3a1r15820 | Open MPI SVN revision: r15820 | Open RTE: 1.3a1r15820 | Open RTE SVN revision: r15820 | OPAL: 1.3a1r15820 | OPAL SVN revision: r15820 | Prefix: /home/adi/ompi | Configured architecture: x86_64-unknown-kfreebsd6.2-gnu | | | I'll now compile the 1.2.3 release tarball and see if I can reproduce
I really appreciate the help. | the segfaults. On the other hand, I guess nobody is using OMPI on | GNU/kFreeBSD, so upgrading the openmpi-package to a subversion snapshot | would also fix the problem (think of "fixed in experimental"). Well, I generally prefer to follow upstream releases, and Jeff from the upstream team echoed that. Let's wait for 1.2.4, shall we? OTOH if you can back out a patch for 1.2.3, I'd apply that. | | JFTR: It's currently not possible to compile OMPI on amd64 (out of the | box). Though it compiles on i386 | | http://experimental.debian.net/fetch.php?&pkg=openmpi&ver=1.2.3-3&arch=kfreebsd-i386&stamp=1187000200&file=log&as=raw | | it fails on amd64: | | http://experimental.debian.net/fetch.php?&pkg=openmpi&ver=1.2.3-3&arch=kfreebsd-amd64&stamp=1186969782&file=log&as=raw | | stacktrace.c: In function 'opal_show_stackframe': | stacktrace.c:145: error: 'FPE_FLTDIV' undeclared (first use in this | function) | stacktrace.c:145: error: (Each undeclared identifier is reported only | once | stacktrace.c:145: error: for each function it appears in.) | stacktrace.c:146: error: 'FPE_FLTOVF' undeclared (first use in this | function) | stacktrace.c:147: error: 'FPE_FLTUND' undeclared (first use in this | function) | make[4]: *** [stacktrace.lo] Error 1 | make[4]: Leaving directory `/build/buildd/openmpi-1.2.3/opal/util' | | | This is caused by libc0.1-dev in /usr/include/bits/sigcontext.h, the | relevant #define's are placed in an #ifdef __i386__ condition. After | extending this for __x86_64__, everything works fine. | | Should I file a bugreport against libc0.1-dev or will you take care? I'm confused. What is libc0.1-dev ? Also note that I happened to have uploaded a third Debian revision of 1.2.3 yesterday, and that Debian release 1.2.3-3 built fine on amd as per: http://buildd.debian.org/build.php?&pkg=openmpi&ver=1.2.3-3&arch=amd64&file=log So are we sure there's a bug? Maybe you were just bitten by something in SVN that is not yet deemed release quality? | I'll keep you posted... I appreciate that. Cheers, Dirk | -- | Cluster and Metacomputing Working Group | Friedrich-Schiller-Universität Jena, Germany | | private: http://adi.thur.de | _______________________________________________ | devel mailing list | de...@open-mpi.org | http://www.open-mpi.org/mailman/listinfo.cgi/devel -- Three out of two people have difficulties with fractions.