Re: [OMPI devel] Trunk broken for PPC64?

2014-08-02 Thread Ralph Castain
Good suggestion, Paul - I have committed it in r32407 and added it to cmr #4826 Thanks! Ralph On Aug 1, 2014, at 1:12 AM, Paul Hargrove wrote: > Gilles, > > At the moment ompi/mca/osc/sm/osc_sm_component.c is using the following: > > #ifdef HAVE_GETPAGESIZE > pagesize = getpagesize();

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Paul Hargrove
On Fri, Aug 1, 2014 at 1:19 AM, Gilles Gouaillardet < gilles.gouaillar...@iferc.org> wrote: > Paul, > > i just commited r32393 (and made a CMR for v1.8) > > can you please give it a try ? > I am not equipped to build from svn on most of my test platforms. However, I applied your one-line change

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Gilles Gouaillardet
Paul, i just commited r32393 (and made a CMR for v1.8) can you please give it a try ? in the mean time, i received your email ... sysconf is called directly (e.g. no #ifdef protected) in several other places : $ grep -R sysconf . | grep -v svn | grep -v sysconfdir | grep -v autom4te |grep PA

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Paul Hargrove
Gilles, At the moment ompi/mca/osc/sm/osc_sm_component.c is using the following: #ifdef HAVE_GETPAGESIZE pagesize = getpagesize(); #else pagesize = 4096; #endif While other places in the code use sysconf(), but not always consistently. And on some systems _SC_PAGESIZE is spelled

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Gilles Gouaillardet
Paul, you are absolutly right ! in ompi/mca/coll/ml/coll_ml_lmngr.c at line 53, cm->lmngr_alignment is hard coded to 4096 as a proof of concept, i hard coded it to 65536 and now coll/ml works just fine i will now write a patch that uses sysconf(_SC_PAGESIZE) instead Cheers, Gilles On 2014/08

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Paul Hargrove
Hmm, maybe this has nothing to do with big-endian. Below is a backtrace from ring_c on an IA64 platform (definitely little-endian) that looks very similar to me. It happens that sysconf(_SC_PAGESIZE) returns 64K on both of these systems. So, I wonder if that might be related. -Paul $ mpirun -mca

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Paul Hargrove
Gilles's findings are consistent with mine which showed the SEGVs to be in the coll/ml code. I've built with --enable-debug and so below is a backtrace (well, two actually) that might be helpful. Unfortunately the output of the two ranks did get slightly entangled. -Paul $ ../INST/bin/mpirun -mca

Re: [OMPI devel] Trunk broken for PPC64?

2014-08-01 Thread Gilles Gouaillardet
Paul and Ralph, for what it's worth : a) i faced the very same issue on my (slw) qemu emulated ppc64 vm b) i was able to run very basic programs when passing --mca coll ^ml to mpirun Cheers, Gilles On 2014/08/01 12:30, Ralph Castain wrote: > Yes, I fear this will require some effort to cha

Re: [OMPI devel] Trunk broken for PPC64?

2014-07-31 Thread Ralph Castain
Yes, I fear this will require some effort to chase all the breakage down given that (to my knowledge, at least) we lack PPC machines in the devel group. On Jul 31, 2014, at 5:46 PM, Paul Hargrove wrote: > On the path to verifying George's atomics patch, I have started just by > verifying that

[OMPI devel] Trunk broken for PPC64?

2014-07-31 Thread Paul Hargrove
On the path to verifying George's atomics patch, I have started just by verifying that I can still build the UNPATCHED trunk on each of the platforms I listed. I have tried two PPC64/Linux systems so far and am seeing the same problem on both. Though I can pass "make check" both platforms SEGV on