On Mar 15, 2014, at 10:19 PM, Hjelm, Nathan T <hje...@lanl.gov> wrote:

> On Friday, March 14, 2014 8:48 PM, devel [devel-boun...@open-mpi.org] on 
> behalf of Ralph Castain [r...@open-mpi.org] wrote:
>> To: Open MPI Developers
>> Subject: [OMPI devel] 1.7.5 end-of-week status report
>> 
>> Hi folks
>> 
>> I have both good and bad news to report - first the good.
>> 
>> OSHMEM now passes nearly all its tests on my Linux cluster (tcp). My hat is 
>> off to the Mellanox guys for getting this done, including getting our MTT 
>> repo tests complete.
>> 
>> The MPI layer passes nearly all the IBM, Intel, and one-sided tests. Only a 
>> few failures.
>> 
>> Now the bad. The coll/ml component continues to have problems, including 
>> segfaults, and I have discovered that the bcol and coll/ml code remains 
>> entangled (I thought it had been separated, but sadly not). I have therefore 
>> ompi_ignored coll/ml and bcol/ptpcoll.
> 
> No need. I discovered a bug in my last coll/ml fix. It incorrectly handled 
> one of the possibly hierarchies. The bug is fixed in trunk and a CMR is open 
> for 1.7.5. In the future I will clean up this path but the fix should have us 
> working again.

I'm glad you were able to patch it, but this still begs the question of what to 
do with coll/ml. It's disturbing that its existence alone was enough to break 
the Java bindings (and yes, I concede those aren't built by default or part of 
the MPI standard) without even traversing its code path, and we've had a lot of 
problems with errors when we do go thru it. More disturbing, you can't even 
cleanly no-build that component due to the unfortunate cross-linkage with 
bcol/ptpcoll, so we definitely need a note in NEWS to warn people they need to 
no-build both.

It's unclear to me how to handle this situation, so we'll need to discuss it at 
the telecon. At the very least, I think we need to ensure coll/ml is not the 
default for 1.7.5 as it doesn't appear to be ready for that role.


> 
> -Nathan
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/devel
> Link to this post: 
> http://www.open-mpi.org/community/lists/devel/2014/03/14352.php

Reply via email to