Hi,
Seqf bug fixed in r18706.
Best Regards
Lenny.
On Thu, Jun 19, 2008 at 5:37 PM, Lenny Verkhovsky <
lenny.verkhov...@gmail.com> wrote:
> Sorry,
> I checked it without sm.
>
> pls ignore this mail.
>
>
>
> On Thu, Jun 19, 2008 at 4:32 PM, Lenny Verkhovsky <
> lenny.verkhov...@gmail.com> wrote:
>
Sorry,
I checked it without sm.
pls ignore this mail.
On Thu, Jun 19, 2008 at 4:32 PM, Lenny Verkhovsky <
lenny.verkhov...@gmail.com> wrote:
> Hi,
> I found what caused the problem in both cases.
>
> --- ompi/mca/btl/sm/btl_sm.c(revision 18675)
> +++ ompi/mca/btl/sm/btl_sm.c(working co
Hi,
I found what caused the problem in both cases.
--- ompi/mca/btl/sm/btl_sm.c(revision 18675)
+++ ompi/mca/btl/sm/btl_sm.c(working copy)
@@ -812,7 +812,7 @@
*/
MCA_BTL_SM_FIFO_WRITE(endpoint, endpoint->my_smp_rank,
endpoint->peer_smp_rank, frag->hdr,
It seems like we have 2 bugs here.
1. After commiting NUMA awareness we see seqf
2. Before commiting NUMA r18656 we see application hangs.
3. I checked both it with and without sendi, same results.
4. It hangs most of the times, but sometimes large msg ( >1M ) are working.
I will keep investigati
Lenny,
I guess you're running the latest version. If not, please update,
Galen and myself corrected some bugs last week. If you're using the
latest (and greatest) then ... well I imagine there is at least one
bug left.
There is a quick test you can do. In the btl_sm.c in the module
stru