Yann K. wrote:
------------------------------------------------------------------------
Subject:
[Error] Asynchronous Thread
From:
"Yann K." <[EMAIL PROTECTED]>
Date:
Thu, 21 Jun 2007 16:50:59 +0200
To:
[EMAIL PROTECTED]
To:
[EMAIL PROTECTED]
Hello everybody,
I have a problem making a diagnostic on those kind of errors, which
happen at the same time :
At the mpi level :
case IBV_EVENT_SRQ_ERR:
ibv_error_abort(GEN_EXIT_ERR, "MPI Gen2 Async Special Event
thread : Got FATAL event %d\n",
event.event_type);
At the kernel level :
Jun 21 11:17:55 [EMAIL PROTECTED] kernel: ib_mthca 0000:07:00.0: CQ
overrun on CQN c2009c
It seems that you got CQ overrun which means that more completions that
the CQ size were created.
You can solve this by creating a bigger CQ or use more than one CQ...
(i don't really understand why you sent the code from the MPI which
handles SRQ error).
thanks
Dotan
_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general
To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general