Yann K. wrote:


------------------------------------------------------------------------

Subject:
[Error] Asynchronous Thread
From:
"Yann K." <[EMAIL PROTECTED]>
Date:
Thu, 21 Jun 2007 16:50:59 +0200
To:
[EMAIL PROTECTED]

To:
[EMAIL PROTECTED]


Hello everybody,

I have a problem making a diagnostic on those kind of errors, which happen at the same time :

At the mpi level :

       case IBV_EVENT_SRQ_ERR:
ibv_error_abort(GEN_EXIT_ERR, "MPI Gen2 Async Special Event thread : Got FATAL event %d\n",
                           event.event_type);

At the kernel level :

Jun 21 11:17:55 [EMAIL PROTECTED] kernel: ib_mthca 0000:07:00.0: CQ
overrun on CQN c2009c
It seems that you got CQ overrun which means that more completions that the CQ size were created.
You can solve this by creating a bigger CQ or use more than one CQ...

(i don't really understand why you sent the code from the MPI which handles SRQ error).

thanks
Dotan


_______________________________________________
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Reply via email to