ooooh, i forgot the only diagnostic information i've discovered. if i kill and then restart the sending process, things remain wedged. if i kill and restart combinations of teh receiving processes, things remain wedged. if i kill all teh receiving processes, and then restart them, things unwedge.
this is what makes me think this is a zmq issue. On Jan 22, 2012, at 8:44 AM, Andrew Hume wrote: > i am currently seeing the following scenario: > > one process opens a PUSH socket on an ipc address /x/y > several processes open a PULL socket on that same /x/y > traffc flows as expected for 24-48 hrs. > then traffic stops. > the sending process is stalled in send, and all the reading processes > are stalled in a recv. > > it is not readily reproducible. > traffic rate is not well known at this point, but an offpeak number > is 1.2M messages/s (= 173MB/s). the high point might be 3x or 4x that. > > any hints on how i might proceed? > shoudl i try another transport? > > ------------------ > Andrew Hume (best -> Telework) +1 623-551-2845 > [email protected] (Work) +1 973-236-2014 > AT&T Labs - Research; member of USENIX and LOPSA > > > > > _______________________________________________ > zeromq-dev mailing list > [email protected] > http://lists.zeromq.org/mailman/listinfo/zeromq-dev ------------------ Andrew Hume (best -> Telework) +1 623-551-2845 [email protected] (Work) +1 973-236-2014 AT&T Labs - Research; member of USENIX and LOPSA
_______________________________________________ zeromq-dev mailing list [email protected] http://lists.zeromq.org/mailman/listinfo/zeromq-dev
