Hi,

with our new ZEO client systems, we frequently observe this problem:
a ZEO process starts to use 100% CPU time (user) without a significant increase in requests. Sometimes (but not always) the process also stops answering requests while still using 100% CPU.

When we kill such a process, it goes into zombie state (shown in top as 'Z' and '<defunct>') and still uses 100% CPU, but now as system time rather than user time. The HTTP port remains in use, so we have to reboot the node to restart the ZEO client. The reboot usually fails because some filesystems cannot be unmounted: there are still files locked.

I tried both start modes, runzope and zopectl, but it makes no difference.

All of this contradicts what I know about zombie processes: they should use no CPU time.

Versions are:

  RedHat RHEL4, Kernel 2.6.9-42.0.10.ELsmp, with address extension (16 GB RAM)
  Python 2.3.6
  Zope 2.8.8

The older cluster nodes work perfectly and have never shown this zombie problem (they are connected to the same storage server); they run on Debian Sarge, Kernel 2.4.27 SMP.

Any hint is appreciated.

Regards
Bengt

_______________________________________________
Zope maillist  -  Zope@zope.org
http://mail.zope.org/mailman/listinfo/zope
**   No cross posts or HTML encoding!  **
(Related lists -
 http://mail.zope.org/mailman/listinfo/zope-announce
 http://mail.zope.org/mailman/listinfo/zope-dev )
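
P.S. For anyone comparing nodes: the state letter and the user/system CPU split that top displays can also be read directly from /proc/<pid>/stat. The following is only a minimal sketch, assuming Linux procfs and a current Python 3 (not the Python 2.3.6 listed above). The function names parse_proc_stat and read_proc_stat are made up for illustration, and the field positions are taken from the proc(5) man page.

```python
def parse_proc_stat(line):
    """Parse one /proc/<pid>/stat line.

    Returns (pid, comm, state, utime_ticks, stime_ticks).
    """
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split around the first '(' and the last ')'.
    head, _, tail = line.rpartition(")")
    pid_text, _, comm = head.partition("(")
    fields = tail.split()
    state = fields[0]        # field 3 in proc(5): R, S, D, Z, ...
    utime = int(fields[11])  # field 14: ticks spent in user mode
    stime = int(fields[12])  # field 15: ticks spent in kernel mode
    return int(pid_text), comm, state, utime, stime


def read_proc_stat(pid):
    """Read and parse /proc/<pid>/stat for a running (or defunct) process."""
    with open("/proc/%d/stat" % pid) as handle:
        return parse_proc_stat(handle.read())
```

Calling read_proc_stat(pid) twice a few seconds apart and comparing the two tick counters would show whether a process in state 'Z' is really accumulating system time, independently of what top reports.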