We're also seeing a big increase in instability since moving to the RHEL 6 kernels. Our typical platform is a Supermicro motherboard with dual 12-core AMD processors (i.e. 24 cores in one system). The most frustrating part is that the symptom is highly intermittent (sometimes it takes 10 minutes to trigger, sometimes several days) and doesn't result in a kernel panic or dump per se. Instead, we end up with an unresponsive system (it still responds to ping, but all services on the box are unresponsive), with this scrolling by on the console:

BUG: soft lockup - CPU#22 stuck for 67s! [node:585441]
BUG: soft lockup - CPU#23 stuck for 68s! [node:585419]

(These messages appear multiple times per second, repeating with all the different process numbers and many different processes running within containers.)

We're going to file a bug report on this, of course, but is there anything else we can do to gather more information that would help the devs find the cause and, hopefully, a fix? (Again, we're not getting a panic, and we're not able to do anything on the console.)

Thanks,
Stephen

--
Stephen Balukoff
Blue Box Group, LLC
(800)613-4305 x807
