Peter Humphrey <[EMAIL PROTECTED]> posted [EMAIL PROTECTED], excerpted below, on Sat, 23 Jun 2007 09:16:07 +0100:
> Here's what top showed then. Look at the /nice/ values on lines 3 and 4, > and compare those with the %CPU and Processor fields of processes 5279 > and 5280 (sorry about the line wraps). This has me deeply puzzled: Fixed the line wraps and removed a bit of extraneous information. =8^) > top - 09:04:59 up 23 min, 5 users, load average: 3.60, 4.79, 3.91 > Tasks: 124 total, 2 running, 122 sleeping, 0 stopped, 0 zombie > Cpu0: 0.3%us, 0.3%sy, 0.0%ni, 99.3%id, [zeroes] > Cpu1: 0.0%us, 0.3%sy, 99.7%ni, 0.0%id, [zeroes] > PID USER PR NI S %CPU %MEM TIME+ P COMMAND > 5279 prh 34 19 S 50 1.0 6:53.97 1 setiathome-5.12 > 5280 prh 34 19 S 50 1.0 6:54.08 0 setiathome-5.12 > I don't think this is a scheduling problem; it goes deeper, so that the > kernel doesn't have a consistent picture of which processor is which. Critical question here, is that in SMP Irix or SMP Solaris mode? (See the top manpage if you don't know what I mean.) Asked another way, is that displaying percent of total CPU time (both CPUs) or percent of total divided by number of CPUs (so percent of one CPU)? If it's Irix mode (percent total CPU time), then it's reporting full usage of both CPUs, one on each. The CPU0 line would then be the one screwed up, since it's reporting idle when it clearly has to be in use. If it's Solaris mode (percent of a single CPU's time, so total of all percentages should be 200% if you have two CPUs), then the CPUs lines would seem to be correct, both processes would appear to be running on CPU1, maxing it out, and the P column of the 5280 line would have to be screwed up. (That's assuming you let the figures stabilize after the last schedtool call you made.) In either case, I'm not sure where your bug is, but you are correct, the problem appears to be way deeper than scheduling. I'd guess it's ultimately a kernel bug, possibly due to a hardware bug, possibly not, but you might wish to file it on top initially, just to see if they've seen similar and can tell you what's going on. Unless you want to double-check patching status yourself, you might as well file the bug with Gentoo first, in case it's a Gentoo bug. They'll probably end up closing it "upstream", but at least then when you file it upstream, you can say you've cleared it with Gentoo first. As for top, note that there's a trick you can use with it. You'll likely want to trim the memory columns etc as I did for your bug report, but you may not want to mess up your regular config to do so. Not a problem! =8^) Create a symlink to top called something else (say topbug). Then run it using the symlink, and you can change and save your setttings, and it'll save them in a different rc file (topbugrc using my example). That way, you can run it with the bug report settings when you want to, without messing up your regular config. Of course, don't forget to mention in your bug report whether you were in Solaris or Irix SMP mode, because as I explained, it /does/ make a difference. Let me know how this goes, post the bug number when you file it or whatever, as I'd like to follow it too. You definitely have a strange one here, and I'd /love/ to see what the real experts have to say about it! You are absolutely correct, it doesn't seem to make any sense at all! Good luck. That's one /strange/ problem you have going there! No /wonder/ you were expressing frustration earlier! -- Duncan - List replies preferred. No HTML msgs. "Every nonfree program has a lord, a master -- and if you use the program, he is your master." Richard Stallman -- [EMAIL PROTECTED] mailing list
