Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-15 Thread Jason Weisberger
Update time.

Today, still running on the integrated Radeon HD 3300, I did an
upgrade to chromium again.  BAM!  Kernel panic.  Recoverable, but the
compilation failed again.

So, the NVIDIA 9800GT has been out since my last message and I am
apparently getting the kernel panics again.  I suppose it was luck
that I got through the upgrade last time.

So looks like I'll be getting a new motherboard.

Jason Weisberger
jbdu...@gmail.com



Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-05 Thread Jason Weisberger
Well it worked.  I'll get to additional testing later and see if I can
come up with a test card for that slot.  I suppose I could also move
it down a slot.  Thanks to everybody who helped and I hope fixing that
damn radeon wiki returned the favor to the community.

Thanks again!

On Thu, Jan 5, 2012 at 1:10 AM, Michael Mol mike...@gmail.com wrote:
 On Thu, Jan 5, 2012 at 12:57 AM, Jason Weisberger jbdu...@gmail.com wrote:
 Well like i said earlier, it did work that way.  The final test is doing it
 in X with full drivers loaded again.  Then i get to kick my 9800gt off a
 cliff.

 Well, keep in mind you're switching from a PCIe card to an integrated
 chipset. It could be your PCIe slot (or even northbridge) that's
 failing. I'd suggest finding another PCIe video card and giving it a
 shot, if the X-with-integrated-video works.

 If it works with the HD3300, but not with a PCIe graphics card (even
 an ATI one), I'd suspect your northbridge is messed up, and may have
 been damaged by the overclocking.


 --
 :wq




-- 
Jason Weisberger
jbdu...@gmail.com



Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-05 Thread Mark Knecht
On Thu, Jan 5, 2012 at 4:39 AM, Jason Weisberger jbdu...@gmail.com wrote:
 Well it worked.  I'll get to additional testing later and see if I can
 come up with a test card for that slot.  I suppose I could also move
 it down a slot.  Thanks to everybody who helped and I hope fixing that
 damn radeon wiki returned the favor to the community.

 Thanks again!

SNIP
 --
 Jason Weisberger
 jbdu...@gmail.com


Jason,
   First, great that it's working, whatever that means exactly.

   Second, please try to bottom post if you can.

   Third, if you can relate in just a few words what you actually did
to fix the problems so far it may aid people searching in the future.

Cheers,
Mark



Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-05 Thread Jason Weisberger
   First, great that it's working, whatever that means exactly.

   Second, please try to bottom post if you can.

   Third, if you can relate in just a few words what you actually did
 to fix the problems so far it may aid people searching in the future.

 Cheers,
 Mark

Well, it's boiled down to either the motherboard or the graphics card
due to the fact that everything compiles properly and there are no
blue screens or kernel panics after removing the 9800GT.  I'm going to
get my hands on an HD 4850 from a buddy of mine and try compiling and
playing with that.  If it crashes again, it's motherboard.  If not,
it's probably my card.  Whether it's the PCI-E slot or northbridge
taking a dump on the motherboard is irrelevant to me, I'll just get a
new board.

I'll know by the end of this week.  I can update with which way it went.



-- 
Jason Weisberger
jbdu...@gmail.com



[gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
Here's the rig:

AMD Phenom II X3 720 (unlocked to 4 cores and OC'd to 3 Ghz, however
taking it back to stock doesn't affect the problem)
4 Gigs DDR3-1600 at 7-7-7-16
ASUS ASUS M4A78T-E AM3 Mobo
MSI N9800GT GeForce 9800 GT 512MB

other less important things would be two 320GB SATA drives, an IDE dvd
writer, internal card reader, blah blah

There are two things I can get this system to consistently lock up
while doing: a large compilation such as Chromium, open office, or
GCC, or playing Age of Conan in my separate Windows 7 partition.  For
normal everyday tasks, I can't lock the thing up.  The system even
plays WoW on maximum settings, Skyrim on max, Counter-Strike Source on
max, 1080p youtube or local videos / movies.

Watching the processor temp stays relatively low, the GPU temp stays
normal, north bridge temp is good.

The kernel panics I get are usually recoverable, I can just F7 back
into my desktop and the compile may or may not still be going.  If it
is, it's usually dead by the third or fourth panic.

Memtest86+ ran for 24 hours and found no issues with memory on 18 passes.

So, pretty much where I'm at is either processor or RAM.  Unless
anybody here has a better idea.  And how could I test whether it was
the processor or RAM without any current replacement parts?  What
tests usually show one over the other?

-- 
Jason Weisberger
jbdu...@gmail.com



Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Michael Mol
Jason Weisberger wrote:
 Here's the rig:
 
 AMD Phenom II X3 720 (unlocked to 4 cores and OC'd to 3 Ghz, however
 taking it back to stock doesn't affect the problem)
 4 Gigs DDR3-1600 at 7-7-7-16
 ASUS ASUS M4A78T-E AM3 Mobo
 MSI N9800GT GeForce 9800 GT 512MB
 
 other less important things would be two 320GB SATA drives, an IDE dvd
 writer, internal card reader, blah blah
 
 There are two things I can get this system to consistently lock up
 while doing: a large compilation such as Chromium, open office, or
 GCC, or playing Age of Conan in my separate Windows 7 partition.  For
 normal everyday tasks, I can't lock the thing up.  The system even
 plays WoW on maximum settings, Skyrim on max, Counter-Strike Source on
 max, 1080p youtube or local videos / movies.
 
 Watching the processor temp stays relatively low, the GPU temp stays
 normal, north bridge temp is good.
 
 The kernel panics I get are usually recoverable, I can just F7 back
 into my desktop and the compile may or may not still be going.  If it
 is, it's usually dead by the third or fourth panic.
 
 Memtest86+ ran for 24 hours and found no issues with memory on 18 passes.
 
 So, pretty much where I'm at is either processor or RAM.  Unless
 anybody here has a better idea.  And how could I test whether it was
 the processor or RAM without any current replacement parts?  What
 tests usually show one over the other?

Chromium, at least, is going to force you to start swapping if you've
only got 4GB of RAM in the machine, so I'd check smartctl and see if the
drive you swap partition on is having any trouble.

I'd also suggest removing the NVidia card, going without X for a little
while[1], and see if you can reproduce the issue[2].

I'd guess that it's the unlocked core that's the source of your woes,
though. Not all X3s are so labeled just for market segmentation.

[1] Only because your onboard video is ATI, and flipping between ATI and
NVidia configurations for something like this would strike me as too
much a hassle.

[2] Only because limiting the number of active components helps when
troubleshooting hardware issues. I once found a sporadic kernel panic
was related to a failing NIC. Another time I found it was related to an
old Hauppauge PVR150 that worked fine with my old motherboard, but was
incompatible with my new motherboard.



[gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
I would instantly agree, but backing off the unlock and oc doesn't improve
the situation.  I can try text console with the onboard graphics and see
what happens.


Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Mark Knecht
On Wed, Jan 4, 2012 at 12:42 PM, Jason Weisberger jbdu...@gmail.com wrote:
 I would instantly agree, but backing off the unlock and oc doesn't improve
 the situation.  I can try text console with the onboard graphics and see
 what happens.

I would second Michael's suggestion that you completely remove the
NVidia board from the box, at least for testing. Remove any other
cards that you can. Additionally boot using gentoo=nox to do some
testing. If you get the cards out and boot only to a text console then
you can eliminate those cards as contributing to the problem.

Just a suggestion.

Good lick,
Mark



[gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
Yeah, it never occurred to me that even when i was in console compiling
stuff that i was using a 1280x1024 vesa or nouveau framebuffer each time.
That will be first thing when i get home.
On Jan 4, 2012 3:52 PM, Mark Knecht markkne...@gmail.com wrote:

 On Wed, Jan 4, 2012 at 12:42 PM, Jason Weisberger jbdu...@gmail.com
 wrote:
  I would instantly agree, but backing off the unlock and oc doesn't
 improve
  the situation.  I can try text console with the onboard graphics and see
  what happens.

 I would second Michael's suggestion that you completely remove the
 NVidia board from the box, at least for testing. Remove any other
 cards that you can. Additionally boot using gentoo=nox to do some
 testing. If you get the cards out and boot only to a text console then
 you can eliminate those cards as contributing to the problem.

 Just a suggestion.

 Good lick,
 Mark




Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Mark Knecht
On Wed, Jan 4, 2012 at 1:22 PM, Jason Weisberger jbdu...@gmail.com wrote:
 Yeah, it never occurred to me that even when i was in console compiling
 stuff that i was using a 1280x1024 vesa or nouveau framebuffer each time.
 That will be first thing when i get home.


Good luck with that. Also, remove any other old cards ala Michael's
comment about an old NIC. I've had the same thing happen here when
some old card that worked in every old machine I'd ever owned suddenly
caused problems in a new box.

Hope things improve.

Cheers,
Mark



[gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
Sorry for the delayed response on the result, but i decided to do a
makeopts -j1 while i was at it.  Compile has been going on for two hours.
Shoot me :)


Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Dale

Jason Weisberger wrote:


Sorry for the delayed response on the result, but i decided to do a 
makeopts -j1 while i was at it.  Compile has been going on for two 
hours.  Shoot me :)




I doubt this is the issue but I ran into a weird one a while back.  I 
have a data drive, it's mounted on /data and stores all sorts of stuff 
including movies.  Anyway, when the drive was being wrote to and reached 
a certain point, it would cause a hard lock up most of the time.  
Sometimes I could use SysReq keys to get it back.  My point is, 
somewhere in the back of your mind, ask yourself if something, swap 
maybe, could be hitting a certain spot on a drive or file system.


In the end, I moved all my data off that drive, redone the file system, 
moved all the data back and it has worked fine ever since.  It's weird I 
know.  Just keep that in mind in case something comes up and this is 
what is going on.  Puters do weird things just to throw use a curve ball 
and make us scratch our heads until we are bald.


Dale

:-)  :-)

--
I am only responsible for what I said ... Not for what you understood or how 
you interpreted my words!

Miss the compile output?  Hint:
EMERGE_DEFAULT_OPTS=--quiet-build=n




[gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
Oh yeah, I hear you on the head scratching, believe me.

Well so far on the built in card I have managed to compile chromium -j5,
which i couldn't do for the two weeks prior.  Same build, so i know it
wasn't an upstream or ebuild problem.  Damn.

Next step is to get X running again and do it one more time.

Last thing i want to do is go from a 9800gt to an integrated 3300, but i
suppose it's better than nothing.  At least i had something to test on.


Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Dale

Jason Weisberger wrote:


Oh yeah, I hear you on the head scratching, believe me.

Well so far on the built in card I have managed to compile chromium 
-j5, which i couldn't do for the two weeks prior.  Same build, so i 
know it wasn't an upstream or ebuild problem.  Damn.


Next step is to get X running again and do it one more time.

Last thing i want to do is go from a 9800gt to an integrated 3300, but 
i suppose it's better than nothing.  At least i had something to test on.




Try a different version of the driver.  Maybe that specific version has 
a bug in it.  If you run stable, try a unstable version.  If you run 
unstable, try a stable one.  Worth a shot.


While at it, blow some air on it too.  Check all the contacts or as I 
do, use a pencil eraser and rub on it a couple times.  Makes it shiney 
then try again.  Wouldn't hurt to blow down in the connector on the mobo 
either.  Spec of dust in the wrong place can cause weird things.


When falling off cliff, grab all the straws you can.  ;-)

Dale

:-)  :-)

--
I am only responsible for what I said ... Not for what you understood or how 
you interpreted my words!

Miss the compile output?  Hint:
EMERGE_DEFAULT_OPTS=--quiet-build=n




Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
Well after a couple hours of trying to get the Radeon HD3300 to
actually start properly so I can test the compilation.. I haven't
done it yet.  Instead I decided to go ape-s**t on the radeon wiki
article which drove me into so many dead ends.

http://en.gentoo-wiki.com/wiki/Radeon

Now that it's up and running, I'm recompiling chromium from inside
gnome 3 with -j5 and I'll report in tomorrow morning.

On Wed, Jan 4, 2012 at 8:52 PM, Dale rdalek1...@gmail.com wrote:
 Jason Weisberger wrote:


 Oh yeah, I hear you on the head scratching, believe me.

 Well so far on the built in card I have managed to compile chromium -j5,
 which i couldn't do for the two weeks prior.  Same build, so i know it
 wasn't an upstream or ebuild problem.  Damn.

 Next step is to get X running again and do it one more time.

 Last thing i want to do is go from a 9800gt to an integrated 3300, but i
 suppose it's better than nothing.  At least i had something to test on.


 Try a different version of the driver.  Maybe that specific version has a
 bug in it.  If you run stable, try a unstable version.  If you run unstable,
 try a stable one.  Worth a shot.

 While at it, blow some air on it too.  Check all the contacts or as I do,
 use a pencil eraser and rub on it a couple times.  Makes it shiney then try
 again.  Wouldn't hurt to blow down in the connector on the mobo either.
  Spec of dust in the wrong place can cause weird things.

 When falling off cliff, grab all the straws you can.  ;-)


 Dale

 :-)  :-)

 --
 I am only responsible for what I said ... Not for what you understood or how
 you interpreted my words!

 Miss the compile output?  Hint:
 EMERGE_DEFAULT_OPTS=--quiet-build=n





-- 
Jason Weisberger
jbdu...@gmail.com



Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Michael Mol
This is why I figured you'd want to go without X for the purpose of
testing. No X, no framebuffer, the card should have acted like a
plane-jane VGA or VESA adapter.

On Thu, Jan 5, 2012 at 12:41 AM, Jason Weisberger jbdu...@gmail.com wrote:
 Well after a couple hours of trying to get the Radeon HD3300 to
 actually start properly so I can test the compilation.. I haven't
 done it yet.  Instead I decided to go ape-s**t on the radeon wiki
 article which drove me into so many dead ends.

 http://en.gentoo-wiki.com/wiki/Radeon

 Now that it's up and running, I'm recompiling chromium from inside
 gnome 3 with -j5 and I'll report in tomorrow morning.

 On Wed, Jan 4, 2012 at 8:52 PM, Dale rdalek1...@gmail.com wrote:
 Jason Weisberger wrote:


 Oh yeah, I hear you on the head scratching, believe me.

 Well so far on the built in card I have managed to compile chromium -j5,
 which i couldn't do for the two weeks prior.  Same build, so i know it
 wasn't an upstream or ebuild problem.  Damn.

 Next step is to get X running again and do it one more time.

 Last thing i want to do is go from a 9800gt to an integrated 3300, but i
 suppose it's better than nothing.  At least i had something to test on.


 Try a different version of the driver.  Maybe that specific version has a
 bug in it.  If you run stable, try a unstable version.  If you run unstable,
 try a stable one.  Worth a shot.

 While at it, blow some air on it too.  Check all the contacts or as I do,
 use a pencil eraser and rub on it a couple times.  Makes it shiney then try
 again.  Wouldn't hurt to blow down in the connector on the mobo either.
  Spec of dust in the wrong place can cause weird things.

 When falling off cliff, grab all the straws you can.  ;-)


 Dale

 :-)  :-)

 --
 I am only responsible for what I said ... Not for what you understood or how
 you interpreted my words!

 Miss the compile output?  Hint:
 EMERGE_DEFAULT_OPTS=--quiet-build=n





 --
 Jason Weisberger
 jbdu...@gmail.com




-- 
:wq



[gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Jason Weisberger
Well like i said earlier, it did work that way.  The final test is doing it
in X with full drivers loaded again.  Then i get to kick my 9800gt off a
cliff.


Re: [gentoo-user] [OT] Hardware Problems causing kernel panics during large compiles

2012-01-04 Thread Michael Mol
On Thu, Jan 5, 2012 at 12:57 AM, Jason Weisberger jbdu...@gmail.com wrote:
 Well like i said earlier, it did work that way.  The final test is doing it
 in X with full drivers loaded again.  Then i get to kick my 9800gt off a
 cliff.

Well, keep in mind you're switching from a PCIe card to an integrated
chipset. It could be your PCIe slot (or even northbridge) that's
failing. I'd suggest finding another PCIe video card and giving it a
shot, if the X-with-integrated-video works.

If it works with the HD3300, but not with a PCIe graphics card (even
an ATI one), I'd suspect your northbridge is messed up, and may have
been damaged by the overclocking.


-- 
:wq