John McSwain wrote:
> Wit wrote:
>> In my first post, I asked about recent changes. No response.
> 
> There have been no changes to hardware or software.

Good! Taht narrows the field substantially.

><snip>

> Having been trying to compile a new kernel today and I keep getting 
> various errors no matter which kernel.  Just tried to compile the same 
> kernel still in use with same .config and get errors.

Are they repeatable? That includes same "place", etc? If not, what's the 
nature of these? Kernel panics? Failures in compile, link?

As Ken said in the other post, and I suggested in my original reply, 
memtest that booger. Note if failures are repeatable and how long it 
takes. Does it take multiple passes (indicating slow overheating) before 
the errors appear.

> 
>> I still think it's hardware *unless* there have been recent changes in
>> software. Could even be file system corruption exposing a bug in
>> apps/kernel by corrupting data or code.
> 
> I'm beginning to think it is hardware also.  I opened the case this 
> morning, cleaned and checked but found nothing.

Were all the fans running when you powered it up *before* closing it 
back up? Did you carefully push each memory stick, PCI card, etc. into 
the slots? Thermal expansion and contraction over extended periods can 
cause components to "walk" up out of fully seated position. Has the unit 
been recently moved? With todays cases, the can be a fair amount of flex 
that cause cards to tilt in the slot. I have one unit that I've learned 
to push the video card down after any handling. 'Course, I sometimes go 
on a tear and move a lot of things a lot. And set them on precarious 
perches, off level, other items on top,... well I *do* know better.

> 
>> I wonder how long since fsck has been run. If the drives are "smart"
>> capable, maybe any problems there can be seen.
> 
> I'm running ext3 on all drives.  The drives are smart capable and I 
> looked in proc but didn't really know what I was seeing.  Nothing stood 
> out at me.

have you looked at the boot logs to see if the journaling file system 
(ext3) reports any oddities?

Another possibility. Weak power. Even with surge suppression strips, 
there is a possibility some surge weakened the power supply. Or if it's 
old and just large enough for the load, maybe it got really warm one day 
and gave up some capacity.

Try removing anything not essential to the task at hand (compiling 
kernel in this test case?), like cd reader/writers, sound cards, extra 
disk drives (don't really remove them, just pull the power connectors 
out) and see what happens.

How old is the unit? Unbeknown to most, electronics used to "age" and 
eventually die or become extremely "flaky". I don't know if they still 
tend to do that (haven't had the desire to keep up) but on an older box...

> 
>> --
>> Wit
> 

-- 
http://linuxfromscratch.org/mailman/listinfo/lfs-support
FAQ: http://www.linuxfromscratch.org/lfs/faq.html
Unsubscribe: See the above information page

Reply via email to