cause of reboot

2013-09-30 Thread Emre Çamalan
Hi,
my server reboots every night and same o'clock last 10 days. But this machine's 
uptime was 96days.

Suddenly reboot this machine and now this continue every night again and again.

I didn't find any reason and I didn't change anything else.

I looked last command, 
reboot ~ ~   AM 03.15  ~


I checked dmesg -a , it has only generic things no other things.

I checked /var/log/messages

I wrote script and checked swapinfo , cpu and memory but all of them was normal 
before reboot.

I checked crontab and scripts.

I checked crashinfo  but no dump from kernel.

I didn't find any useful info from this commands.

I checked praudit /var/audir/20130930..CrashRecovery  
and I didn't understand very well from this file but I think this means my 
machine crash and reboot isn't it??

And How can I understand what is the reason of rebooting my FreeBSD8.3 server.

Please help I need to find cause of reboot..
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: cause of reboot

2013-09-30 Thread Jason Hellenthal
Did you check for the command reboot in all of the periodic scripts and cron 
jobs as well as the command shutdown ?



On Sep 30, 2013, at 10:24, Emre Çamalan mail...@yandex.com wrote:

Hi,
my server reboots every night and same o'clock last 10 days. But this machine's 
uptime was 96days.

Suddenly reboot this machine and now this continue every night again and again.

I didn't find any reason and I didn't change anything else.

I looked last command, 
reboot ~ ~   AM 03.15  ~


I checked dmesg -a , it has only generic things no other things.

I checked /var/log/messages

I wrote script and checked swapinfo , cpu and memory but all of them was normal 
before reboot.

I checked crontab and scripts.

I checked crashinfo  but no dump from kernel.

I didn't find any useful info from this commands.

I checked praudit /var/audir/20130930..CrashRecovery
and I didn't understand very well from this file but I think this means my 
machine crash and reboot isn't it??

And How can I understand what is the reason of rebooting my FreeBSD8.3 server.

Please help I need to find cause of reboot..
___
freebsd-hack...@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to freebsd-hackers-unsubscr...@freebsd.org


smime.p7s
Description: S/MIME cryptographic signature


Re: cause of reboot

2013-09-30 Thread Steve O'Hara-Smith
On Mon, 30 Sep 2013 17:24:18 +0300
Emre Çamalan mail...@yandex.com wrote:

 Hi,
 my server reboots every night and same o'clock last 10 days. But this
 machine's uptime was 96days.
 
 Suddenly reboot this machine and now this continue every night again and
 again.
 
 I didn't find any reason and I didn't change anything else.
 
 I looked last command, 
 reboot ~ ~   AM 03.15  ~

That's likely something in the daily run going wrong, try disabling
items in there one by one (by editing /etc/periodic.conf - which probably
doesn't yet exist so create it and look in /etc/defaults/periodic.conf for
details) until you get through a night without a reboot. Then the next step
is to figure out why whatever is crashing the system does so, but first
let's find out what.

-- 
Steve O'Hara-Smith st...@sohara.org
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

Re: cause of reboot

2013-09-30 Thread Emre Çamalan
Yes I checked also it , such as ;
grep -i 'reboot' /root/scripts/* -R
grep -i 'halt' /root/scripts/* -R
grep -i 'shutdown' /root/scripts/* -R


30.09.2013, 17:33, Jason Hellenthal jhellent...@dataix.net:
 Did you check for the command reboot in all of the periodic scripts and cron 
 jobs as well as the command shutdown ?

 On Sep 30, 2013, at 10:24, Emre Çamalan mail...@yandex.com wrote:

 Hi,
 my server reboots every night and same o'clock last 10 days. But this 
 machine's uptime was 96days.

 Suddenly reboot this machine and now this continue every night again and 
 again.

 I didn't find any reason and I didn't change anything else.

 I looked last command,
 reboot ~ ~   AM 03.15  ~

 I checked dmesg -a , it has only generic things no other things.

 I checked /var/log/messages

 I wrote script and checked swapinfo , cpu and memory but all of them was 
 normal before reboot.

 I checked crontab and scripts.

 I checked crashinfo  but no dump from kernel.

 I didn't find any useful info from this commands.

 I checked praudit /var/audir/20130930..CrashRecovery
 and I didn't understand very well from this file but I think this means my 
 machine crash and reboot isn't it??

 And How can I understand what is the reason of rebooting my FreeBSD8.3 server.

 Please help I need to find cause of reboot..
 ___
 freebsd-hack...@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
 To unsubscribe, send any mail to freebsd-hackers-unsubscr...@freebsd.org
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

Re: cause of reboot

2013-09-30 Thread Jason Hellenthal
And /etc/ ?
And /var/cron ?



On Sep 30, 2013, at 11:00, Emre Çamalan mail...@yandex.com wrote:

Yes I checked also it , such as ;
grep -i 'reboot' /root/scripts/* -R
grep -i 'halt' /root/scripts/* -R
grep -i 'shutdown' /root/scripts/* -R


30.09.2013, 17:33, Jason Hellenthal jhellent...@dataix.net:
 Did you check for the command reboot in all of the periodic scripts and cron 
 jobs as well as the command shutdown ?
 
 On Sep 30, 2013, at 10:24, Emre Çamalan mail...@yandex.com wrote:
 
 Hi,
 my server reboots every night and same o'clock last 10 days. But this 
 machine's uptime was 96days.
 
 Suddenly reboot this machine and now this continue every night again and 
 again.
 
 I didn't find any reason and I didn't change anything else.
 
 I looked last command,
 reboot ~ ~   AM 03.15  ~
 
 I checked dmesg -a , it has only generic things no other things.
 
 I checked /var/log/messages
 
 I wrote script and checked swapinfo , cpu and memory but all of them was 
 normal before reboot.
 
 I checked crontab and scripts.
 
 I checked crashinfo  but no dump from kernel.
 
 I didn't find any useful info from this commands.
 
 I checked praudit /var/audir/20130930..CrashRecovery
 and I didn't understand very well from this file but I think this means my 
 machine crash and reboot isn't it??
 
 And How can I understand what is the reason of rebooting my FreeBSD8.3 server.
 
 Please help I need to find cause of reboot..
 ___
 freebsd-hack...@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
 To unsubscribe, send any mail to freebsd-hackers-unsubscr...@freebsd.org


smime.p7s
Description: S/MIME cryptographic signature


Re: cause of reboot

2013-09-30 Thread Jason Hellenthal
Also  . . . grep -ri  . . . 



On Sep 30, 2013, at 11:06, Jason Hellenthal jhellent...@dataix.net wrote:

And /etc/ ?
And /var/cron ?



On Sep 30, 2013, at 11:00, Emre Çamalan mail...@yandex.com wrote:

Yes I checked also it , such as ;
grep -i 'reboot' /root/scripts/* -R
grep -i 'halt' /root/scripts/* -R
grep -i 'shutdown' /root/scripts/* -R


30.09.2013, 17:33, Jason Hellenthal jhellent...@dataix.net:
 Did you check for the command reboot in all of the periodic scripts and cron 
 jobs as well as the command shutdown ?
 
 On Sep 30, 2013, at 10:24, Emre Çamalan mail...@yandex.com wrote:
 
 Hi,
 my server reboots every night and same o'clock last 10 days. But this 
 machine's uptime was 96days.
 
 Suddenly reboot this machine and now this continue every night again and 
 again.
 
 I didn't find any reason and I didn't change anything else.
 
 I looked last command,
 reboot ~ ~   AM 03.15  ~
 
 I checked dmesg -a , it has only generic things no other things.
 
 I checked /var/log/messages
 
 I wrote script and checked swapinfo , cpu and memory but all of them was 
 normal before reboot.
 
 I checked crontab and scripts.
 
 I checked crashinfo  but no dump from kernel.
 
 I didn't find any useful info from this commands.
 
 I checked praudit /var/audir/20130930..CrashRecovery
 and I didn't understand very well from this file but I think this means my 
 machine crash and reboot isn't it??
 
 And How can I understand what is the reason of rebooting my FreeBSD8.3 server.
 
 Please help I need to find cause of reboot..
 ___
 freebsd-hack...@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
 To unsubscribe, send any mail to freebsd-hackers-unsubscr...@freebsd.org


smime.p7s
Description: S/MIME cryptographic signature


Re: cause of reboot

2013-09-30 Thread Julian Elischer

On 9/30/13 11:08 PM, Jason Hellenthal wrote:

Also  . . . grep -ri  . . .



On Sep 30, 2013, at 11:06, Jason Hellenthal jhellent...@dataix.net wrote:

And /etc/ ?
And /var/cron ?



On Sep 30, 2013, at 11:00, Emre Çamalan mail...@yandex.com wrote:

Yes I checked also it , such as ;
grep -i 'reboot' /root/scripts/* -R
grep -i 'halt' /root/scripts/* -R
grep -i 'shutdown' /root/scripts/* -R


30.09.2013, 17:33, Jason Hellenthal jhellent...@dataix.net:

Did you check for the command reboot in all of the periodic scripts and cron 
jobs as well as the command shutdown ?

On Sep 30, 2013, at 10:24, Emre Çamalan mail...@yandex.com wrote:

Hi,
my server reboots every night and same o'clock last 10 days. But this machine's 
uptime was 96days.

Suddenly reboot this machine and now this continue every night again and again.

I didn't find any reason and I didn't change anything else.

I looked last command,
reboot ~ ~   AM 03.15  ~

I checked dmesg -a , it has only generic things no other things.

I checked /var/log/messages

I wrote script and checked swapinfo , cpu and memory but all of them was normal 
before reboot.

I checked crontab and scripts.

I checked crashinfo  but no dump from kernel.

I didn't find any useful info from this commands.

I checked praudit /var/audir/20130930..CrashRecovery
and I didn't understand very well from this file but I think this means my 
machine crash and reboot isn't it??

And How can I understand what is the reason of rebooting my FreeBSD8.3 server.

Please help I need to find cause of reboot..
___
freebsd-hack...@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to freebsd-hackers-unsubscr...@freebsd.org


Firstly, does this machine have the kernel debugger installed?
if not then a kernel page fault will look like a reboot.

if you know when this is going to occur, then I suggest that you
set up some logging of every new process run, to a second machine, or 
possibly

you might just ssh int the machine and leave 'top' running
when it reboots you shoud get a snapshot of what's going on.

you could just rename 'reboot' and see if it still happens.
If not, then replace reboot (and friends) with a script that reports 
who called it.




___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

Re: cause of reboot

2013-09-30 Thread Patrick Lamaiziere
Le Mon, 30 Sep 2013 17:24:18 +0300,
Emre Çamalan mail...@yandex.com a écrit :

 Hi,
 my server reboots every night and same o'clock last 10 days. But this
 machine's uptime was 96days.
 
 Suddenly reboot this machine and now this continue every night again
 and again.
 
 I didn't find any reason and I didn't change anything else.
 
 I looked last command, 
 reboot ~ ~   AM 03.15  ~

The last time It happened (one month ago) to me it was the hard disk
(periodic scripts read a large part of the disk).

If the disk is smart capable try a full test with smartctl
(sysutils/smartmontools)

HTH, regards
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: cause of reboot

2013-09-30 Thread Michael Powell
Patrick Lamaiziere wrote:
[snip]
 
 I looked last command,
 reboot ~ ~   AM 03.15  ~
 
 The last time It happened (one month ago) to me it was the hard disk
 (periodic scripts read a large part of the disk).
 
 If the disk is smart capable try a full test with smartctl
 (sysutils/smartmontools)

My gateway/firewall/mail/ids router box at home has 2 GB RAM in it, so 
normally it has enough extra room that nothing ever pushes over into swap 
with one exception: the periodic run at 0300. It is generally never more 
than just a few kilobytes, but I find it slightly surprising nonetheless.

If a sector (or more) on the drive that is backing the swap partition has 
gone bad it might not even be noticeable until something pages out to swap 
(like my 0300 periodic run). 

If the drive is a WD the 'Quick' test using the manufacturers' wddiags 
utility should spot it, and is non-destructive. I have occasionally seen the 
full test not destroy data - but I wouldn't count on it being non-
destructive. However, as long as the remap area isn't full the long test 
will repair the drive by relocating and mapping out the bad spot. When this 
silent fading away of magnetic media occurs wrt to higher-end RAID 
controllers the scrub function in the controller BIOS is where you would 
want to go.

The other problem relative to this that I've run into is the apple before 
the cart syndrome around backups. I have seen dump fail to allow for backing 
up data prior to using the full wddiags to repair a drive so you kinda get 
stuck. If the full test is going to wipe the drive and you can't generate a 
fresh current backup you're stuck only being able to restore whatever is the 
last good backup you have on hand.

Wouldn't surpise me at all if this were to turn out to be the drive just 
recently grew one or more bad spots. A bad spot or few on an old drive that 
gets repaired I might continue to use the drive for a while, maybe even for 
like a year time-frame wise. If 2 months later it starts growing more bad 
spots the drive goes in the rubbish bin.

-Mike




___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: cause of reboot

2013-09-30 Thread Michael Powell
kpn...@pobox.com wrote:
[snip]
 While we're throwing ideas onto the table let me mention power supplies.
 Power supplies and hard drives are in a race to see which one will fail
 first. It may be that the power supply is marginal and added load from
 the drives being hit hard may send it over the edge. How heavily loaded
 is the machine in question?

Absolute and total agreement with this. As they age and the filter caps leak 
and dry out more it will eventually become apparent. But in the meantime the 
output DC can just about meet spec up until really loaded. Then the ripple 
becomes so excessive it's not quite DC any longer. You can clearly see it 
using an oscilloscope. 

The 0300 AM periodic does hammer a machine enough to possibly push a 
marginal power supply over the edge. I once had a box where the RAM chips 
would sing with a high-pitched whistle only during the 0300 periodic run. 
It sounded just like the horizontal output on a television right before 
destruction.  :-) 
 
[snip]

-Mike


___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: cause of reboot

2013-09-30 Thread Polytropon
On Mon, 30 Sep 2013 20:32:39 -0400, Michael Powell wrote:
 kpn...@pobox.com wrote:
 [snip]
  While we're throwing ideas onto the table let me mention power supplies.
  Power supplies and hard drives are in a race to see which one will fail
  first. It may be that the power supply is marginal and added load from
  the drives being hit hard may send it over the edge. How heavily loaded
  is the machine in question?
 
 Absolute and total agreement with this.

The idea of a hardware problem looks more and more obvious here.

A software configuration problem could be located by diff'ing
the currently used files against stock files, or by checking
the logs of a versioning system (if you use one to track your
local configuration file changes, for example in a CVS reposi-
tory).

It could be a matter of power (by more than usual drain when
the machine is heavily loaded), but also a file system inconsis-
tency is possible. In case the machine is using a background
fsck that silently fails to deal with a specific damage, using
background_fsck=NO in /etc/rc.conf to _definitely_ bring the
file systems up _clean_ prior to multi-user mode booting would
probably be a good idea. Using smartctl to check the hard disks
SMART data would make sure the disk is not dying (and the reboot
is an effect of that).

Monitoring the server when (or while) it reboots would surely be
interesting. Maybe open some sessions to have a close look at
programs like top, systat -v and mbmon (to check for
temperatures and voltages) - and when run at 1 second intervals,
it should be possible to obtain a good system status diagram
of the last state before reboot, when the connection drops.



 I once had a box where the RAM chips 
 would sing with a high-pitched whistle only during the 0300 periodic run. 
 It sounded just like the horizontal output on a television right before 
 destruction.  :-) 

I have heared something comparable from a graphics card when
developing some OpenGL demo stuff. :-)




-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org