Moving to fm-discuss.

Short answer is we'd need to see the fmd process core to be able to tell you 
what's going on.  You mention you're running build 101b, which is fairly old, 
so 
there's a reasonable chance that you're hitting a bug that's been fixed in a 
later build.

Can you send us the output of the following command?:

% pstack <path to fmd core file>

(by default fmd drops its cores in /var/fm/fmd/)

rob



Erich Weiler wrote:
> Hey ya'll,
> 
> Was hoping someone could shed some light on this...  I have an installation 
> of OpenSolaris (snv_101b), and I keep noticing that my / partition is full.  
> I discovered that /var/fm/fmd was filling it up with ~100GB of core.fmd.* 
> files.  I can delete them easily enough, but sure enough, they come back 
> after a few days.
> 
> I see this:
> 
> # svcs -xv
> svc:/system/intrd:default (interrupt balancer)
>  State: maintenance since Sun Sep 13 19:14:36 2009
> Reason: Restarting too quickly.
>    See: http://sun.com/msg/SMF-8000-L5
>    See: man -M /usr/share/man -s 1M intrd
>    See: /var/svc/log/system-intrd:default.log
> Impact: This service is not running.
> 
> svc:/system/fmd:default (Solaris Fault Manager)
>  State: maintenance since Mon Sep 14 04:15:03 2009
> Reason: Restarting too quickly.
>    See: http://sun.com/msg/SMF-8000-L5
>    See: man -M /usr/share/man -s 1M fmd
>    See: /var/svc/log/system-fmd:default.log
> Impact: This service is not running.
> 
> Looking at /var/svc/log/system-fmd:default.log shows nothing specific:
> 
> [ Sep 14 04:14:35 Executing start method ("/usr/lib/fm/fmd/fmd"). ]
> [ Sep 14 04:14:35 Method "start" exited with status 0. ]
> [ Sep 14 04:14:41 Stopping because process dumped core. ]
> [ Sep 14 04:14:41 Executing stop method (:kill). ]
> [ Sep 14 04:14:41 Executing start method ("/usr/lib/fm/fmd/fmd"). ]
> [ Sep 14 04:14:42 Method "start" exited with status 0. ]
> [ Sep 14 04:14:43 Stopping because process dumped core. ]
> [ Sep 14 04:14:43 Executing stop method (:kill). ]
> [ Sep 14 04:14:43 Executing start method ("/usr/lib/fm/fmd/fmd"). ]
> [ Sep 14 04:14:44 Method "start" exited with status 0. ]
> [ Sep 14 04:14:49 Stopping because process dumped core. ]
> [ Sep 14 04:14:50 Executing stop method (:kill). ]
> [ Sep 14 04:14:50 Executing start method ("/usr/lib/fm/fmd/fmd"). ]
> [ Sep 14 04:14:50 Method "start" exited with status 0. ]
> [ Sep 14 04:14:56 Method "start" exited with status 0. ]
> [ Sep 14 04:14:57 Stopping because process dumped core. ]
> [ Sep 14 04:14:57 Executing stop method (:kill). ]
> [ Sep 14 04:14:57 Executing start method ("/usr/lib/fm/fmd/fmd"). ]
> [ Sep 14 04:14:58 Method "start" exited with status 0. ]
> [ Sep 14 04:15:02 Method "start" exited with status 0. ]
> 
> Just for fun I also looked at the intrd service because that seemed to be 
> having problems too.  
> 
> # view /var/svc/log/system-intrd:default.log
> ...
> Config.pm did not return a true value at 
> /usr/perl5/5.8.4/lib/i86pc-solaris-64int/DynaLoader.pm line 25.
> BEGIN failed--compilation aborted at 
> /usr/perl5/5.8.4/lib/i86pc-solaris-64int/DynaLoader.pm line 25.
> Compilation failed in require at /usr/perl5/5.8.4/lib/Sun/Solaris/Kstat.pm 
> line 33.
> BEGIN failed--compilation aborted at 
> /usr/perl5/5.8.4/lib/Sun/Solaris/Kstat.pm line 33.
> Compilation failed in require at /usr/lib/intrd line 69.
> [ Sep 13 19:14:36 Stopping because all processes in service exited. ]
> [ Sep 13 19:14:36 Executing stop method (:kill). ]
> [ Sep 13 19:14:36 Executing start method ("/lib/svc/method/svc-intrd"). ]
> [ Sep 13 19:14:36 Method "start" exited with status 0. ]
> Config.pm did not return a true value at 
> /usr/perl5/5.8.4/lib/i86pc-solaris-64int/DynaLoader.pm line 25.
> BEGIN failed--compilation aborted at 
> /usr/perl5/5.8.4/lib/i86pc-solaris-64int/DynaLoader.pm line 25.
> Compilation failed in require at /usr/perl5/5.8.4/lib/Sun/Solaris/Kstat.pm 
> line 33.
> BEGIN failed--compilation aborted at 
> /usr/perl5/5.8.4/lib/Sun/Solaris/Kstat.pm line 33.
> Compilation failed in require at /usr/lib/intrd line 69.
> [ Sep 13 19:14:36 Stopping because all processes in service exited. ]
> [ Sep 13 19:14:36 Executing stop method (:kill). ]
> [ Sep 13 19:14:36 Restarting too quickly, changing state to maintenance. ]
> ...
> 
> I see nothing in /var/adm/messages referencing fmd, but I do see this entry 
> for intrd:
> 
> Sep 13 19:14:36 myserver svc.startd[7]: [ID 748625 daemon.error] 
> system/intrd:default failed repeatedly: transitioned to maintenance (see 
> 'svcs -xv' for details)
> 
> Does anyone have an idea of where I should look next?  intrd and fmd services 
> are not online, and I can't seem to bring them up.


Reply via email to