Hi,

scdpmd is not working because there is a problem with dpmccr_initialize
function within scdpmd daemon.

Piotr Jasiukajtis pisze:
> No, I can't find any scdpmd core files.
> 
> Venkateswarlu Tella pisze:
>> Hi Piotr,
>> It looks like the scpdm processes which has been started are going away.
>> Is that because the scdpdm is core dumping?
>> Can you check if there are scdpmd cores?
>>
>> Thanks
>> -Venku
>>
>> On 10/24/08 01:07, Piotr Jasiukajtis wrote:
>>> Hi,
>>>
>>> [ pa? 22 10:04:33 Disabled. ]
>>> [ pa? 22 10:04:33 Rereading configuration. ]
>>> [ pa? 22 10:11:31 Disabled. ]
>>> [ pa? 22 10:33:14 Disabled. ]
>>> [ pa? 22 10:33:19 Enabled. ]
>>> [ pa? 22 10:38:07 Disabled. ]
>>> [ pa? 22 10:39:50 Disabled. ]
>>> [ pa? 22 10:41:07 Enabled. ]
>>> [ pa? 22 10:41:10 Disabled. ]
>>> [ pa? 22 10:49:04 Enabled. ]
>>> [ pa? 22 11:02:15 Disabled. ]
>>> [ pa? 22 11:03:41 Enabled. ]
>>> [ pa? 22 11:05:38 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:42 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:42 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:42 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:42 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:44 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:44 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:44 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:44 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:45 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:45 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:45 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:45 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:47 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:47 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:47 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:47 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:48 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:48 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:48 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:48 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:49 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:49 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:49 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:49 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:50 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:50 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:50 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:50 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:50 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:50 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:50 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:50 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:51 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:51 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:51 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:51 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:05:51 Method "start" exited with status 0. ]
>>> [ pa? 22 11:05:51 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:05:51 Executing stop method (:kill). ]
>>> [ pa? 22 11:05:51 Restarting too quickly, changing state to
>>> maintenance. ]
>>> [ pa? 22 11:40:42 Leaving maintenance because clear requested. ]
>>> [ pa? 22 11:40:42 Enabled. ]
>>> [ pa? 22 11:40:42 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:42 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:43 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:43 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:43 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:43 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:43 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:43 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:43 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:44 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:44 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:44 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:44 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:44 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:44 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:44 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:44 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:44 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:44 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:45 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:45 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:45 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:45 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:45 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:45 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:46 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:46 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:46 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:46 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:46 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:46 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:46 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:47 Executing start method
>>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
>>> [ pa? 22 11:40:47 Method "start" exited with status 0. ]
>>> [ pa? 22 11:40:47 Stopping because all processes in service exited. ]
>>> [ pa? 22 11:40:47 Executing stop method (:kill). ]
>>> [ pa? 22 11:40:47 Restarting too quickly, changing state to
>>> maintenance. ]
>>>
>>>
>>>
>>>
>>>
>>> bash-3.2# /usr/cluster/lib/svc/method/svc_scdpm start
>>> bash-3.2# echo $?
>>> 0
>>> bash-3.2# . /lib/svc/share/smf_include.sh
>>> bash-3.2# LIBSCDIR=/usr/cluster/lib/sc
>>> bash-3.2# USRBIN=/usr/bin
>>> bash-3.2# SERVER=scdpmd
>>> bash-3.2# SCLIB=/usr/cluster/lib/sc
>>> bash-3.2# ${LIBSCDIR}/${SERVER}
>>> bash-3.2# echo $?
>>> 0
>>> bash-3.2# svcs -xv
>>> svc:/system/cluster/scdpm:default (Sun Cluster Disk Path Monitoring
>>> Daemon)
>>>  State: maintenance since 22 pa?dziernika 2008 11:48:44 CEST
>>> Reason: Restarting too quickly.
>>>    See: http://sun.com/msg/SMF-8000-L5
>>>    See: man -M /usr/cluster/man -s 1M scdpm
>>>    See: /var/svc/log/system-cluster-scdpm:default.log
>>> Impact: 1 dependent service is not running:
>>>         svc:/system/cluster/cl-svc-cluster-milestone:default
>>>
>>> svc:/system/cluster/scsymon-srv:default (Sun Cluster SyMON Server Daemon)
>>>  State: offline since 22 pa?dziernika 2008 11:03:42 CEST
>>> Reason: Dependency svc:/application/management/sunmcagent:default is
>>> absent.
>>>    See: http://sun.com/msg/SMF-8000-E2
>>> Impact: This service is not running.
>>> bash-3.2#
>>>
>>>
>>>
>>>
>>>
>>> Venkateswarlu Tella pisze:
>>>> Hi Piotr,
>>>> Can you check the scdpm log to find why this be being failed to start?
>>>> #cat /var/svc/log/system-cluster-scdpm:default.log
>>>>
>>>> Thanks
>>>> -Venku
>>>>
>>>> On 10/22/08 14:56, Piotr Jasiukajtis wrote:
>>>>> Hi,
>>>>>
>>>>> I have Colorado single node cluster up and running in VirtualBox but:
>>>>>
>>>>> 1. These services are not running:
>>>>>
>>>>> svc:/system/cluster/scdpm:default (Sun Cluster Disk Path Monitoring
>>>>> Daemon)
>>>>>  State: maintenance since 22 pa?dziernika 2008 11:05:51 CEST
>>>>> Reason: Restarting too quickly.
>>>>>    See: http://sun.com/msg/SMF-8000-L5
>>>>>    See: man -M /usr/cluster/man -s 1M scdpm
>>>>>    See: /var/svc/log/system-cluster-scdpm:default.log
>>>>> Impact: 1 dependent service is not running:
>>>>>         svc:/system/cluster/cl-svc-cluster-milestone:default
>>>>>
>>>>> svc:/system/cluster/scsymon-srv:default (Sun Cluster SyMON Server
>>>>> Daemon)
>>>>>  State: offline since 22 pa?dziernika 2008 11:03:42 CEST
>>>>> Reason: Dependency svc:/application/management/sunmcagent:default is
>>>>> absent.
>>>>>    See: http://sun.com/msg/SMF-8000-E2
>>>>> Impact: This service is not running.
>>>>>
>>>>>
>>>>> Well, service 'scsymon-srv:default' is not running by default even with
>>>>> Sun Cluster 3.2 ...
>>>>>
>>>>>
>>>>> 2. I think 'scinstall' should check if physical:nwam is running since
>>>>> nwam is enabled by default in OpenSolaris.
>>>>> OpenSolaris by default add hostname to the 127.0.0.1 address.
>>>>>
>>>>>
>>>>> bash-3.2# mdb -k 0
>>>>> Loading modules: [ unix genunix dtrace specfs cpu.generic uppc pcplusmp
>>>>> scsi_vhci zfs ip hook neti sctp arp usba uhci s1394 qlc fctl md lofs
>>>>> fcip fcp cpc random crypto logindmux ptm ufs sppp nsmb sd ]
>>>>>> ::panicinfo
>>>>>              cpu        0
>>>>>           thread dfc45c00
>>>>>          message BAD TRAP: type=e (#pf Page fault) rp=dfc5a8ac addr=0
>>>>> occurred in module "<unknown>" due to a NULL pointer dereference
>>>>>               gs      1b0
>>>>>               fs fe910000
>>>>>               es d3030160
>>>>>               ds d3030160
>>>>>              edi        0
>>>>>              esi        0
>>>>>              ebp dfc5a924
>>>>>              esp dfc5a8e4
>>>>>              ebx dfde7820
>>>>>              edx dfc5a914
>>>>>              ecx dfc5a91a
>>>>>              eax      800
>>>>>           trapno        e
>>>>>              err        0
>>>>>              eip        0
>>>>>               cs      158
>>>>>           eflags    10296
>>>>>             uesp f9e9d87f
>>>>>               ss       ff
>>>>>              gdt fe7fe00002cf
>>>>>              idt fe7fd00007ff
>>>>>              ldt        0
>>>>>             task      150
>>>>>              cr0 80050033
>>>>>              cr2        0
>>>>>              cr3 214e9000
>>>>>              cr4      698
>>>>>> ::status
>>>>> debugging crash dump vmcore.0 (32-bit) from osdevbox
>>>>> operating system: 5.11 snv_99 (i86pc)
>>>>> panic message: BAD TRAP: type=e (#pf Page fault) rp=dfc5a8ac addr=0
>>>>> occurred in module "<unknown>" due to a NULL pointer dereference
>>>>> dump content: kernel pages only
>>>>>> $C
>>>>> dfc5a944 clprivnet_wput+0x62(dfc547a8, dfde7820)
>>>>> dfc5a984 putnext+0x1bc(dfc54640, dfde7820)
>>>>> dfc5a9a0 softmac_m_tx+0x93(dd8d8380, dfde7820)
>>>>> dfc5a9b8 dls_tx+0x16(dfc52ea8, dfde7820)
>>>>> dfc5a9d8 dld_tx_single+0x1f(dfc53c38, dfde7820)
>>>>> dfc5a9f4 str_mdata_fastpath_put+0x60(dfc53c38, dfde7820)
>>>>> dfc5aa24 dld_wput+0x56(dfc50eb8, dfde7820)
>>>>> dfc5aa64 putnext+0x1bc(dfc50d50, dfde7820)
>>>>> dfc5aabc ip_xmit_v4+0x392(dfde7820, d8356408, 0, 1)
>>>>> dfc5ac24 ip_wput_ire+0x1ab7(d8a854d0, dfde7820, d8356408, d8279a40,
>>>>> 2, 0)
>>>>> dfc5ac98 ip_output_options+0x1c58(d8279a40, dfde7700, d8a854d0, 2,
>>>>> dfc5acd0)
>>>>> dfc5ad50 udp_output_v4+0x547(d8279a40, 0, ffffffff, 4300, 0, dfc5ada0)
>>>>> dfc5ada4 udp_wput+0x203(d8a854d0, dfde7840)
>>>>> dfc5adf8 sodgram_direct+0x15a(d827ab00, dfa3a620, 10, dfc5af30, 8000)
>>>>> dfc5ae78 sosend_dgram+0x22d(d827ab00, dfa3a620, 10, dfc5af30, 8000)
>>>>> dfc5aed0 sotpi_sendmsg+0x42e(d827ab00, dfc5af54, dfc5af30)
>>>>> dfc5af18 sendit+0x12b(6, dfc5af54, dfc5af30, 8000)
>>>>> dfc5af84 sendto+0x78()
>>>>> dfc5afac sys_call+0x10c()
>>>>>> ::cpuinfo
>>>>>  ID ADDR     FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD   PROC
>>>>>   0 fec21d18  1b    7    0  59   no    no t-0    dfc45c00 dhcpagent
>>>>>> ::regs
>>>>> %cs = 0x0158            %eax = 0x00000800
>>>>> %ds = 0xd3030160                %ebx = 0xdfde7820
>>>>> %ss = 0x00ff            %ecx = 0xdfc5a91a
>>>>> %es = 0xd3030160                %edx = 0xdfc5a914
>>>>> %fs = 0xfe910000                %esi = 0x00000000
>>>>> %gs = 0x01b0            %edi = 0x00000000
>>>>>
>>>>> %eip = 0x00000000
>>>>> %ebp = 0xdfc5a924
>>>>> %esp = 0xdfc5a8e4
>>>>>
>>>>> %eflags = 0x00010296
>>>>>   id=0 vip=0 vif=0 ac=0 vm=0 rf=1 nt=0 iopl=0x0
>>>>>   status=<of,df,IF,tf,SF,zf,AF,PF,cf>
>>>>>
>>>>>   %uesp = 0xf9e9d87f
>>>>> %trapno = 0xe
>>>>>    %err = 0x0
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>>>
>>>
> 
> 


-- 
Regards,
Piotr Jasiukajtis | estibi | SCA OS0072
http://estseg.blogspot.com

Reply via email to