Hi, scdpmd is not working because there is a problem with dpmccr_initialize function within scdpmd daemon.
Piotr Jasiukajtis pisze: > No, I can't find any scdpmd core files. > > Venkateswarlu Tella pisze: >> Hi Piotr, >> It looks like the scpdm processes which has been started are going away. >> Is that because the scdpdm is core dumping? >> Can you check if there are scdpmd cores? >> >> Thanks >> -Venku >> >> On 10/24/08 01:07, Piotr Jasiukajtis wrote: >>> Hi, >>> >>> [ pa? 22 10:04:33 Disabled. ] >>> [ pa? 22 10:04:33 Rereading configuration. ] >>> [ pa? 22 10:11:31 Disabled. ] >>> [ pa? 22 10:33:14 Disabled. ] >>> [ pa? 22 10:33:19 Enabled. ] >>> [ pa? 22 10:38:07 Disabled. ] >>> [ pa? 22 10:39:50 Disabled. ] >>> [ pa? 22 10:41:07 Enabled. ] >>> [ pa? 22 10:41:10 Disabled. ] >>> [ pa? 22 10:49:04 Enabled. ] >>> [ pa? 22 11:02:15 Disabled. ] >>> [ pa? 22 11:03:41 Enabled. ] >>> [ pa? 22 11:05:38 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:42 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:42 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:42 Executing stop method (:kill). ] >>> [ pa? 22 11:05:42 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:44 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:44 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:44 Executing stop method (:kill). ] >>> [ pa? 22 11:05:44 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:45 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:45 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:45 Executing stop method (:kill). ] >>> [ pa? 22 11:05:45 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:47 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:47 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:47 Executing stop method (:kill). ] >>> [ pa? 22 11:05:47 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:48 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:48 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:48 Executing stop method (:kill). ] >>> [ pa? 22 11:05:48 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:49 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:49 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:49 Executing stop method (:kill). ] >>> [ pa? 22 11:05:49 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:50 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:50 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:50 Executing stop method (:kill). ] >>> [ pa? 22 11:05:50 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:50 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:50 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:50 Executing stop method (:kill). ] >>> [ pa? 22 11:05:50 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:51 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:51 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:51 Executing stop method (:kill). ] >>> [ pa? 22 11:05:51 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:05:51 Method "start" exited with status 0. ] >>> [ pa? 22 11:05:51 Stopping because all processes in service exited. ] >>> [ pa? 22 11:05:51 Executing stop method (:kill). ] >>> [ pa? 22 11:05:51 Restarting too quickly, changing state to >>> maintenance. ] >>> [ pa? 22 11:40:42 Leaving maintenance because clear requested. ] >>> [ pa? 22 11:40:42 Enabled. ] >>> [ pa? 22 11:40:42 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:42 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:43 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:43 Executing stop method (:kill). ] >>> [ pa? 22 11:40:43 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:43 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:43 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:43 Executing stop method (:kill). ] >>> [ pa? 22 11:40:43 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:44 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:44 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:44 Executing stop method (:kill). ] >>> [ pa? 22 11:40:44 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:44 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:44 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:44 Executing stop method (:kill). ] >>> [ pa? 22 11:40:44 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:44 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:44 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:45 Executing stop method (:kill). ] >>> [ pa? 22 11:40:45 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:45 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:45 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:45 Executing stop method (:kill). ] >>> [ pa? 22 11:40:45 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:46 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:46 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:46 Executing stop method (:kill). ] >>> [ pa? 22 11:40:46 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:46 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:46 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:46 Executing stop method (:kill). ] >>> [ pa? 22 11:40:47 Executing start method >>> ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] >>> [ pa? 22 11:40:47 Method "start" exited with status 0. ] >>> [ pa? 22 11:40:47 Stopping because all processes in service exited. ] >>> [ pa? 22 11:40:47 Executing stop method (:kill). ] >>> [ pa? 22 11:40:47 Restarting too quickly, changing state to >>> maintenance. ] >>> >>> >>> >>> >>> >>> bash-3.2# /usr/cluster/lib/svc/method/svc_scdpm start >>> bash-3.2# echo $? >>> 0 >>> bash-3.2# . /lib/svc/share/smf_include.sh >>> bash-3.2# LIBSCDIR=/usr/cluster/lib/sc >>> bash-3.2# USRBIN=/usr/bin >>> bash-3.2# SERVER=scdpmd >>> bash-3.2# SCLIB=/usr/cluster/lib/sc >>> bash-3.2# ${LIBSCDIR}/${SERVER} >>> bash-3.2# echo $? >>> 0 >>> bash-3.2# svcs -xv >>> svc:/system/cluster/scdpm:default (Sun Cluster Disk Path Monitoring >>> Daemon) >>> State: maintenance since 22 pa?dziernika 2008 11:48:44 CEST >>> Reason: Restarting too quickly. >>> See: http://sun.com/msg/SMF-8000-L5 >>> See: man -M /usr/cluster/man -s 1M scdpm >>> See: /var/svc/log/system-cluster-scdpm:default.log >>> Impact: 1 dependent service is not running: >>> svc:/system/cluster/cl-svc-cluster-milestone:default >>> >>> svc:/system/cluster/scsymon-srv:default (Sun Cluster SyMON Server Daemon) >>> State: offline since 22 pa?dziernika 2008 11:03:42 CEST >>> Reason: Dependency svc:/application/management/sunmcagent:default is >>> absent. >>> See: http://sun.com/msg/SMF-8000-E2 >>> Impact: This service is not running. >>> bash-3.2# >>> >>> >>> >>> >>> >>> Venkateswarlu Tella pisze: >>>> Hi Piotr, >>>> Can you check the scdpm log to find why this be being failed to start? >>>> #cat /var/svc/log/system-cluster-scdpm:default.log >>>> >>>> Thanks >>>> -Venku >>>> >>>> On 10/22/08 14:56, Piotr Jasiukajtis wrote: >>>>> Hi, >>>>> >>>>> I have Colorado single node cluster up and running in VirtualBox but: >>>>> >>>>> 1. These services are not running: >>>>> >>>>> svc:/system/cluster/scdpm:default (Sun Cluster Disk Path Monitoring >>>>> Daemon) >>>>> State: maintenance since 22 pa?dziernika 2008 11:05:51 CEST >>>>> Reason: Restarting too quickly. >>>>> See: http://sun.com/msg/SMF-8000-L5 >>>>> See: man -M /usr/cluster/man -s 1M scdpm >>>>> See: /var/svc/log/system-cluster-scdpm:default.log >>>>> Impact: 1 dependent service is not running: >>>>> svc:/system/cluster/cl-svc-cluster-milestone:default >>>>> >>>>> svc:/system/cluster/scsymon-srv:default (Sun Cluster SyMON Server >>>>> Daemon) >>>>> State: offline since 22 pa?dziernika 2008 11:03:42 CEST >>>>> Reason: Dependency svc:/application/management/sunmcagent:default is >>>>> absent. >>>>> See: http://sun.com/msg/SMF-8000-E2 >>>>> Impact: This service is not running. >>>>> >>>>> >>>>> Well, service 'scsymon-srv:default' is not running by default even with >>>>> Sun Cluster 3.2 ... >>>>> >>>>> >>>>> 2. I think 'scinstall' should check if physical:nwam is running since >>>>> nwam is enabled by default in OpenSolaris. >>>>> OpenSolaris by default add hostname to the 127.0.0.1 address. >>>>> >>>>> >>>>> bash-3.2# mdb -k 0 >>>>> Loading modules: [ unix genunix dtrace specfs cpu.generic uppc pcplusmp >>>>> scsi_vhci zfs ip hook neti sctp arp usba uhci s1394 qlc fctl md lofs >>>>> fcip fcp cpc random crypto logindmux ptm ufs sppp nsmb sd ] >>>>>> ::panicinfo >>>>> cpu 0 >>>>> thread dfc45c00 >>>>> message BAD TRAP: type=e (#pf Page fault) rp=dfc5a8ac addr=0 >>>>> occurred in module "<unknown>" due to a NULL pointer dereference >>>>> gs 1b0 >>>>> fs fe910000 >>>>> es d3030160 >>>>> ds d3030160 >>>>> edi 0 >>>>> esi 0 >>>>> ebp dfc5a924 >>>>> esp dfc5a8e4 >>>>> ebx dfde7820 >>>>> edx dfc5a914 >>>>> ecx dfc5a91a >>>>> eax 800 >>>>> trapno e >>>>> err 0 >>>>> eip 0 >>>>> cs 158 >>>>> eflags 10296 >>>>> uesp f9e9d87f >>>>> ss ff >>>>> gdt fe7fe00002cf >>>>> idt fe7fd00007ff >>>>> ldt 0 >>>>> task 150 >>>>> cr0 80050033 >>>>> cr2 0 >>>>> cr3 214e9000 >>>>> cr4 698 >>>>>> ::status >>>>> debugging crash dump vmcore.0 (32-bit) from osdevbox >>>>> operating system: 5.11 snv_99 (i86pc) >>>>> panic message: BAD TRAP: type=e (#pf Page fault) rp=dfc5a8ac addr=0 >>>>> occurred in module "<unknown>" due to a NULL pointer dereference >>>>> dump content: kernel pages only >>>>>> $C >>>>> dfc5a944 clprivnet_wput+0x62(dfc547a8, dfde7820) >>>>> dfc5a984 putnext+0x1bc(dfc54640, dfde7820) >>>>> dfc5a9a0 softmac_m_tx+0x93(dd8d8380, dfde7820) >>>>> dfc5a9b8 dls_tx+0x16(dfc52ea8, dfde7820) >>>>> dfc5a9d8 dld_tx_single+0x1f(dfc53c38, dfde7820) >>>>> dfc5a9f4 str_mdata_fastpath_put+0x60(dfc53c38, dfde7820) >>>>> dfc5aa24 dld_wput+0x56(dfc50eb8, dfde7820) >>>>> dfc5aa64 putnext+0x1bc(dfc50d50, dfde7820) >>>>> dfc5aabc ip_xmit_v4+0x392(dfde7820, d8356408, 0, 1) >>>>> dfc5ac24 ip_wput_ire+0x1ab7(d8a854d0, dfde7820, d8356408, d8279a40, >>>>> 2, 0) >>>>> dfc5ac98 ip_output_options+0x1c58(d8279a40, dfde7700, d8a854d0, 2, >>>>> dfc5acd0) >>>>> dfc5ad50 udp_output_v4+0x547(d8279a40, 0, ffffffff, 4300, 0, dfc5ada0) >>>>> dfc5ada4 udp_wput+0x203(d8a854d0, dfde7840) >>>>> dfc5adf8 sodgram_direct+0x15a(d827ab00, dfa3a620, 10, dfc5af30, 8000) >>>>> dfc5ae78 sosend_dgram+0x22d(d827ab00, dfa3a620, 10, dfc5af30, 8000) >>>>> dfc5aed0 sotpi_sendmsg+0x42e(d827ab00, dfc5af54, dfc5af30) >>>>> dfc5af18 sendit+0x12b(6, dfc5af54, dfc5af30, 8000) >>>>> dfc5af84 sendto+0x78() >>>>> dfc5afac sys_call+0x10c() >>>>>> ::cpuinfo >>>>> ID ADDR FLG NRUN BSPL PRI RNRN KRNRN SWITCH THREAD PROC >>>>> 0 fec21d18 1b 7 0 59 no no t-0 dfc45c00 dhcpagent >>>>>> ::regs >>>>> %cs = 0x0158 %eax = 0x00000800 >>>>> %ds = 0xd3030160 %ebx = 0xdfde7820 >>>>> %ss = 0x00ff %ecx = 0xdfc5a91a >>>>> %es = 0xd3030160 %edx = 0xdfc5a914 >>>>> %fs = 0xfe910000 %esi = 0x00000000 >>>>> %gs = 0x01b0 %edi = 0x00000000 >>>>> >>>>> %eip = 0x00000000 >>>>> %ebp = 0xdfc5a924 >>>>> %esp = 0xdfc5a8e4 >>>>> >>>>> %eflags = 0x00010296 >>>>> id=0 vip=0 vif=0 ac=0 vm=0 rf=1 nt=0 iopl=0x0 >>>>> status=<of,df,IF,tf,SF,zf,AF,PF,cf> >>>>> >>>>> %uesp = 0xf9e9d87f >>>>> %trapno = 0xe >>>>> %err = 0x0 >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>> > > -- Regards, Piotr Jasiukajtis | estibi | SCA OS0072 http://estseg.blogspot.com