Hi, I know it's not tested...
My plan was to install SCX on single machine in xVM. My dom0 is SXCE105. I installed 3 pvm domU: - n01 - sxce101 - n02 - sxce101a - n03 - sxce101a The 'sponsoring' node is n03. But there is a problem because n02 cannot communicate with n03. I don?t have an idea, just take a look: It?s on n02: >>> Cluster Name <<< Each cluster has a name assigned to it. When adding a node to the cluster, you must identify the name of the cluster you are attempting to join. A sanity check is performed to verify that the "sponsoring" node is a member of that cluster. What is the name of the cluster you want to join? vencl Attempting to contact "n03" ... timed out Unable to contact "n03" at this time. Do you want to try again (yes/no) [yes]? Attempting to contact "n03" ... timed out Unable to contact "n03" at this time. Do you want to try again (yes/no) [yes]? Attempting to contact "n03" ... timed out Unable to contact "n03" at this time. Do you want to try again (yes/no) [yes]? estibi at n03 ~> pfexec snoop -v -d xnf0 'host n02' Using device xnf0 (promiscuous mode) ETHER: ----- Ether Header ----- ETHER: ETHER: Packet 1 arrived at 15:07:6.61877 ETHER: Packet size = 98 bytes ETHER: Destination = 0:16:3e:9:6e:b2, ETHER: Source = 0:16:3e:42:20:11, ETHER: Ethertype = 0800 (IP) ETHER: IP: ----- IP Header ----- IP: IP: Version = 4 IP: Header length = 20 bytes IP: Type of service = 0x00 IP: xxx. .... = 0 (precedence) IP: ...0 .... = normal delay IP: .... 0... = normal throughput IP: .... .0.. = normal reliability IP: .... ..0. = not ECN capable transport IP: .... ...0 = no ECN congestion experienced IP: Total length = 84 bytes IP: Identification = 27545 IP: Flags = 0x4 IP: .1.. .... = do not fragment IP: ..0. .... = last fragment IP: Fragment offset = 0 bytes IP: Time to live = 255 seconds/hops IP: Protocol = 17 (UDP) IP: Header checksum = 8cdd IP: Source address = 192.168.0.232, n02.local IP: Destination address = 192.168.0.233, n03.local IP: No options IP: UDP: ----- UDP Header ----- UDP: UDP: Source port = 47601 UDP: Destination port = 111 (Sun RPC) UDP: Length = 64 UDP: Checksum = 0322 UDP: RPC: ----- SUN RPC Header ----- RPC: RPC: Transaction id = 1232758637 RPC: Type = 0 (Call) RPC: RPC version = 2 RPC: Program = 100000 (PMAP), version = 2, procedure = 3 RPC: Credentials: Flavor = 0 (None), len = 0 bytes RPC: Verifier : Flavor = 0 (None), len = 0 bytes RPC: PMAP: ----- Portmapper ----- PMAP: PMAP: Proc = 3 (Get port number) PMAP: Program = 100145 (?) PMAP: Version = 1 PMAP: Protocol = 6 (TCP) PMAP: ETHER: ----- Ether Header ----- ETHER: ETHER: Packet 2 arrived at 15:07:6.61913 ETHER: Packet size = 62 bytes ETHER: Destination = 0:16:3e:42:20:11, ETHER: Source = 0:16:3e:9:6e:b2, ETHER: Ethertype = 0800 (IP) ETHER: IP: ----- IP Header ----- IP: IP: Version = 4 IP: Header length = 20 bytes IP: Type of service = 0x00 IP: xxx. .... = 0 (precedence) IP: ...0 .... = normal delay IP: .... 0... = normal throughput IP: .... .0.. = normal reliability IP: .... ..0. = not ECN capable transport IP: .... ...0 = no ECN congestion experienced IP: Total length = 48 bytes IP: Identification = 46953 IP: Flags = 0x4 IP: .1.. .... = do not fragment IP: ..0. .... = last fragment IP: Fragment offset = 0 bytes IP: Time to live = 255 seconds/hops IP: Protocol = 17 (UDP) IP: Header checksum = 4131 IP: Source address = 192.168.0.233, n03.local IP: Destination address = 192.168.0.232, n02.local IP: No options IP: UDP: ----- UDP Header ----- UDP: UDP: Source port = 111 UDP: Destination port = 47601 (Sun RPC) UDP: Length = 28 UDP: Checksum = 0000 (no checksum) UDP: RPC: ----- SUN RPC Header ----- RPC: RPC: Transaction id = 1232758637 RPC: Type = 1 (Reply) RPC: This is a reply to frame 1 RPC: Status = 1 (Denied) RPC: Reject status = 1 (can't authenticate) RPC: Why = 7 (unknown reason) estibi at n03 ~> netstat -af inet UDP: IPv4 Local Address Remote Address State -------------------- -------------------- ---------- *.* Unbound *.* Unbound *.* Unbound *.sunrpc Idle *.* Unbound *.40547 Idle *.sunrpc Idle *.* Unbound *.42496 Idle *.51891 Idle *.50997 Idle *.mdns Idle *.ntp Idle localhost.ntp Idle n03.local.ntp Idle clusternode1-priv.ntp Idle *.11161 Idle TCP: IPv4 Local Address Remote Address Swind Send-Q Rwind Recv-Q State -------------------- -------------------- ----- ------ ----- ------ ----------- *.* *.* 0 0 49152 0 IDLE localhost.5999 *.* 0 0 49152 0 LISTEN *.scqsd *.* 0 0 49152 0 LISTEN *.scqsd *.* 0 0 49152 0 LISTEN *.* *.* 0 0 49152 0 IDLE *.sunrpc *.* 0 0 49152 0 LISTEN *.* *.* 0 0 49152 0 IDLE *.sunrpc *.* 0 0 49152 0 LISTEN *.* *.* 0 0 49152 0 IDLE *.ssh *.* 0 0 49152 0 LISTEN localhost.smtp *.* 0 0 49152 0 LISTEN localhost.submission *.* 0 0 49152 0 LISTEN localhost.5987 *.* 0 0 49152 0 LISTEN localhost.898 *.* 0 0 49152 0 LISTEN localhost.61180 *.* 0 0 49152 0 LISTEN localhost.5988 *.* 0 0 49152 0 LISTEN localhost.56361 *.* 0 0 49152 0 LISTEN *.64437 *.* 0 0 49152 0 LISTEN *.62590 *.* 0 0 49152 0 LISTEN *.56842 *.* 0 0 49152 0 LISTEN *.54907 *.* 0 0 49152 0 LISTEN *.sccheckd *.* 0 0 49152 0 LISTEN *.44383 *.* 0 0 49152 0 LISTEN *.51586 *.* 0 0 49152 0 LISTEN *.45459 *.* 0 0 49152 0 BOUND localhost.6788 *.* 0 0 49152 0 LISTEN localhost.6789 *.* 0 0 49152 0 LISTEN localhost.45970 *.* 0 0 49152 0 LISTEN n03.local.ssh 192.168.0.9.37828 49640 47 49640 0 ESTABLISHED localhost.6010 *.* 0 0 49152 0 LISTEN *.40373 *.* 0 0 49152 0 LISTEN *.35696 *.* 0 0 49152 0 LISTEN *.pnmd *.* 0 0 49152 0 LISTEN *.35135 *.* 0 0 49152 0 BOUND *.54599 *.* 0 0 49152 0 LISTEN *.11165 *.* 0 0 49152 0 LISTEN *.48888 *.* 0 0 49152 0 LISTEN *.50908 *.* 0 0 49152 0 LISTEN *.* *.* 0 0 49152 0 IDLE *.11164 *.* 0 0 49152 0 LISTEN *.11163 *.* 0 0 49152 0 LISTEN *.* *.* 0 0 49152 0 IDLE *.11162 *.* 0 0 49152 0 LISTEN SCTP: Local Address Remote Address Swind Send-Q Rwind Recv-Q StrsI/O State ------------------------------- ------------------------------- ------ ------ ------ ------ ------- ----------- 0.0.0.0 0.0.0.0 0 0 102400 0 32/32 CLOSED estibi at n03 ~> estibi at n03 ~> svcs -xv svc:/system/cluster/scdpm:default (Sun Cluster Disk Path Monitoring Daemon) State: maintenance since Sun Jan 18 12:21:51 2009 Reason: Restarting too quickly. See: http://sun.com/msg/SMF-8000-L5 See: man -M /usr/cluster/man -s 1M scdpm See: /var/svc/log/system-cluster-scdpm:default.log Impact: This service is not running. svc:/system/cluster/scsymon-srv:default (Sun Cluster SyMON Server Daemon) State: offline since Sun Jan 18 12:16:33 2009 Reason: Dependency svc:/application/management/sunmcagent:default is absent. See: http://sun.com/msg/SMF-8000-E2 Impact: This service is not running. estibi at n03 ~> tail -30 /var/svc/log/system-cluster-scdpm:default.log [ Jan 18 12:21:45 Executing stop method (:kill). ] [ Jan 18 12:21:46 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:46 Method "start" exited with status 0. ] [ Jan 18 12:21:47 Stopping because all processes in service exited. ] [ Jan 18 12:21:47 Executing stop method (:kill). ] [ Jan 18 12:21:47 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:47 Method "start" exited with status 0. ] [ Jan 18 12:21:47 Stopping because all processes in service exited. ] [ Jan 18 12:21:48 Executing stop method (:kill). ] [ Jan 18 12:21:48 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:48 Method "start" exited with status 0. ] [ Jan 18 12:21:48 Stopping because all processes in service exited. ] [ Jan 18 12:21:48 Executing stop method (:kill). ] [ Jan 18 12:21:48 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:49 Method "start" exited with status 0. ] [ Jan 18 12:21:49 Stopping because all processes in service exited. ] [ Jan 18 12:21:49 Executing stop method (:kill). ] [ Jan 18 12:21:49 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:50 Method "start" exited with status 0. ] [ Jan 18 12:21:50 Stopping because all processes in service exited. ] [ Jan 18 12:21:50 Executing stop method (:kill). ] [ Jan 18 12:21:50 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:50 Method "start" exited with status 0. ] [ Jan 18 12:21:50 Stopping because all processes in service exited. ] [ Jan 18 12:21:50 Executing stop method (:kill). ] [ Jan 18 12:21:50 Executing start method ("/usr/cluster/lib/svc/method/svc_scdpm start"). ] [ Jan 18 12:21:51 Method "start" exited with status 0. ] [ Jan 18 12:21:51 Stopping because all processes in service exited. ] [ Jan 18 12:21:51 Executing stop method (:kill). ] [ Jan 18 12:21:51 Restarting too quickly, changing state to maintenance. ] estibi at n03 ~> . /lib/svc/share/smf_include.sh estibi at n03 ~> estibi at n03 ~> LIBSCDIR=/usr/cluster/lib/sc estibi at n03 ~> USRBIN=/usr/bin estibi at n03 ~> SERVER=scdpmd estibi at n03 ~> SCLIB=/usr/cluster/lib/sc estibi at n03 ~> estibi at n03 ~> /usr/sbin/clinfo ; echo $? 0 estibi at n03 ~> ${LIBSCDIR}/${SERVER} estibi at n03 ~> echo $? 0 estibi at n03 ~> estibi at n03 ~> truss -f ${LIBSCDIR}/${SERVER} 2> scdpmd.log estibi at n03 ~> -- Regards, Piotr Jasiukajtis | estibi | SCA OS0072 http://estseg.blogspot.com -------------- next part -------------- A non-text attachment was scrubbed... Name: scdpmd.log.bz2 Type: application/x-bzip Size: 3927 bytes Desc: not available URL: <http://mail.opensolaris.org/pipermail/ha-clusters-discuss/attachments/20090118/2fd5a5e3/attachment.bin>