Hi,

I know it's not tested...

My plan was to install SCX on single machine in xVM.
My dom0 is SXCE105.
I installed 3 pvm domU:
- n01 - sxce101
- n02 - sxce101a
- n03 - sxce101a

The 'sponsoring' node is n03.

But there is a problem because n02 cannot communicate with n03.

I don?t have an idea, just take a look:


It?s on n02:

  >>> Cluster Name <<<

    Each cluster has a name assigned to it. When adding a node to the
    cluster, you must identify the name of the cluster you are attempting
    to join. A sanity check is performed to verify that the "sponsoring"
    node is a member of that cluster.

    What is the name of the cluster you want to join?  vencl

    Attempting to contact "n03" ... timed out

Unable to contact "n03" at this time.

    Do you want to try again (yes/no) [yes]?

    Attempting to contact "n03" ... timed out

Unable to contact "n03" at this time.

    Do you want to try again (yes/no) [yes]?

    Attempting to contact "n03" ... timed out

Unable to contact "n03" at this time.

    Do you want to try again (yes/no) [yes]?






estibi at n03 ~> pfexec snoop -v -d xnf0 'host n02'
Using device xnf0 (promiscuous mode)
ETHER:  ----- Ether Header -----
ETHER:
ETHER:  Packet 1 arrived at 15:07:6.61877
ETHER:  Packet size = 98 bytes
ETHER:  Destination = 0:16:3e:9:6e:b2,
ETHER:  Source      = 0:16:3e:42:20:11,
ETHER:  Ethertype = 0800 (IP)
ETHER:
IP:   ----- IP Header -----
IP:
IP:   Version = 4
IP:   Header length = 20 bytes
IP:   Type of service = 0x00
IP:         xxx. .... = 0 (precedence)
IP:         ...0 .... = normal delay
IP:         .... 0... = normal throughput
IP:         .... .0.. = normal reliability
IP:         .... ..0. = not ECN capable transport
IP:         .... ...0 = no ECN congestion experienced
IP:   Total length = 84 bytes
IP:   Identification = 27545
IP:   Flags = 0x4
IP:         .1.. .... = do not fragment
IP:         ..0. .... = last fragment
IP:   Fragment offset = 0 bytes
IP:   Time to live = 255 seconds/hops
IP:   Protocol = 17 (UDP)
IP:   Header checksum = 8cdd
IP:   Source address = 192.168.0.232, n02.local
IP:   Destination address = 192.168.0.233, n03.local
IP:   No options
IP:
UDP:  ----- UDP Header -----
UDP:
UDP:  Source port = 47601
UDP:  Destination port = 111 (Sun RPC)
UDP:  Length = 64
UDP:  Checksum = 0322
UDP:
RPC:  ----- SUN RPC Header -----
RPC:
RPC:  Transaction id = 1232758637
RPC:  Type = 0 (Call)
RPC:  RPC version = 2
RPC:  Program = 100000 (PMAP), version = 2, procedure = 3
RPC:  Credentials: Flavor = 0 (None), len = 0 bytes
RPC:  Verifier   : Flavor = 0 (None), len = 0 bytes
RPC:
PMAP:  ----- Portmapper -----
PMAP:
PMAP:  Proc = 3 (Get port number)
PMAP:  Program = 100145 (?)
PMAP:  Version = 1
PMAP:  Protocol = 6 (TCP)
PMAP:

ETHER:  ----- Ether Header -----
ETHER:
ETHER:  Packet 2 arrived at 15:07:6.61913
ETHER:  Packet size = 62 bytes
ETHER:  Destination = 0:16:3e:42:20:11,
ETHER:  Source      = 0:16:3e:9:6e:b2,
ETHER:  Ethertype = 0800 (IP)
ETHER:
IP:   ----- IP Header -----
IP:
IP:   Version = 4
IP:   Header length = 20 bytes
IP:   Type of service = 0x00
IP:         xxx. .... = 0 (precedence)
IP:         ...0 .... = normal delay
IP:         .... 0... = normal throughput
IP:         .... .0.. = normal reliability
IP:         .... ..0. = not ECN capable transport
IP:         .... ...0 = no ECN congestion experienced
IP:   Total length = 48 bytes
IP:   Identification = 46953
IP:   Flags = 0x4
IP:         .1.. .... = do not fragment
IP:         ..0. .... = last fragment
IP:   Fragment offset = 0 bytes
IP:   Time to live = 255 seconds/hops
IP:   Protocol = 17 (UDP)
IP:   Header checksum = 4131
IP:   Source address = 192.168.0.233, n03.local
IP:   Destination address = 192.168.0.232, n02.local
IP:   No options
IP:
UDP:  ----- UDP Header -----
UDP:
UDP:  Source port = 111
UDP:  Destination port = 47601 (Sun RPC)
UDP:  Length = 28
UDP:  Checksum = 0000 (no checksum)
UDP:
RPC:  ----- SUN RPC Header -----
RPC:
RPC:  Transaction id = 1232758637
RPC:  Type = 1 (Reply)
RPC:  This is a reply to frame 1
RPC:  Status = 1 (Denied)
RPC:  Reject status = 1 (can't authenticate)
RPC:     Why = 7 (unknown reason)








estibi at n03 ~> netstat -af inet

UDP: IPv4
   Local Address        Remote Address      State
-------------------- -------------------- ----------
      *.*                                 Unbound
      *.*                                 Unbound
      *.*                                 Unbound
      *.sunrpc                            Idle
      *.*                                 Unbound
      *.40547                             Idle
      *.sunrpc                            Idle
      *.*                                 Unbound
      *.42496                             Idle
      *.51891                             Idle
      *.50997                             Idle
      *.mdns                              Idle
      *.ntp                               Idle
localhost.ntp                             Idle
n03.local.ntp                             Idle
clusternode1-priv.ntp                      Idle
      *.11161                             Idle

TCP: IPv4
   Local Address        Remote Address    Swind Send-Q Rwind Recv-Q    State
-------------------- -------------------- ----- ------ ----- ------
-----------
      *.*                  *.*                0      0 49152      0 IDLE
localhost.5999             *.*                0      0 49152      0 LISTEN
      *.scqsd              *.*                0      0 49152      0 LISTEN
      *.scqsd              *.*                0      0 49152      0 LISTEN
      *.*                  *.*                0      0 49152      0 IDLE
      *.sunrpc             *.*                0      0 49152      0 LISTEN
      *.*                  *.*                0      0 49152      0 IDLE
      *.sunrpc             *.*                0      0 49152      0 LISTEN
      *.*                  *.*                0      0 49152      0 IDLE
      *.ssh                *.*                0      0 49152      0 LISTEN
localhost.smtp             *.*                0      0 49152      0 LISTEN
localhost.submission       *.*                0      0 49152      0 LISTEN
localhost.5987             *.*                0      0 49152      0 LISTEN
localhost.898              *.*                0      0 49152      0 LISTEN
localhost.61180            *.*                0      0 49152      0 LISTEN
localhost.5988             *.*                0      0 49152      0 LISTEN
localhost.56361            *.*                0      0 49152      0 LISTEN
      *.64437              *.*                0      0 49152      0 LISTEN
      *.62590              *.*                0      0 49152      0 LISTEN
      *.56842              *.*                0      0 49152      0 LISTEN
      *.54907              *.*                0      0 49152      0 LISTEN
      *.sccheckd           *.*                0      0 49152      0 LISTEN
      *.44383              *.*                0      0 49152      0 LISTEN
      *.51586              *.*                0      0 49152      0 LISTEN
      *.45459              *.*                0      0 49152      0 BOUND
localhost.6788             *.*                0      0 49152      0 LISTEN
localhost.6789             *.*                0      0 49152      0 LISTEN
localhost.45970            *.*                0      0 49152      0 LISTEN
n03.local.ssh        192.168.0.9.37828    49640     47 49640      0
ESTABLISHED
localhost.6010             *.*                0      0 49152      0 LISTEN
      *.40373              *.*                0      0 49152      0 LISTEN
      *.35696              *.*                0      0 49152      0 LISTEN
      *.pnmd               *.*                0      0 49152      0 LISTEN
      *.35135              *.*                0      0 49152      0 BOUND
      *.54599              *.*                0      0 49152      0 LISTEN
      *.11165              *.*                0      0 49152      0 LISTEN
      *.48888              *.*                0      0 49152      0 LISTEN
      *.50908              *.*                0      0 49152      0 LISTEN
      *.*                  *.*                0      0 49152      0 IDLE
      *.11164              *.*                0      0 49152      0 LISTEN
      *.11163              *.*                0      0 49152      0 LISTEN
      *.*                  *.*                0      0 49152      0 IDLE
      *.11162              *.*                0      0 49152      0 LISTEN

SCTP:
        Local Address                   Remote Address          Swind
Send-Q Rwind  Recv-Q StrsI/O  State
------------------------------- ------------------------------- ------
------ ------ ------ ------- -----------
0.0.0.0                         0.0.0.0                              0
    0 102400      0  32/32  CLOSED
estibi at n03 ~>




estibi at n03 ~> svcs -xv
svc:/system/cluster/scdpm:default (Sun Cluster Disk Path Monitoring Daemon)
 State: maintenance since Sun Jan 18 12:21:51 2009
Reason: Restarting too quickly.
   See: http://sun.com/msg/SMF-8000-L5
   See: man -M /usr/cluster/man -s 1M scdpm
   See: /var/svc/log/system-cluster-scdpm:default.log
Impact: This service is not running.

svc:/system/cluster/scsymon-srv:default (Sun Cluster SyMON Server Daemon)
 State: offline since Sun Jan 18 12:16:33 2009
Reason: Dependency svc:/application/management/sunmcagent:default is absent.
   See: http://sun.com/msg/SMF-8000-E2
Impact: This service is not running.


estibi at n03 ~> tail -30 /var/svc/log/system-cluster-scdpm:default.log
[ Jan 18 12:21:45 Executing stop method (:kill). ]
[ Jan 18 12:21:46 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:46 Method "start" exited with status 0. ]
[ Jan 18 12:21:47 Stopping because all processes in service exited. ]
[ Jan 18 12:21:47 Executing stop method (:kill). ]
[ Jan 18 12:21:47 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:47 Method "start" exited with status 0. ]
[ Jan 18 12:21:47 Stopping because all processes in service exited. ]
[ Jan 18 12:21:48 Executing stop method (:kill). ]
[ Jan 18 12:21:48 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:48 Method "start" exited with status 0. ]
[ Jan 18 12:21:48 Stopping because all processes in service exited. ]
[ Jan 18 12:21:48 Executing stop method (:kill). ]
[ Jan 18 12:21:48 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:49 Method "start" exited with status 0. ]
[ Jan 18 12:21:49 Stopping because all processes in service exited. ]
[ Jan 18 12:21:49 Executing stop method (:kill). ]
[ Jan 18 12:21:49 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:50 Method "start" exited with status 0. ]
[ Jan 18 12:21:50 Stopping because all processes in service exited. ]
[ Jan 18 12:21:50 Executing stop method (:kill). ]
[ Jan 18 12:21:50 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:50 Method "start" exited with status 0. ]
[ Jan 18 12:21:50 Stopping because all processes in service exited. ]
[ Jan 18 12:21:50 Executing stop method (:kill). ]
[ Jan 18 12:21:50 Executing start method
("/usr/cluster/lib/svc/method/svc_scdpm start"). ]
[ Jan 18 12:21:51 Method "start" exited with status 0. ]
[ Jan 18 12:21:51 Stopping because all processes in service exited. ]
[ Jan 18 12:21:51 Executing stop method (:kill). ]
[ Jan 18 12:21:51 Restarting too quickly, changing state to maintenance. ]


estibi at n03 ~> . /lib/svc/share/smf_include.sh
estibi at n03 ~>
estibi at n03 ~> LIBSCDIR=/usr/cluster/lib/sc
estibi at n03 ~> USRBIN=/usr/bin
estibi at n03 ~> SERVER=scdpmd
estibi at n03 ~> SCLIB=/usr/cluster/lib/sc
estibi at n03 ~>
estibi at n03 ~> /usr/sbin/clinfo ; echo $?
0
estibi at n03 ~> ${LIBSCDIR}/${SERVER}
estibi at n03 ~> echo $?
0
estibi at n03 ~>



estibi at n03 ~> truss -f ${LIBSCDIR}/${SERVER} 2> scdpmd.log
estibi at n03 ~>



-- 
Regards,
Piotr Jasiukajtis | estibi | SCA OS0072
http://estseg.blogspot.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: scdpmd.log.bz2
Type: application/x-bzip
Size: 3927 bytes
Desc: not available
URL: 
<http://mail.opensolaris.org/pipermail/ha-clusters-discuss/attachments/20090118/2fd5a5e3/attachment.bin>

Reply via email to