This looks like the following bug:
*libpiclsnmp:snmp_init()* Blocks Indefinitely in *open()* on
primary Domain
*Bug ID 6736962:* Power Management sometimes fails to retrieve policy
from the service processor on LDoms startup after the control domain
boots. If CPU power management cannot retrieve the power management
policy from the service processor, LDoms starts up as expected, but
the error "Unable to get the initial PM Policy - timeout" is written to
the LDoms log and the system remains in performance mode.
Add forceload: drv/ds_snmp to /etc/system, then reboot the control domain.
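The workaround above could be applied with a small shell helper like the one below. This is only a sketch: the function name add_forceload is made up for illustration, and it takes the file path as an argument so it can be tried on a copy before touching the real /etc/system.

```shell
# Hypothetical helper (not part of Solaris): append the forceload line
# from the workaround to the given file unless it is already present.
add_forceload() {
    file="$1"
    line='forceload: drv/ds_snmp'
    # Plain grep with output discarded, so it also works with greps lacking -q.
    if grep "^${line}\$" "$file" >/dev/null 2>&1; then
        return 0
    fi
    printf '%s\n' "$line" >> "$file"
}

# Example (as root, after backing up the file):
#   cp /etc/system /etc/system.orig
#   add_forceload /etc/system
```

After the reboot, 'modinfo | grep ds_snmp' should show whether the module was actually loaded.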
There were a ton of messages in the ldmd log about pmi timeouts.
The customer added the forceload line but was still having issues.
modinfo confirmed that ds_snmp was loaded, but the problem persisted.
They then made a couple of changes via the BUI on the SP and it
magically started to work. We are not sure why.
One was to make sure the DNS entry was set correctly on the SP as it was
never reset after a static address was used for the SP.
The other was a change to the syslog IP, which was blank; they set it
to 0.0.0.0. Both of these settings were copied from another
5440 that was working OK.
Not sure if there were any other changes.
Regards
Gary
On 4/13/2010 8:28 AM, Gary Andresen wrote:
Thanks.
So 4 GB is still recommended for ZFS then. I thought 2 GB was
adequate nowadays.
I'll give it a tweak.
Regards
Gary
On 4/12/2010 9:59 PM, Octave Orgeron wrote:
Hi,
If you are using ZFS as the file system in the control domain, I
would allocate at least 4GB of memory to allow enough space for the
ZFS ARC. There is also a tunable to control the size of the ARC; take
a look at the ZFS guide on solarisinternals.com. Another possible
area is the network settings in the link aggregation on the server or
switch side.
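The ARC tunable mentioned above is, as far as I know, zfs_arc_max, set in /etc/system as a byte count. That name does not come from this thread, so please check it against the ZFS guide for your release. A minimal sketch that prints the line for a cap given in GB (the helper name arc_max_line is hypothetical, and it assumes a POSIX shell such as ksh or bash rather than the old /bin/sh):

```shell
# Hypothetical helper: print an /etc/system line capping the ZFS ARC.
# Assumption: the tunable is zfs:zfs_arc_max and takes a value in bytes.
arc_max_line() {
    gb="$1"
    # Convert GB to bytes with shell arithmetic.
    bytes=$((gb * 1024 * 1024 * 1024))
    printf 'set zfs:zfs_arc_max = %d\n' "$bytes"
}

# Example: print the line for a 1 GB cap, review it, then add it to
# /etc/system by hand and reboot.
#   arc_max_line 1
```

Printing the line rather than editing the file keeps the step reviewable, since a bad /etc/system entry can affect the next boot.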
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
Octave J. Orgeron
Solaris Virtualization Architect and Consultant
Web: http://unixconsole.blogspot.com
E-Mail: [email protected]
*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*-*
----- Original Message ----
From: Gary Andresen<[email protected]>
To: [email protected]
Sent: Mon, April 12, 2010 9:32:41 PM
Subject: [ldoms-discuss] ldm list -l primary takes ~30 secs to return
Still trying to track down what's happening at a customer site. Right
now it is a T5440 box with 2 CPUs and 32 GB of memory. The control
domain is set to 1 core (8 threads), 2 GB of memory and 1 MAU, running
Solaris 10 U8 with the latest patch cluster and the latest firmware
(139446-10). The boot disk is a mirrored ZFS pool.
The network is an aggregate (aggr1) of nxge0 and 4, I believe, and vsw0
was created by
    ldm add-vsw net-dev=aggr1 primary-vsw0 primary
Then aggr1 was unplumbed and vsw0 plumbed in its place.
All seemed to be going well, but
running 'ldm list -l primary' takes 20-30 secs to print out its
data. No errors that they can see (dmesg, /var/adm/messages).
Missing patch? Ideas?
Gary
_______________________________________________
ldoms-discuss mailing list
[email protected]
http://mail.opensolaris.org/mailman/listinfo/ldoms-discuss