Greetings,

I'm having problems with Nevada on xVM.  Looking back, I believe I've had the 
problem since the snv_79a/SXDE 01/08 release, but I didn't really dig into it 
until now.  Specifically, I got very interested in Live Upgrade once ZFS root 
became available in snv_90.

I had hacked together a number of workarounds, thinking I had bitten off a new 
motherboard/CPU combo that wasn't quite stable.  But I've convinced myself 
that is not the problem, and the motherboard has been fully patched.  I only 
dug into things because the problem I'll describe prevents luupgrade from 
completing.

The problem manifests itself under a variety of circumstances, and at first I 
didn't associate it with Xen.  A system under moderate load can no longer 
fork(), and sends no errors to syslog or stderr.

Several observations attest to this:
    'ntpq -p xenhost' from another system shows reasonable jitter,
    ssh to the box produces a "password:" prompt (PAM interactive login), but 
      no shell, and
    a console login accepts the userid, but never prompts for a password.
All indicative of a system that is running but cannot fork().
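
For anyone who wants to check the "cannot fork()" theory directly rather than 
infer it from ssh and console behaviour, here is a minimal sketch of a fork 
probe.  It is not something I ran as part of the tests described here, and it 
assumes Python 3 is available on the Dom0.  The names probe_fork and run are 
made up for illustration.  It logs a timestamp before every fork() so that, if 
fork() itself never returns, the last line printed shows roughly when the hang 
began.

```python
import os
import time


def probe_fork(timeout_s=5.0):
    """Attempt one fork() and wait for the child to exit.

    Returns (status, seconds), where status is 'ok', 'timeout',
    or 'error: <reason>'. Hypothetical helper for illustration only.
    """
    start = time.monotonic()
    try:
        pid = os.fork()
    except OSError as exc:
        # fork() failed outright (e.g. EAGAIN or ENOMEM).
        return "error: %s" % exc.strerror, time.monotonic() - start

    if pid == 0:
        # Child: exit immediately, skipping interpreter cleanup.
        os._exit(0)

    # Parent: poll for the child without blocking, up to the deadline.
    deadline = start + timeout_s
    while True:
        done_pid, wait_status = os.waitpid(pid, os.WNOHANG)
        if done_pid == pid:
            if wait_status == 0:
                status = "ok"
            else:
                status = "error: child status %d" % wait_status
            return status, time.monotonic() - start
        if time.monotonic() >= deadline:
            # The child is left unreaped here; acceptable for a probe.
            return "timeout", time.monotonic() - start
        time.sleep(0.01)


def run(count=3, interval_s=1.0):
    """Run the probe `count` times, logging before and after each fork."""
    results = []
    for _ in range(count):
        # Log first: if fork() never returns, this is the last line seen.
        print(time.strftime("%Y-%m-%d %H:%M:%S"), "forking...", flush=True)
        status, elapsed = probe_fork()
        print("  result: %s (%.3fs)" % (status, elapsed), flush=True)
        results.append((status, elapsed))
        time.sleep(interval_s)
    return results
```

Calling run(count=1000, interval_s=10) from a console session started before 
the load test, with output redirected to a file, would give a timeline to 
compare against the luupgrade run.  One caveat: if the kernel blocks fork() 
uninterruptibly, the probe hangs too, and the timeout only catches a child 
that was created but never exits.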

System in question is an ASUS M3A78-EMH + AMD Phenom 9600 + 8 GBytes RAM.

After flailing about a bit, I backed off: I installed snv_91 (some of its 
release notes looked like they might apply), reset the BIOS to factory 
defaults, and began a one-change-per-test debug cycle on the snv_91 install.

The test operation was luupgrade, using a local .iso image and a ZFS clone BE.

The only common factor associated with the failure was running Solaris as Dom0. 
Solaris on bare metal does not exhibit the problem (though I can't say for 
certain for snv_79a, snv_86, or snv_90, as I wasn't as rigorous in testing 
those).

luupgrade completes (as do a number of large compiles, etc.) only when 
Solaris runs on the bare metal, regardless of SVM, ECC, ACPI, or other settings.

I can't find a report of the same problem, though this thread may be close:
http://www.opensolaris.org/jive/search.jspa?q=fork%20hangs&objID=f53&dateRange=thisyear&searchID=1108736&forumID=53&rankBy=10001&threadID=58022

I don't grok the Xen code at all, so I'm not going to go diving for the fix, 
but if this rings a bell, and someone wants me to try something, I'll be happy 
to.  The system is my test system after all.

Cheers!
-sam
 
 
This message posted from opensolaris.org
_______________________________________________
xen-discuss mailing list