Re: [zfs-discuss] Solaris 11 System Reboots Continuously Because of a ZFS-Related Panic (7191375)

2012-12-12 Thread Tomas Forsman
On 12 December, 2012 - Thomas Nau sent me these 7,3K bytes:

> Jamie
> We ran into the same problem and had to migrate the pool while it was
> imported read-only. On top of that, we were advised NOT to use an L2ARC.
> Maybe you should consider that as well.

We also ran into something similar; we imported the pool read-only and
created a new pool. A few months later, we ran into an L2ARC bug (15809921),
for which we've received an IDR that we have not applied yet.

This bug caused the following:
errors: Permanent errors have been detected in the following files:

:<0x132c1f>

on a 3x3 mirrored pool (triple-mirroring), all 9 disks had checksum
errors.

> Thomas
> 
> 
> On 12.12.2012 at 19:21, Jamie Krier wrote:
> 
> > I've hit this bug on four of my Solaris 11 servers. Looking for anyone else 
> > who has seen it, as well as comments/speculation on cause.  
> > 
> > This bug is pretty bad.  If you are lucky you can import the pool read-only 
> > and migrate it elsewhere.  
> > 
> > I've also tried setting zfs:zfs_recover=1,aok=1 with varying results.
> > 
> > 
> > 
> > http://docs.oracle.com/cd/E26502_01/html/E28978/gmkgj.html#scrolltoc
> > 
> > 
> > 
> > Hardware platform:
> >
> > Supermicro X8DAH
> > 144GB RAM
> > Supermicro SAS2 JBODs
> > LSI 9200-8e controllers (Phase 13 fw)
> > ZeusRAM log
> > ZeusIOPS SAS L2ARC
> > Seagate ST33000650SS SAS drives
> > 
> > 
> > 
> > All four servers are running the same hardware, so at first I suspected a 
> > problem there.  I opened a ticket with Oracle which ended with this email:
> > 
> > -
> >
> > We strongly expect that this is a software issue because this problem does
> > not happen on Solaris 10. On Solaris 11, it happens with both the SPARC and
> > the X64 versions of Solaris.
> >
> > We have quite a few customers who have seen this issue and we are in the
> > process of working on a fix. Because we do not know the source of the
> > problem yet, I cannot speculate on the time to fix. This particular portion
> > of Solaris 11 (the virtual memory sub-system) is quite different than in
> > Solaris 10. We re-wrote the memory management in order to get ready for
> > systems with much more memory than Solaris 10 was designed to handle.
> >
> > Because this is the memory management system, there is not expected to be
> > any work-around.
> >
> > Depending on your company's requirements, one possibility is to use Solaris
> > 10 until this issue is resolved.
> >
> > I apologize for any inconvenience that this bug may cause. We are working
> > on it as a Sev 1 Priority1 in sustaining engineering.
> >
> > -
> > 
> > 
> > 
> > I am thinking about switching to an Illumos distro, but wondering if this 
> > problem may be present there as well. 
> > 
> > 
> > 
> > Thanks
> > 
> > 
> > 
> > - Jamie
> > 
> > ___
> > zfs-discuss mailing list
> > zfs-discuss@opensolaris.org
> > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss




/Tomas
-- 
Tomas Forsman, st...@acc.umu.se, http://www.acc.umu.se/~stric/
|- Student at Computing Science, University of Umeå
`- Sysadmin at {cs,acc}.umu.se


Re: [zfs-discuss] Solaris 11 System Reboots Continuously Because of a ZFS-Related Panic (7191375)

2012-12-12 Thread Thomas Nau
Jamie
We ran into the same problem and had to migrate the pool while it was imported
read-only. On top of that, we were advised NOT to use an L2ARC. Maybe you
should consider that as well.

Thomas


On 12.12.2012 at 19:21, Jamie Krier wrote:

> I've hit this bug on four of my Solaris 11 servers. Looking for anyone else 
> who has seen it, as well as comments/speculation on cause.  
> 
> This bug is pretty bad.  If you are lucky you can import the pool read-only 
> and migrate it elsewhere.  
> 
> I've also tried setting zfs:zfs_recover=1,aok=1 with varying results.
> 
> 
> 
> http://docs.oracle.com/cd/E26502_01/html/E28978/gmkgj.html#scrolltoc
> 
> 
> 
> Hardware platform:
>
> Supermicro X8DAH
> 144GB RAM
> Supermicro SAS2 JBODs
> LSI 9200-8e controllers (Phase 13 fw)
> ZeusRAM log
> ZeusIOPS SAS L2ARC
> Seagate ST33000650SS SAS drives
> 
> 
> 
> All four servers are running the same hardware, so at first I suspected a 
> problem there.  I opened a ticket with Oracle which ended with this email:
> 
> -
>
> We strongly expect that this is a software issue because this problem does
> not happen on Solaris 10. On Solaris 11, it happens with both the SPARC and
> the X64 versions of Solaris.
>
> We have quite a few customers who have seen this issue and we are in the
> process of working on a fix. Because we do not know the source of the
> problem yet, I cannot speculate on the time to fix. This particular portion
> of Solaris 11 (the virtual memory sub-system) is quite different than in
> Solaris 10. We re-wrote the memory management in order to get ready for
> systems with much more memory than Solaris 10 was designed to handle.
>
> Because this is the memory management system, there is not expected to be
> any work-around.
>
> Depending on your company's requirements, one possibility is to use Solaris
> 10 until this issue is resolved.
>
> I apologize for any inconvenience that this bug may cause. We are working
> on it as a Sev 1 Priority1 in sustaining engineering.
>
> -
> 
> 
> 
> I am thinking about switching to an Illumos distro, but wondering if this 
> problem may be present there as well. 
> 
> 
> 
> Thanks
> 
> 
> 
> - Jamie
> 


Re: [zfs-discuss] ZFS array on marvell88sx in Solaris 11.1

2012-12-12 Thread Bob Friesenhahn

On Wed, 12 Dec 2012, sol wrote:

> Thanks for the reply.
> I've just tried openindiana and it behaves identically -
> disks attached to the mv88sx6081 don't show up as disks.
> (and APIC error interrupt (status0=0, status1=40) is emitted at boot.)
>
> I've tried some changes to /etc/system with no success
> (sata_func_enable=0x5, ahci_msi_enabled=0, sata_max_queue_depth=1)
>
> Is there anything else I can try?


If the SATA card you are using is a JBOD-style card (i.e. disks are 
portable to a different controller), are you able/willing to swap it 
for one that Solaris is known to support well?


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer, http://www.GraphicsMagick.org/


[zfs-discuss] Solaris 11 System Reboots Continuously Because of a ZFS-Related Panic (7191375)

2012-12-12 Thread Jamie Krier
I've hit this bug on four of my Solaris 11 servers. Looking for anyone else
who has seen it, as well as comments/speculation on cause.

This bug is pretty bad.  If you are lucky you can import the pool read-only
and migrate it elsewhere.
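
A rough sketch of what "import read-only and migrate" could look like, written as a dry run that only prints commands. The pool names `tank` and `newtank` and the snapshot name `before-panic` are hypothetical, and the choice of `zfs send`/`zfs receive` is an assumption (nobody in this thread says how they copied the data). A pool imported read-only cannot take new snapshots, so this assumes a recursive snapshot already exists; if it does not, or if `zfs send` does not work from a read-only import on a given release, a file-level copy would be needed instead.

```python
import shlex


def readonly_import_cmd(pool):
    """Command to import a pool without allowing any writes to it."""
    return ["zpool", "import", "-o", "readonly=on", pool]


def migration_plan(src_pool, snapshot, dst_pool):
    """Return shell command lines for a read-only import followed by a copy.

    Assumes `snapshot` already exists on `src_pool`, because a pool that is
    imported read-only cannot take new snapshots.
    """
    q = shlex.quote
    return [
        " ".join(q(arg) for arg in readonly_import_cmd(src_pool)),
        # Replicate the existing recursive snapshot into the new pool.
        f"zfs send -R {q(src_pool + '@' + snapshot)} | zfs receive -F {q(dst_pool)}",
    ]


if __name__ == "__main__":
    # Dry run: print the plan instead of executing anything.
    for line in migration_plan("tank", "before-panic", "newtank"):
        print(line)
```

Printing the plan first keeps the sketch safe to run anywhere; the commands would still need to be reviewed and run by hand on the affected server.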

I've also tried setting zfs:zfs_recover=1,aok=1 with varying results.
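
For reference, these tunables are normally set in /etc/system and take effect at the next boot. The small helper below is only an illustration (the function name is made up) of how the two settings would be written there: `zfs_recover` belongs to the `zfs` module, while `aok` is a core kernel variable and takes no module prefix. Both are last-resort recovery settings and are usually removed again once the pool has been rescued.

```python
def etc_system_lines(tunables):
    """Render {(module, variable): value} as /etc/system `set` lines.

    A module of None means the variable lives in the core kernel and is
    written without a `module:` prefix.
    """
    lines = []
    for (module, variable), value in tunables.items():
        name = f"{module}:{variable}" if module else variable
        lines.append(f"set {name}={value}")
    return lines


# The two recovery tunables mentioned above.
recovery = {("zfs", "zfs_recover"): 1, (None, "aok"): 1}
print("\n".join(etc_system_lines(recovery)))
```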


http://docs.oracle.com/cd/E26502_01/html/E28978/gmkgj.html#scrolltoc


Hardware platform:

Supermicro X8DAH
144GB RAM
Supermicro SAS2 JBODs
LSI 9200-8e controllers (Phase 13 fw)
ZeusRAM log
ZeusIOPS SAS L2ARC
Seagate ST33000650SS SAS drives


All four servers are running the same hardware, so at first I suspected a
problem there.  I opened a ticket with Oracle which ended with this email:

-

We strongly expect that this is a software issue because this problem does
not happen on Solaris 10. On Solaris 11, it happens with both the SPARC and
the X64 versions of Solaris.

We have quite a few customers who have seen this issue and we are in the
process of working on a fix. Because we do not know the source of the
problem yet, I cannot speculate on the time to fix. This particular portion
of Solaris 11 (the virtual memory sub-system) is quite different than in
Solaris 10. We re-wrote the memory management in order to get ready for
systems with much more memory than Solaris 10 was designed to handle.

Because this is the memory management system, there is not expected to be
any work-around.

Depending on your company's requirements, one possibility is to use Solaris
10 until this issue is resolved.

I apologize for any inconvenience that this bug may cause. We are working
on it as a Sev 1 Priority1 in sustaining engineering.

-


I am thinking about switching to an Illumos distro, but wondering if this
problem may be present there as well.


Thanks


- Jamie


Re: [zfs-discuss] ZFS array on marvell88sx in Solaris 11.1

2012-12-12 Thread sol
Some more information about the system:

Solaris 11.1 with latest updates (assembled 19 Sep 2012), amd64
The card is vendor 0x11ab device 0x6081
Marvell Technology Group Ltd. MV88SX6081 8-port SATA II PCI-X Controller
 CardVendor 0x11ab card 0x11ab (Marvell Technology Group Ltd., Card unknown)

  STATUS    0x02b8  COMMAND 0x0007
  CLASS     0x01 0x00 0x00  REVISION 0x09
  BIST      0x00  HEADER 0x00  LATENCY 0x20  CACHE 0x10
  BASE0     0xfac0 SIZE 1048576  MEM64
  BASE2     0xc400 SIZE 256  I/O
  BASEROM   0x  addr 0x
  MAX_LAT   0x00  MIN_GNT 0x00  INT_PIN 0x01  INT_LINE 0x0b
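
One way to check whether a driver claims this PCI ID is to look for the matching alias in /etc/driver_aliases. The sketch below assumes the usual `pciVENDOR,DEVICE` alias form (lowercase hex, no leading zeros). The sample file contents are illustrative only and were not taken from this system, and the helper names are made up for the example.

```python
def pci_alias(vendor_id, device_id):
    """Build the `pciVVVV,DDDD` alias string used in /etc/driver_aliases."""
    return f"pci{vendor_id:x},{device_id:x}"


def drivers_for_alias(aliases_text, alias):
    """Return the drivers whose entries in driver_aliases name `alias`."""
    drivers = []
    for line in aliases_text.splitlines():
        fields = line.split()
        # Each entry has the form: driver "alias"
        if len(fields) == 2 and fields[1].strip('"') == alias:
            drivers.append(fields[0])
    return drivers


# Illustrative sample only; on a real system read /etc/driver_aliases instead.
SAMPLE = '''
ahci "pciclass,010601"
marvell88sx "pci11ab,6081"
'''

alias = pci_alias(0x11AB, 0x6081)
print(alias, drivers_for_alias(SAMPLE, alias))
```

If no driver lists the alias, the card has nothing to bind to; if one does, the problem is more likely in the driver itself or in interrupt setup.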


I've forgotten where it hung when looking at verbose boot output
(although it hung at a couple of different points)
so I'll post that next time it hangs.




Re: [zfs-discuss] ZFS array on marvell88sx in Solaris 11.1

2012-12-12 Thread sol
Thanks for the reply.
I've just tried openindiana and it behaves identically -
disks attached to the mv88sx6081 don't show up as disks.
(and APIC error interrupt (status0=0, status1=40) is emitted at boot.)

I've tried some changes to /etc/system with no success
(sata_func_enable=0x5, ahci_msi_enabled=0, sata_max_queue_depth=1)

Is there anything else I can try?


> From: Bob Friesenhahn
> To: sol
> Cc: "zfs-discuss@opensolaris.org"
> Sent: Wednesday, 12 December 2012, 14:49
> Subject: Re: [zfs-discuss] ZFS array on marvell88sx in Solaris 11.1
>
> On Wed, 12 Dec 2012, sol wrote:
>
> > Hello
> >
> > I've got a ZFS box running perfectly with an 8-port SATA card
> > using the marvell88sx driver in opensolaris-2009.
> >
> > However when I try to run Solaris-11 it won't boot.
> > If I unplug some of the hard disks it might boot
> > but then none of them show up in 'format'
> > and none of them have configured status in 'cfgadm'
> > (and there's an error or hang if I try to configure them).
> >
> > Does anyone have any suggestions how to solve the problem?
>
> Since you were previously using opensolaris-2009, have you considered trying
> OpenIndiana oi_151a7 instead?  You could experiment by booting from the live
> CD and seeing if your disks show up.
>


Re: [zfs-discuss] ZFS array on marvell88sx in Solaris 11.1

2012-12-12 Thread Bob Friesenhahn

On Wed, 12 Dec 2012, sol wrote:

> Hello
>
> I've got a ZFS box running perfectly with an 8-port SATA card
> using the marvell88sx driver in opensolaris-2009.
>
> However when I try to run Solaris-11 it won't boot.
> If I unplug some of the hard disks it might boot
> but then none of them show up in 'format'
> and none of them have configured status in 'cfgadm'
> (and there's an error or hang if I try to configure them).
>
> Does anyone have any suggestions how to solve the problem?


Since you were previously using opensolaris-2009, have you considered 
trying OpenIndiana oi_151a7 instead?  You could experiment by booting 
from the live CD and seeing if your disks show up.


Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer, http://www.GraphicsMagick.org/


[zfs-discuss] ZFS array on marvell88sx in Solaris 11.1

2012-12-12 Thread sol
Hello

I've got a ZFS box running perfectly with an 8-port SATA card
using the marvell88sx driver in opensolaris-2009.

However when I try to run Solaris-11 it won't boot.
If I unplug some of the hard disks it might boot
but then none of them show up in 'format'
and none of them have configured status in 'cfgadm'
(and there's an error or hang if I try to configure them).
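
To make the 'cfgadm' symptom concrete, here is a small sketch that picks out SATA ports where a disk is connected but not configured. It assumes the usual five columns of `cfgadm -al` output (Ap_Id, Type, Receptacle, Occupant, Condition). The sample text is illustrative and was not captured from this machine, and the function name is made up for the example.

```python
def connected_but_unconfigured(cfgadm_output):
    """List SATA attachment points with a disk present but not configured."""
    ports = []
    for line in cfgadm_output.splitlines():
        fields = line.split()
        # Expected columns: Ap_Id Type Receptacle Occupant Condition
        if len(fields) != 5 or not fields[0].startswith("sata"):
            continue
        ap_id, _type, receptacle, occupant, _condition = fields
        if receptacle == "connected" and occupant != "configured":
            ports.append(ap_id)
    return ports


# Illustrative `cfgadm -al` output; not captured from the system in this thread.
SAMPLE = """\
Ap_Id                          Type         Receptacle   Occupant     Condition
sata0/0::dsk/c7t0d0            disk         connected    configured   ok
sata0/1                        disk         connected    unconfigured unknown
sata0/2                        sata-port    empty        unconfigured ok
"""

print(connected_but_unconfigured(SAMPLE))
```

Each port it reports would normally be brought online with `cfgadm -c configure <Ap_Id>`, which is the step that errors or hangs here.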

Does anyone have any suggestions how to solve the problem?

Thanks!