Re: [smartos-discuss] panic: 'zones' appears to be hung

2017-04-06 Thread Michael Loftis
Following up on this after Dan flagged me in #SmartOS: update your
controller firmware to the latest version.  There's a bugfix for
controller hangs on cache flush for at least the H330, H730, H840,
FD33xS and FD33xD.  I don't have anything with the 700 series, but for
the x30/x40 controllers the changelog entry that appears to address
this issue is in 25.5.0.0019: "Corrected an issue where a specific
cache flush condition could cause the controller to hang."
http://www.dell.com/support/home/us/en/19/Drivers/DriversDetails?driverId=3X0XK
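
To check whether a given x30/x40 controller already carries that fix,
comparing the installed firmware version against 25.5.0.0019 may be
enough. A minimal sketch, assuming the version fields compare
numerically from left to right; the function names are made up for
illustration, and how you read the installed version off the
controller is left out:

```python
def parse_version(text):
    """Split a dotted version such as '25.5.0.0019' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def has_cache_flush_fix(installed, fixed_in="25.5.0.0019"):
    """Return True if `installed` is at or above the release with the fix.

    Assumption: fields compare numerically, left to right. This only makes
    sense for the x30/x40 firmware line named in the changelog entry.
    """
    return parse_version(installed) >= parse_version(fixed_in)


if __name__ == "__main__":
    # "25.4.0.0017" is a hypothetical older version, used only as an example.
    for version in ("25.4.0.0017", "25.5.0.0019"):
        print(version, has_cache_flush_fix(version))
```

This says nothing about the H700, which uses a different firmware
line.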

On Fri, Oct 14, 2016 at 2:06 PM, Alessio Ciregia wrote:
> I've performed a scrub on a server.
> The server automatically rebooted after some time.
> 
> Dmesg reports this message:
> 
> 2016-10-14T20:28:51.231384+00:00 iperione savecore: [ID 570001
> auth.error] reboot after panic: I/O to pool 'zones' appears to be hung.
> 
> After the reboot the scrub continues, but after half an hour the server
> reboots again, and keeps doing so until I stop the scrub process.
> 
> What about that? What can I do?
> 
> Some server characteristics are:
>   controller DELL-PERC H700
>   RAM 64GB
>   four 500GB disks configured as raidz1
> 
> SmartOS version is joyent_20161013T025521Z, but the same issue is
> present even when booting the previous version.
> 
> Thanks,
>   
> 



-- 

"Genius might be described as a supreme capacity for getting its possessors
into trouble of all kinds."
-- Samuel Butler


---
smartos-discuss
Archives: https://www.listbox.com/member/archive/184463/=now
RSS Feed: https://www.listbox.com/member/archive/rss/184463/25769125-55cfbc00
Modify Your Subscription: 
https://www.listbox.com/member/?member_id=25769125_secret=25769125-7688e9fb
Powered by Listbox: http://www.listbox.com


Re: [smartos-discuss] panic: 'zones' appears to be hung

2016-10-14 Thread Ian Collins

On 10/15/16 10:06 AM, Alessio Ciregia wrote:

> I've performed a scrub on a server.
> The server automatically rebooted after some time.
>
> Dmesg reports this message:
>
> 2016-10-14T20:28:51.231384+00:00 iperione savecore: [ID 570001
> auth.error] reboot after panic: I/O to pool 'zones' appears to be hung.


Are there any messages relating to disk timeouts and/or command retries 
in the logs before the panic?
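
A quick way to look is to scan the system log for suspicious lines
that precede the first panic report. This is only a rough sketch: the
keyword list is a guess, since the exact wording of driver messages
varies by controller and driver, and the function name is made up for
illustration.

```python
import re

# Assumed keywords; adjust to whatever your driver actually logs.
SUSPECT = re.compile(r"time(d )?out|retr(y|ies|ying)|reset", re.IGNORECASE)
# Text of the savecore line reported after the panic.
PANIC = "reboot after panic"


def suspicious_lines_before_panic(lines):
    """Return lines mentioning timeouts, retries or resets that appear
    before the first panic report in `lines`."""
    hits = []
    for line in lines:
        if PANIC in line:
            break  # stop at the first panic report
        if SUSPECT.search(line):
            hits.append(line.rstrip("\n"))
    return hits


if __name__ == "__main__":
    import sys

    # Assumption: pass the log file to scan as the first argument,
    # e.g. /var/adm/messages on an illumos-based system.
    with open(sys.argv[1], errors="replace") as log:
        for hit in suspicious_lines_before_panic(log):
            print(hit)
```

An empty result does not rule out hardware trouble; it only means none
of the guessed keywords showed up before the panic.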



> After the reboot the scrub continues, but after half an hour the server
> reboots again, and keeps doing so until I stop the scrub process.
>
> What about that? What can I do?
>
> Some server characteristics are:
> - controller DELL-PERC H700
> - RAM 64GB
> - four 500GB disks configured as raidz1
>
> SmartOS version is joyent_20161013T025521Z, but the same issue is
> present even when booting the previous version.


How is the H700 configured, RAID or HBA mode?

Are the disks SSD or rust?

Reports like these pop up from time to time and most cases I have seen 
have been caused by failing or otherwise dodgy drives or controllers.


--
Ian.





[smartos-discuss] panic: 'zones' appears to be hung

2016-10-14 Thread Alessio Ciregia
I've performed a scrub on a server.
The server automatically rebooted after some time.

Dmesg reports this message:

2016-10-14T20:28:51.231384+00:00 iperione savecore: [ID 570001
auth.error] reboot after panic: I/O to pool 'zones' appears to be hung.

After the reboot the scrub continues, but after half an hour the server
reboots again, and keeps doing so until I stop the scrub process.

What about that? What can I do?

Some server characteristics are:
- controller DELL-PERC H700
- RAM 64GB
- four 500GB disks configured as raidz1

SmartOS version is joyent_20161013T025521Z, but the same issue is
present even when booting the previous version.


Thanks,
A.

