> On 8. Jul 2024, at 13:38, Udo Grabowski (IMK) <udo.grabow...@kit.edu> wrote:
> 
> Hi,
> 
> we currently have a raid-z1 pool resilvering (two damaged devices
> in different vdevs), but a third disk in one of the degraded vdevs
> occasionally timeouts:
> 
> Jul  7 06:25:36 imksunth8 scsi: [ID 243001 kern.warning] WARNING: /scsi_vhci 
> (scsi_vhci0):
> Jul  7 06:25:36 imksunth8       /scsi_vhci/disk@g5000cca2441ed63c (sd184): 
> Command Timeout on path mpt_sas4/disk@w5000cca2441ed63d,0
> 
> The problem: These hiccups cause the resilvering to RESTART ! Which
> doesn't help to get the job quickly done, and just accelerates the
> wear on the already unhealthy third disk, and will finally spiral down
> to complete dataloss because of 2-disk-failure on a z1 vdev.
> 
> Is there a way to switchoff this behaviour via a kmdb parameter which
> can be set while operating (it's an older illumos-cf25223258 from 2016) ?
> -- 
> Dr.Udo Grabowski  Inst.of Meteorology & Climate Research IMK-ASF-SAT
> https://www.imk-asf.kit.edu/english/sat.php
> KIT - Karlsruhe Institute of Technology          https://www.kit.edu
> Postfach 3640,76021 Karlsruhe,Germany T:(+49)721 608-26026 F:-926026
> 


I would use live boot with more recent image and get resilver done (hopefully 
faster). 2016 is very old setup and you are basically missing all improvements 
done with resilver code….

rgds,
toomas
------------------------------------------
illumos: illumos-discuss
Permalink: 
https://illumos.topicbox.com/groups/discuss/T2a32a4cc427e4845-Mf14a0c65a7021d38fc1404fb
Delivery options: https://illumos.topicbox.com/groups/discuss/subscription

Reply via email to