I've noted that PVE 4.3 added smartmontools to prerequisite, instali it
and print health disk info on interface. I've also found:

        https://pve.proxmox.com/wiki/Disk_Health_Monitoring

Good!


But i'm some servers with hardware raid controllers, eg some HP servers
with SmartArray controllers.

with the default configuration:

 Oct 21 17:07:40 ashpool smartd[8593]: smartd 6.6 2016-05-31 r4324 
[x86_64-linux-4.4.13-2-pve] (local build)
 Oct 21 17:07:40 ashpool smartd[8593]: Copyright (C) 2002-16, Bruce Allen, 
Christian Franke, www.smartmontools.org
 Oct 21 17:07:40 ashpool smartd[8593]: Opened configuration file 
/etc/smartd.conf
 Oct 21 17:07:40 ashpool smartd[8593]: Drive: DEVICESCAN, implied '-a' 
Directive on line 21 of file /etc/smartd.conf
 Oct 21 17:07:40 ashpool smartd[8593]: Configuration file /etc/smartd.conf was 
parsed, found DEVICESCAN, scanning devices
 Oct 21 17:07:40 ashpool smartd[8593]: Device: /dev/sda, opened
 Oct 21 17:07:40 ashpool smartd[8593]: Device: /dev/sda, [HP       LOGICAL 
VOLUME   3.56], lu id: 0x600508b1001c4972bd1d4906bb1dc75d, S/N: PDNNK0BRH9Z27F, 
500 GB
 Oct 21 17:07:40 ashpool smartd[8593]: Device: /dev/sda, does not support SMART 
Self-Test Log.
 Oct 21 17:07:40 ashpool smartd[8593]: Device: /dev/sda, is SMART capable. 
Adding to "monitor" list.
 [... then smartd scans all the iSCSI volumes ...]
 Oct 21 17:07:40 ashpool smartd[8593]: Monitoring 0 ATA/SATA, 1 SCSI/SAS and 0 
NVMe devices
 Oct 21 17:07:40 ashpool smartd[8593]: Device: /dev/sda, failed to read 
Temperature
 Oct 21 17:07:40 ashpool smartd[8593]: Device: /dev/sda, state written to 
/var/lib/smartmontools/smartd.HP-LOGICAL_VOLUME-PDNNK0BRH9Z27F.scsi.state

Note the 'Device: /dev/sda, failed to read Temperature', that obviously repeat
everytime a smart inquiry get done.


I've tried to do direct hardware scan, eg commenting DEVICESCAN
in smartd.conf and adding:

        /dev/sda -d cciss,0 -a
        /dev/sda -d cciss,1 -a

and now i've access to full SMART data:

 Oct 21 18:14:19 ashpool smartd[7072]: Signal HUP - rereading configuration 
file /etc/smartd.conf
 Oct 21 18:14:19 ashpool smartd[7072]: Opened configuration file 
/etc/smartd.conf
 Oct 21 18:14:19 ashpool smartd[7072]: Configuration file /etc/smartd.conf 
parsed.
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda, type changed from 
'sat,auto+cciss' to 'sat'
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_00] [SAT], 
opened
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_00] [SAT], 
MM0500GBKAK, S/N:9XF3Y3EN, WWN:5-000c50-08782f381, FW:HPGC, 500 GB
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_00] [SAT], 
not found in smartd database.
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_00] [SAT], 
not capable of SMART Health Status check
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_00] [SAT], 
is SMART capable. Adding to "monitor" list.
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda, type changed from 
'sat,auto+cciss' to 'sat'
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_01] [SAT], 
opened
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_01] [SAT], 
MM0500GBKAK, S/N:9XF3XZ0Z, WWN:5-000c50-08784283f, FW:HPGC, 500 GB
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_01] [SAT], 
not found in smartd database.
 Oct 21 18:14:19 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_01] [SAT], 
not capable of SMART Health Status check
 Oct 21 18:14:20 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_01] [SAT], 
is SMART capable. Adding to "monitor" list.
 Oct 21 18:14:20 ashpool smartd[7072]: Monitoring 2 ATA/SATA, 0 SCSI/SAS and 0 
NVMe devices
 Oct 21 18:14:20 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_00] [SAT], 
state written to /var/lib/smartmontools/smartd.MM0500GBKAK-9XF3Y3EN.ata.state
 Oct 21 18:14:20 ashpool smartd[7072]: Device: /dev/sda [cciss_disk_01] [SAT], 
state written to /var/lib/smartmontools/smartd.MM0500GBKAK-9XF3XZ0Z.ata.state

But i'm asking if this can ''confuse'' proxmox a bit (because they see
a disk and two device to monitor...), and was probably better instead to
simply ignore temperature reading.


Thanks.

-- 
dott. Marco Gaiarin                                     GNUPG Key ID: 240A3D66
  Associazione ``La Nostra Famiglia''          http://www.lanostrafamiglia.it/
  Polo FVG   -   Via della Bontà, 7 - 33078   -   San Vito al Tagliamento (PN)
  marco.gaiarin(at)lanostrafamiglia.it   t +39-0434-842711   f +39-0434-842797

                Dona il 5 PER MILLE a LA NOSTRA FAMIGLIA!
    http://www.lanostrafamiglia.it/25/index.php/component/k2/item/123
        (cf 00307430132, categoria ONLUS oppure RICERCA SANITARIA)
_______________________________________________
pve-user mailing list
[email protected]
http://pve.proxmox.com/cgi-bin/mailman/listinfo/pve-user

Reply via email to