Environment:
NETAPP SAN storage
Hyper-V Cluster
OS: Cloud Linux 6.3
Im in the process of moving individual VM's off to a separate Hyper-V
cluster as we are having some stability issues.
The VM connects fine to iscsi, but when I've copied the VHD over to the new
cluster and power it on , I cant reconnect to the iscsi targets anymore.
Even when attempting to connect to the a new lun with a different initiator
name the issue persists
Ive attempted to
stop iscsi
log out of the session
delete the session
and to remove everything under /var/lib/iscisi
When attempting to reconnect, I can see the session and even the /dev/sdc
and /dev/sdd drives that the session provisions
iscsiadm --mode session
tcp: [1] x.x.x.x:3260,7 iqn.1992-08.com.netapp:xxxxxxx
root@lnxwebr02 [~]# iscsiadm --mode session -P 3
..........
..........
************************
Attached SCSI devices:
************************
Host Number: 4 State: running
scsi4 Channel 00 Id 0 Lun: 0
Attached scsi disk sdc State: running
scsi4 Channel 00 Id 0 Lun: 1
Attached scsi disk sdd State: running
However the logs shows buffer io errors
Dec 3 09:23:21 lnxwebr02 kernel: [ 1094.018336] Buffer I/O error on device
sdc, logical block 0
Dec 3 09:24:18 lnxwebr02 kernel: [ 1151.062556] Buffer I/O error on device
sdd, logical block 0
and connection errors
Dec 3 09:12:05 lnxwebr02 kernel: [ 418.382452] scsi4 : iSCSI Initiator
over TCP/IP
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.698600] scsi 4:0:0:0:
Direct-Access NETAPP LUN 8020 PQ: 0 ANSI: 5
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.700035] sd 4:0:0:0: Attached scsi
generic sg2 type 0
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.706815] scsi 4:0:0:1:
Direct-Access NETAPP LUN 8020 PQ: 0 ANSI: 5
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.706842] sd 4:0:0:0: [sdc]
1572990976 512-byte logical blocks: (805 GB/750 GiB)
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.708974] sd 4:0:0:1: Attached scsi
generic sg3 type 0
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.712961] sd 4:0:0:1: [sdd]
419430400 512-byte logical blocks: (214 GB/200 GiB)
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.713334] sd 4:0:0:0: [sdc] Write
Protect is off
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.714268] sd 4:0:0:0: [sdc] Write
cache: disabled, read cache: enabled, doesn't support DPO or FUA
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.715730] sd 4:0:0:1: [sdd] Write
Protect is off
Dec 3 09:12:06 lnxwebr02 kernel: [ 418.716605] sd 4:0:0:1: [sdd] Write
cache: disabled, read cache: enabled, doesn't support DPO or FUA
Dec 3 09:12:06 lnxwebr02 iscsid: Connection1:0 to [target:
iqn.1992-08.com.netapp:wnlsfas3240b, portal: 10.11.52.12,3260] through
[iface: default] is operational now
Dec 3 09:12:29 lnxwebr02 PAM-hulk[2881]: failed to connect stream socket
Dec 3 09:12:52 lnxwebr02 kernel: [ 418.719682] sdc:
Dec 3 09:12:52 lnxwebr02 kernel: [ 464.704180] connection1:0: detected
conn error (1021)
Dec 3 09:12:52 lnxwebr02 iscsid: Kernel reported iSCSI connection 1:0
error (1021 - ISCSI_ERR_SCSI_EH_SESSION_RST: Session was dropped as a
result of SCSI error recovery) state (3)
Dec 3 09:13:00 lnxwebr02 iscsid: connection1:0 is operational after
recovery (1 attempts)
Dec 3 09:13:45 lnxwebr02 kernel: [ 517.704102] connection1:0: detected
conn error (1021)
Dec 3 09:13:45 lnxwebr02 iscsid: Kernel reported iSCSI connection 1:0
error (1021 - ISCSI_ERR_SCSI_EH_SESSION_RST: Session was dropped as a
result of SCSI error recovery) state (3)
Dec 3 09:13:51 lnxwebr02 iscsid: connection1:0 is operational after
recovery (1 attempts)
I'm also seeing the following kernel message
Dec 3 09:14:36 lnxwebr02 kernel: [ 568.704067] connection1:0: detected
conn error (1021)
Dec 3 09:14:36 lnxwebr02 iscsid: Kernel reported iSCSI connection 1:0
error (1021 - ISCSI_ERR_SCSI_EH_SESSION_RST: Session was dropped as a
result of SCSI error recovery) state (3)
Dec 3 09:14:42 lnxwebr02 iscsid: connection1:0 is operational after
recovery (1 attempts)
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.729107] INFO: task async/0:2830
blocked for more than 120 seconds.
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.733430] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737702] async/0 D
ffff88020556c440 0 2830 2 0 0x00000080
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737718] ffff880205595950
0000000000000046 0000000000000000 ffff8802055959b8
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737729] ffff8802023eb938
ffff8802023eb848 ffff880205590f28 ffff880205590f28
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737738] ffff880028036fe8
ffff88020556c9f8 ffff880205595fd8 ffff880205595fd8
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737748] Call Trace:
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737761] [<ffffffff811231b0>] ?
sync_page+0x0/0x50
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737772] [<ffffffff814e9fe3>]
io_schedule+0x73/0xc0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737780] [<ffffffff811231ed>]
sync_page+0x3d/0x50
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737786] [<ffffffff814ea84a>]
__wait_on_bit_lock+0x5a/0xc0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737796] [<ffffffff811cd2c0>] ?
blkdev_get_block+0x0/0x70
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737802] [<ffffffff81123187>]
__lock_page+0x67/0x70
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737808] [<ffffffff81095ac0>] ?
wake_bit_function+0x0/0x50
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737815] [<ffffffff8112465a>]
do_read_cache_page+0xfa/0x1e0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737821] [<ffffffff811ce270>] ?
blkdev_readpage+0x0/0x20
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737827] [<ffffffff81124789>]
read_cache_page_async+0x19/0x20
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737833] [<ffffffff8112479e>]
read_cache_page+0xe/0x20
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737841] [<ffffffff81207c40>]
read_dev_sector+0x30/0x90
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737847] [<ffffffff8120a9d1>]
read_lba+0x101/0x110
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737852] [<ffffffff8120aec5>]
find_valid_gpt+0xd5/0x6b0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737861] [<ffffffff8106c831>] ?
release_console_sem+0x1e1/0x230
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737866] [<ffffffff8120b51f>]
efi_partition+0x7f/0x370
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737872] [<ffffffff814e9155>] ?
printk+0x41/0x44
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737877] [<ffffffff812089b7>]
rescan_partitions+0x1a7/0x470
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737899] [<ffffffffa0020351>] ?
sd_open+0x81/0x1f0 [sd_mod]
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737905] [<ffffffff811ce9d6>]
__blkdev_get+0x1b6/0x3c0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737910] [<ffffffff811cebf0>]
blkdev_get+0x10/0x20
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737916] [<ffffffff81207dee>]
register_disk+0x14e/0x1b0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737925] [<ffffffff81258646>]
add_disk+0xa6/0x160
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737933] [<ffffffffa00239cb>]
sd_probe_async+0x13b/0x210 [sd_mod]
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737938] [<ffffffff81095de6>] ?
add_wait_queue+0x46/0x60
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737947] [<ffffffff8109dd32>]
async_thread+0x102/0x250
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737955] [<ffffffff81059ec0>] ?
default_wake_function+0x0/0x20
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737962] [<ffffffff8109dc30>] ?
async_thread+0x0/0x250
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.737971] [<ffffffff810954a6>]
kthread+0x96/0xa0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.738044] [<ffffffff8100c20a>]
child_rip+0xa/0x20
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.738074] [<ffffffff81095410>] ?
kthread+0x0/0xa0
Dec 3 09:15:08 lnxwebr02 kernel: [ 600.738079] [<ffffffff8100c200>] ?
child_rip+0x0/0x20
Im trying to understand why after copying the vhd file I cant reconnect via
iscsi to the NETAPP, even after logging out and deleting the session.
Is there old references I need to remove or I'm I missing something else?
--
You received this message because you are subscribed to the Google Groups
"open-iscsi" group.
To view this discussion on the web visit
https://groups.google.com/d/msg/open-iscsi/-/i1V2Q9MiO7wJ.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to
[email protected].
For more options, visit this group at
http://groups.google.com/group/open-iscsi?hl=en.