[Kernel-packages] [Bug 1652185] Re: kernel BUG at /build/linux-7x12eW/linux-4.4.0/drivers/ata/sata_mv.c:2120!
Had another crash this morning, this time with a 3TB Hitachi_HDS5C3030ALA630_MJ1311YNG6KSGA Thankfully I was there to see the drive light stay on, so I had a clue which drive to take offline. Mar 14 07:18:31 monster kernel: [119755.500078] [ cut here ] Mar 14 07:18:31 monster kernel: [119755.500086] kernel BUG at /build/linux-9yOF0g/linux-4.4.0/drivers/ata/sata_mv.c:2120! Mar 14 07:18:31 monster kernel: [119755.500089] invalid opcode: [#1] SMP Mar 14 07:18:31 monster kernel: [119755.500093] Modules linked in: ufs qnx4 hfsplus hfs minix ntfs msdos jfs xfs pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) zfs(PO) zunicode(PO) zcommon(PO) znvpair(PO) spl(O) zavl(PO) nvidia_uvm(POE) gpio_ich coretemp kvm_intel kvm irqbypass input_leds serio_raw lpc_ich nvidia(POE) drm 8250_fintek i3000_edac shpchp mac_hid edac_core parport_pc ppdev lp parport autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear ahci libahci psmouse pata_acpi e1000e ptp pps_core sata_mv fjes Mar 14 07:18:31 monster kernel: [119755.500153] CPU: 3 PID: 214 Comm: scsi_eh_22 Tainted: P OE 4.4.0-66-generic #87-Ubuntu Mar 14 07:18:31 monster kernel: [119755.500156] Hardware name: Supermicro PDSM4+/PDSM4+, BIOS 6.00 11/04/2008 Mar 14 07:18:31 monster kernel: [119755.500160] task: 88003558e600 ti: 8800c8988000 task.ti: 8800c8988000 Mar 14 07:18:31 monster kernel: [119755.500163] RIP: 0010:[] [] mv_qc_prep+0x213/0x230 [sata_mv] Mar 14 07:18:31 monster kernel: [119755.500173] RSP: 0018:8800c898ba30 EFLAGS: 00010006 Mar 14 07:18:31 monster kernel: [119755.500176] RAX: 8800354565a0 RBX: 8800c8bf1d70 RCX: 0047 Mar 14 07:18:31 monster kernel: [119755.500178] RDX: 8800354565aa RSI: 880225cfe060 RDI: 8800c8bf1d70 Mar 14 07:18:31 monster kernel: [119755.500181] RBP: 8800c898ba48 R08: R09: 0001 Mar 14 07:18:31 monster kernel: [119755.500183] R10: 0023 R11: R12: 003f Mar 14 07:18:31 monster kernel: [119755.500186] R13: 8800c8bdd818 R14: 0001 R15: 8800c8bf1d70 Mar 14 07:18:31 monster kernel: [119755.500189] FS: () GS:88022fd8() knlGS: Mar 14 07:18:31 monster kernel: [119755.500192] CS: 0010 DS: ES: CR0: 8005003b Mar 14 07:18:31 monster kernel: [119755.500195] CR2: 7fc39dbb1b60 CR3: c0da4000 CR4: 06e0 Mar 14 07:18:31 monster kernel: [119755.500197] Stack: Mar 14 07:18:31 monster kernel: [119755.500199] 8800c8bf 0001 8800c8bf1e80 8800c898baa0 Mar 14 07:18:31 monster kernel: [119755.500204] 815dcbca 880225cfe098 81e2c3e0 8800c898bb70 Mar 14 07:18:31 monster kernel: [119755.500208] 000281d1c895 8800c898bbe0 8800c8bf1e80 Mar 14 07:18:31 monster kernel: [119755.500213] Call Trace: Mar 14 07:18:31 monster kernel: [119755.500221] [] ata_qc_issue+0x15a/0x390 Mar 14 07:18:31 monster kernel: [119755.500225] [] ata_exec_internal_sg+0x302/0x600 Mar 14 07:18:31 monster kernel: [119755.500229] [] ata_exec_internal+0x69/0xc0 Mar 14 07:18:31 monster kernel: [119755.500234] [] ata_read_log_page.part.11+0x196/0x1d0 Mar 14 07:18:31 monster kernel: [119755.500238] [] ata_eh_analyze_ncq_error+0x111/0x280 Mar 14 07:18:31 monster kernel: [119755.500242] [] ata_eh_link_autopsy+0x9a/0x950 Mar 14 07:18:31 monster kernel: [119755.500246] [] ata_eh_autopsy+0x2b/0xf0 Mar 14 07:18:31 monster kernel: [119755.500251] [] sata_pmp_error_handler+0x12/0x30 Mar 14 07:18:31 monster kernel: [119755.500256] [] mv_pmp_error_handler+0x93/0xa0 [sata_mv] Mar 14 07:18:31 monster kernel: [119755.500260] [] ata_scsi_port_error_handler+0x430/0x770 Mar 14 07:18:31 monster kernel: [119755.500264] [] ? ata_scsi_cmd_error_handler+0x11d/0x150 Mar 14 07:18:31 monster kernel: [119755.500268] [] ata_scsi_error+0xa0/0xe0 Mar 14 07:18:31 monster kernel: [119755.500272] [] scsi_error_handler+0xdb/0x8a0 Mar 14 07:18:31 monster kernel: [119755.500276] [] ? __schedule+0x3b6/0xa30 Mar 14 07:18:31 monster kernel: [119755.500280] [] ? scsi_eh_get_sense+0x240/0x240 Mar 14 07:18:31 monster kernel: [119755.500284] [] kthread+0xd8/0xf0 Mar 14 07:18:31 monster kernel: [119755.500288] [] ? kthread_create_on_node+0x1e0/0x1e0 Mar 14 07:18:31 monster kernel: [119755.500292] [] ret_from_fork+0x3f/0x70 Mar 14 07:18:31 monster kernel: [119755.500295] [] ? kthread_create_on_node+0x1e0/0x1e0 Mar 14 07:18:31 monster kernel: [119755.500298] Code: 66 89 48 0a e9 43 ff ff ff c6 47 35 30 e9 24 fe ff ff be 1c 08 00 00 48 c7 c7 68 b2 02 c0 e8 55 98 05 c1 8b 53 58 e9 93 fe ff ff <0f> 0b 48 83 e2 df 48 89 57 20 e9 4f fe ff ff 0f 1f 40 00 66 2e Mar 14 07:18:31 monster kernel: [119755.500343] RIP [] mv_qc_prep+0x213/0x230
[Kernel-packages] [Bug 1652185] Re: kernel BUG at /build/linux-7x12eW/linux-4.4.0/drivers/ata/sata_mv.c:2120!
I confirm the same crash bug in Kernel versions: 4.4.0-66-generic 4.4.0-64-generic 4.4.0-21-generic I could not test with latest upstream kernel because I needed ZFS (I looked into building it for latest upstream, but found no reasonable path). I can also confirm the crash is related to a drive with bad sectors. In my case a Seagate 2TB ata-ST2000DM001-1ER164_Z4Z55RQ2 Once I removed that drive, the crashing stopped. If I booted with drive offline, and then added it to a booted system, it did not crash until I imported the ZFS pool. I can try some simple tests with DD to confirm the crash in later or earlier kernels. Since I do not want to break a now working system, any testing would be done booting a USB Flash image with only the failing drive connected to a sata_mv port. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1652185 Title: kernel BUG at /build/linux- 7x12eW/linux-4.4.0/drivers/ata/sata_mv.c:2120! Status in linux package in Ubuntu: Incomplete Bug description: # lsb_release -rd Description: Ubuntu 16.04.1 LTS Release: 16.04 # apt-cache policy linux-image-4.4.0-57-generic linux-image-4.4.0-57-generic: Installed: 4.4.0-57.78 Candidate: 4.4.0-57.78 Version table: *** 4.4.0-57.78 500 500 http://us.archive.ubuntu.com/ubuntu xenial-updates/main amd64 Packages 500 http://security.ubuntu.com/ubuntu xenial-security/main amd64 Packages 100 /var/lib/dpkg/status Dec 22 16:59:20 crashplan-2 kernel: [ 9534.464076] [ cut here ] Dec 22 16:59:20 crashplan-2 kernel: [ 9534.464173] kernel BUG at /build/linux-7x12eW/linux-4.4.0/drivers/ata/sata_mv.c:2120! Dec 22 16:59:20 crashplan-2 kernel: [ 9534.464281] invalid opcode: [#1] SMP Dec 22 16:59:20 crashplan-2 kernel: [ 9534.464355] Modules linked in: binfmt_misc zfs(PO) zunicode(PO) zcommon(PO) znvpair(PO) spl(O) zavl(PO) gpio_ich ppdev i5000_edac edac_core parport_pc i5k_amb coretemp kvm parport irqbypass lpc_ich serio_raw input_leds 8250_fintek shpchp mac_hid ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi autofs4 btrfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbhid hid amdkfd amd_iommu_v2 radeon i2c_algo_bit ttm drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm pata_acpi e1000e ptp pps_core sata_mv floppy fjes Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466220] CPU: 0 PID: 198 Comm: scsi_eh_15 Tainted: P O4.4.0-57-generic #78-Ubuntu Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466355] Hardware name: Supermicro X7DB8/X7DB8, BIOS 6.00 12/03/2007 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466452] task: 880034c40cc0 ti: 880034c4c000 task.ti: 880034c4c000 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466564] RIP: 0010:[] [] mv_qc_prep+0x213/0x230 [sata_mv] Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466717] RSP: 0018:880034c4fa30 EFLAGS: 00010006 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466797] RAX: 8800353188c0 RBX: 8800352bdd70 RCX: 0047 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466893] RDX: 8800353188ca RSI: 88012a6af060 RDI: 8800352bdd70 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.466993] RBP: 880034c4fa48 R08: R09: 0001 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467092] R10: 0020 R11: R12: 003f Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467191] R13: 880035083c18 R14: 0001 R15: 8800352bdd70 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467291] FS: () GS:88012fc0() knlGS: Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467412] CS: 0010 DS: ES: CR0: 8005003b Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467489] CR2: 7f284278bc00 CR3: cb676000 CR4: 06f0 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467588] Stack: Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467628] 8800352bc000 0001 8800352bde80 880034c4faa0 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467774] 815d9caa 88012a6af098 81e2c360 880034c4fb70 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.467920] 000281d1863d 880034c4fbe0 8800352bde80 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.468017] Call Trace: Dec 22 16:59:20 crashplan-2 kernel: [ 9534.468017] [] ata_qc_issue+0x15a/0x390 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.468017] [] ata_exec_internal_sg+0x302/0x600 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.468017] [] ata_exec_internal+0x69/0xc0 Dec 22 16:59:20 crashplan-2 kernel: [ 9534.468017] [] ? blk_peek_request+0x4b/0x290 Dec 22
[Kernel-packages] [Bug 1654708] Re: Trying to reinstall ZFS on Xenial. Cannot mount existing zpool with identifier from /mnt/disk/by-id/xxxx
I just upgraded from 14.04 to 16.04 4.4.0-64-generic At first the modprobe zfs seemed to work if done manually after boot. Once I removed zfsutils-linux, and re-installed it would no longer work. I tried multiple variations of removing and re-adding ZFS, and did purge all old ZFS versions from Trusty install. The only fix I could get to function was to modify the source for zpl_xattr.c There were two calls to posix_acl_valid, one in __zpl_xattr_acl_set_access, and one in __zpl_xattr_acl_set_default. I changed them as described above, and did dpkg-reconfigure zfs-dkms The ZFS kernel module would not load until I rebooted, then it all came up and mounted my ZFS pools automatically. It appears this fix is already in the master ZFS on Linux source: https://github.com/zfsonlinux/zfs/blob/master/module/zfs/zpl_xattr.c Hopefully this will end up in Ubuntu soon? -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to zfs-linux in Ubuntu. https://bugs.launchpad.net/bugs/1654708 Title: Trying to reinstall ZFS on Xenial. Cannot mount existing zpool with identifier from /mnt/disk/by-id/ Status in zfs-linux package in Ubuntu: Confirmed Bug description: Description: Ubuntu 16.04.1 LTS Release: 16.04 zfs-dkms 0.6.5.6-0ubuntu15 I cant mount/install my zpool. so i tried reinstalling zfslinux-utils, which gives me three error messages like this: "zfs-mount.service is a disabled" or a static unit, not starting it."... If something doesn't work I try googling error messages. most answers not totally like my problem but related to dkms, so I tried loading zfs-dkms and reinstalling zfslinux-utils. I created the zpool with id from /mnt/disk/by-id/ - which is zfs good- practice. This may be causing problems with the mount. I dont know... Cheers. ProblemType: Package DistroRelease: Ubuntu 16.04 Package: zfs-dkms 0.6.5.6-0ubuntu15 ProcVersionSignature: Ubuntu 4.4.0-58.79-generic 4.4.35 Uname: Linux 4.4.0-58-generic x86_64 NonfreeKernelModules: zfs zunicode zcommon znvpair zavl ApportVersion: 2.20.1-0ubuntu2.4 Architecture: amd64 DKMSKernelVersion: 4.4.0-58-generic Date: Sat Jan 7 15:58:15 2017 DuplicateSignature: dkms:zfs-dkms:0.6.5.6-0ubuntu15:/var/lib/dkms/zfs/0.6.5.6/build/module/zfs/zpl_xattr.c:1284:12: error: too few arguments to function ‘posix_acl_valid’ InstallationDate: Installed on 2016-12-26 (12 days ago) InstallationMedia: Ubuntu 16.04.1 LTS "Xenial Xerus" - Beta amd64 (20161225) PackageVersion: 0.6.5.6-0ubuntu15 RelatedPackageVersions: dpkg 1.18.4ubuntu1.1 apt 1.2.18 SourcePackage: zfs-linux Title: zfs-dkms 0.6.5.6-0ubuntu15: zfs kernel module failed to build UpgradeStatus: No upgrade log present (probably fresh install) To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/zfs-linux/+bug/1654708/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp