[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2019-07-24 Thread Brad Figg
** Tags added: cscc

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Released
Status in linux source package in Xenial:
  Fix Released
Status in linux source package in Bionic:
  Fix Released

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-10-01 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 4.15.0-36.39

---
linux (4.15.0-36.39) bionic; urgency=medium

  * CVE-2018-14633
- iscsi target: Use hex2bin instead of a re-implementation

  * CVE-2018-17182
- mm: get rid of vmacache_flush_all() entirely

linux (4.15.0-35.38) bionic; urgency=medium

  * linux: 4.15.0-35.38 -proposed tracker (LP: #1791719)

  * device hotplug of vfio devices can lead to deadlock in vfio_pci_release
(LP: #1792099)
- SAUCE: vfio -- release device lock before userspace requests

  * L1TF mitigation not effective in some CPU and RAM combinations
(LP: #1788563)
- x86/speculation/l1tf: Fix overflow in l1tf_pfn_limit() on 32bit
- x86/speculation/l1tf: Fix off-by-one error when warning that system has 
too
  much RAM
- x86/speculation/l1tf: Increase l1tf memory limit for Nehalem+

  * CVE-2018-15594
- x86/paravirt: Fix spectre-v2 mitigations for paravirt guests

  * CVE-2017-5715 (Spectre v2 s390x)
- KVM: s390: implement CPU model only facilities
- s390: detect etoken facility
- KVM: s390: add etoken support for guests
- s390/lib: use expoline for all bcr instructions
- s390: fix br_r1_trampoline for machines without exrl
- SAUCE: s390: use expoline thunks for all branches generated by the BPF JIT

  * Ubuntu18.04.1: cpuidle: powernv: Fix promotion from snooze if next state
disabled (performance) (LP: #1790602)
- cpuidle: powernv: Fix promotion from snooze if next state disabled

  * Watchdog CPU:19 Hard LOCKUP when kernel crash was triggered (LP: #1790636)
- powerpc: hard disable irqs in smp_send_stop loop
- powerpc: Fix deadlock with multiple calls to smp_send_stop
- powerpc: smp_send_stop do not offline stopped CPUs
- powerpc/powernv: Fix opal_event_shutdown() called with interrupts disabled

  * Security fix: check if IOMMU page is contained in the pinned physical page
(LP: #1785675)
- vfio/spapr: Use IOMMU pageshift rather than pagesize
- KVM: PPC: Check if IOMMU page is contained in the pinned physical page

  * Missing Intel GPU pci-id's (LP: #1789924)
- drm/i915/kbl: Add KBL GT2 sku
- drm/i915/whl: Introducing Whiskey Lake platform
- drm/i915/aml: Introducing Amber Lake platform
- drm/i915/cfl: Add a new CFL PCI ID.

  * CVE-2018-15572
- x86/speculation: Protect against userspace-userspace spectreRSB

  * Support Power Management for Thunderbolt Controller  (LP: #1789358)
- thunderbolt: Handle NULL boot ACL entries properly
- thunderbolt: Notify userspace when boot_acl is changed
- thunderbolt: Use 64-bit DMA mask if supported by the platform
- thunderbolt: Do not unnecessarily call ICM get route
- thunderbolt: No need to take tb->lock in domain suspend/complete
- thunderbolt: Use correct ICM commands in system suspend
- thunderbolt: Add support for runtime PM

  * random oopses on s390 systems using NVMe devices (LP: #1790480)
- s390/pci: fix out of bounds access during irq setup

  * [Bionic] Spectre v4 mitigation (Speculative Store Bypass Disable) support
for arm64 using SMC firmware call to set a hardware chicken bit
(LP: #1787993) // CVE-2018-3639 (arm64)
- arm64: alternatives: Add dynamic patching feature
- KVM: arm/arm64: Do not use kern_hyp_va() with kvm_vgic_global_state
- KVM: arm64: Avoid storing the vcpu pointer on the stack
- arm/arm64: smccc: Add SMCCC-specific return codes
- arm64: Call ARCH_WORKAROUND_2 on transitions between EL0 and EL1
- arm64: Add per-cpu infrastructure to call ARCH_WORKAROUND_2
- arm64: Add ARCH_WORKAROUND_2 probing
- arm64: Add 'ssbd' command-line option
- arm64: ssbd: Add global mitigation state accessor
- arm64: ssbd: Skip apply_ssbd if not using dynamic mitigation
- arm64: ssbd: Restore mitigation status on CPU resume
- arm64: ssbd: Introduce thread flag to control userspace mitigation
- arm64: ssbd: Add prctl interface for per-thread mitigation
- arm64: KVM: Add HYP per-cpu accessors
- arm64: KVM: Add ARCH_WORKAROUND_2 support for guests
- arm64: KVM: Handle guest's ARCH_WORKAROUND_2 requests
- arm64: KVM: Add ARCH_WORKAROUND_2 discovery through ARCH_FEATURES_FUNC_ID
- [Config] ARM64_SSBD=y

  * Reconcile hns3 SAUCE patches with upstream (LP: #1787477)
- Revert "UBUNTU: SAUCE: net: hns3: Optimize PF CMDQ interrupt switching
  process"
- Revert "UBUNTU: SAUCE: net: hns3: Fix for VF mailbox receiving unknown
  message"
- Revert "UBUNTU: SAUCE: net: hns3: Fix for VF mailbox cannot receiving PF
  response"
- Revert "UBUNTU: SAUCE: {topost} net: hns3: fix comments for
  hclge_get_ring_chain_from_mbx"
- Revert "UBUNTU: SAUCE: {topost} net: hns3: fix for using wrong mask and
  shift in hclge_get_ring_chain_from_mbx"
- Revert "UBUNTU: SAUCE: {topost} net: hns3: fix for reset_level default
  assignment probelm"
- Revert "UBUNTU: SAUCE: {topost} net: hns3: 

[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-10-01 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 4.4.0-137.163

---
linux (4.4.0-137.163) xenial; urgency=medium

  * CVE-2018-14633
- iscsi target: Use hex2bin instead of a re-implementation

  * CVE-2018-17182
- mm: get rid of vmacache_flush_all() entirely

linux (4.4.0-136.162) xenial; urgency=medium

  * linux: 4.4.0-136.162 -proposed tracker (LP: #1791745)

  * CVE-2017-5753
- bpf: properly enforce index mask to prevent out-of-bounds speculation
- Revert "UBUNTU: SAUCE: bpf: Use barrier_nospec() instead of osb()"
- Revert "bpf: prevent speculative execution in eBPF interpreter"

  * L1TF mitigation not effective in some CPU and RAM combinations
(LP: #1788563) // CVE-2018-3620 // CVE-2018-3646
- x86/speculation/l1tf: Fix overflow in l1tf_pfn_limit() on 32bit
- x86/speculation/l1tf: Fix off-by-one error when warning that system has 
too
  much RAM
- x86/speculation/l1tf: Increase l1tf memory limit for Nehalem+

  * CVE-2018-15594
- x86/paravirt: Fix spectre-v2 mitigations for paravirt guests

  * Xenial update to 4.4.144 stable release (LP: #1791080)
- KVM/Eventfd: Avoid crash when assign and deassign specific eventfd in
  parallel.
- x86/MCE: Remove min interval polling limitation
- fat: fix memory allocation failure handling of match_strdup()
- ALSA: rawmidi: Change resized buffers atomically
- ARC: Fix CONFIG_SWAP
- ARC: mm: allow mprotect to make stack mappings executable
- mm: memcg: fix use after free in mem_cgroup_iter()
- ipv4: Return EINVAL when ping_group_range sysctl doesn't map to user ns
- ipv6: fix useless rol32 call on hash
- lib/rhashtable: consider param->min_size when setting initial table size
- net/ipv4: Set oif in fib_compute_spec_dst
- net: phy: fix flag masking in __set_phy_supported
- ptp: fix missing break in switch
- tg3: Add higher cpu clock for 5762.
- net: Don't copy pfmemalloc flag in __copy_skb_header()
- skbuff: Unconditionally copy pfmemalloc in __skb_clone()
- xhci: Fix perceived dead host due to runtime suspend race with event 
handler
- x86/paravirt: Make native_save_fl() extern inline
- SAUCE: Add missing CPUID_7_EDX defines
- SAUCE: x86/speculation: Expose indirect_branch_prediction_barrier()
- x86/pti: Mark constant arrays as __initconst
- x86/asm/entry/32: Simplify pushes of zeroed pt_regs->REGs
- x86/entry/64/compat: Clear registers for compat syscalls, to reduce
  speculation attack surface
- x86/speculation: Clean up various Spectre related details
- x86/speculation: Fix up array_index_nospec_mask() asm constraint
- x86/xen: Zero MSR_IA32_SPEC_CTRL before suspend
- x86/mm: Factor out LDT init from context init
- x86/mm: Give each mm TLB flush generation a unique ID
- SAUCE: x86/speculation: Use Indirect Branch Prediction Barrier in context
  switch
- x86/speculation: Use IBRS if available before calling into firmware
- x86/speculation: Move firmware_restrict_branch_speculation_*() from C to 
CPP
- selftest/seccomp: Fix the seccomp(2) signature
- xen: set cpu capabilities from xen_start_kernel()
- x86/amd: don't set X86_BUG_SYSRET_SS_ATTRS when running under Xen
- SAUCE: Preserve SPEC_CTRL MSR in new inlines
- SAUCE: Add Knights Mill to NO SSB list
- x86/process: Correct and optimize TIF_BLOCKSTEP switch
- x86/process: Optimize TIF_NOTSC switch
- Revert "x86/cpufeatures: Add FEATURE_ZEN"
- Revert "x86/cpu/AMD: Fix erratum 1076 (CPB bit)"
- x86/cpu/AMD: Fix erratum 1076 (CPB bit)
- x86/cpufeatures: Add FEATURE_ZEN
- x86/xen: Add call of speculative_store_bypass_ht_init() to PV paths
- x86/cpu: Re-apply forced caps every time CPU caps are re-read
- block: do not use interruptible wait anywhere
- clk: tegra: Fix PLL_U post divider and initial rate on Tegra30
- ubi: Introduce vol_ignored()
- ubi: Rework Fastmap attach base code
- ubi: Be more paranoid while seaching for the most recent Fastmap
- ubi: Fix races around ubi_refill_pools()
- ubi: Fix Fastmap's update_vol()
- ubi: fastmap: Erase outdated anchor PEBs during attach
- Linux 4.4.144

  * CVE-2017-5715 (Spectre v2 s390x)
- s390: detect etoken facility
- s390/lib: use expoline for all bcr instructions
- SAUCE: s390: use expoline thunks for all branches generated by the BPF JIT

  * Xenial update to 4.4.143 stable release (LP: #1790884)
- compiler, clang: suppress warning for unused static inline functions
- compiler, clang: properly override 'inline' for clang
- compiler, clang: always inline when CONFIG_OPTIMIZE_INLINING is disabled
- compiler-gcc.h: Add __attribute__((gnu_inline)) to all inline declarations
- x86/asm: Add _ASM_ARG* constants for argument registers to 
- ocfs2: subsystem.su_mutex is required while accessing the item->ci_parent
- bcm63xx_enet: correct clock usage
- bcm63xx_enet: do 

[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-10-01 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 3.13.0-160.210

---
linux (3.13.0-160.210) trusty; urgency=medium

  * CVE-2018-14633
- iscsi target: Use hex2bin instead of a re-implementation

  * CVE-2018-14634
- exec: Limit arg stack to at most 75% of _STK_LIM

linux (3.13.0-159.209) trusty; urgency=medium

  * linux: 3.13.0-159.209 -proposed tracker (LP: #1791754)

  * L1TF mitigation not effective in some CPU and RAM combinations
(LP: #1788563) // CVE-2018-3620 // CVE-2018-3646
- x86/speculation/l1tf: Fix overflow in l1tf_pfn_limit() on 32bit
- x86/speculation/l1tf: Fix off-by-one error when warning that system has 
too
  much RAM
- x86/speculation/l1tf: Increase l1tf memory limit for Nehalem+

  * CVE-2018-15594
- x86/paravirt: Fix spectre-v2 mitigations for paravirt guests

  * i40e NIC not recognized (LP: #1789215)
- SAUCE: i40e_bpo: Import the i40e driver from Xenial 4.4
- SAUCE: i40e_bpo: Add a compatibility layer
- SAUCE: i40e_bpo: Don't probe for NICs supported by the in-tree driver
- SAUCE: i40e_bpo: Rename the driver to i40e_bpo
- SAUCE: i40e_bpo: Hook the driver into the kernel tree
- [Config] Add CONFIG_I40E_BPO=m

  * Probable regression with EXT3 file systems and CVE-2018-1093 patches
(LP: #1789131)
- ext4: fix bitmap position validation

  * CVE-2018-3620 // CVE-2018-3646
- mm: x86 pgtable: drop unneeded preprocessor ifdef
- x86/asm: Move PUD_PAGE macros to page_types.h
- x86/asm: Add pud/pmd mask interfaces to handle large PAT bit
- x86/asm: Fix pud/pmd interfaces to handle large PAT bit
- x86/mm: Fix regression with huge pages on PAE
- SAUCE: x86/speculation/l1tf: Protect NUMA hinting PTEs against speculation
- Revert "UBUNTU: [Config] disable NUMA_BALANCING"

  * CVE-2018-15572
- x86/retpoline: Fill RSB on context switch for affected CPUs
- x86/speculation: Protect against userspace-userspace spectreRSB

  * CVE-2018-6555
- SAUCE: irda: Only insert new objects into the global database via 
setsockopt

  * CVE-2018-6554
- SAUCE: irda: Fix memory leak caused by repeated binds of irda socket

  * BUG: soft lockup - CPU#0 stuck for 23s! [kworker/0:1:1119] (LP: #1788817)
- drm/ast: Fixed system hanged if disable P2A

  * errors when scanning partition table of corrupted AIX disk (LP: #1787281)
- partitions/aix: fix usage of uninitialized lv_info and lvname structures
- partitions/aix: append null character to print data from disk

 -- Stefan Bader   Mon, 24 Sep 2018 19:38:31
+0200

** Changed in: linux (Ubuntu Trusty)
   Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Released
Status in linux source package in Xenial:
  Fix Released
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached

[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-10-01 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 3.13.0-160.210

---
linux (3.13.0-160.210) trusty; urgency=medium

  * CVE-2018-14633
- iscsi target: Use hex2bin instead of a re-implementation

  * CVE-2018-14634
- exec: Limit arg stack to at most 75% of _STK_LIM

linux (3.13.0-159.209) trusty; urgency=medium

  * linux: 3.13.0-159.209 -proposed tracker (LP: #1791754)

  * L1TF mitigation not effective in some CPU and RAM combinations
(LP: #1788563) // CVE-2018-3620 // CVE-2018-3646
- x86/speculation/l1tf: Fix overflow in l1tf_pfn_limit() on 32bit
- x86/speculation/l1tf: Fix off-by-one error when warning that system has 
too
  much RAM
- x86/speculation/l1tf: Increase l1tf memory limit for Nehalem+

  * CVE-2018-15594
- x86/paravirt: Fix spectre-v2 mitigations for paravirt guests

  * i40e NIC not recognized (LP: #1789215)
- SAUCE: i40e_bpo: Import the i40e driver from Xenial 4.4
- SAUCE: i40e_bpo: Add a compatibility layer
- SAUCE: i40e_bpo: Don't probe for NICs supported by the in-tree driver
- SAUCE: i40e_bpo: Rename the driver to i40e_bpo
- SAUCE: i40e_bpo: Hook the driver into the kernel tree
- [Config] Add CONFIG_I40E_BPO=m

  * Probable regression with EXT3 file systems and CVE-2018-1093 patches
(LP: #1789131)
- ext4: fix bitmap position validation

  * CVE-2018-3620 // CVE-2018-3646
- mm: x86 pgtable: drop unneeded preprocessor ifdef
- x86/asm: Move PUD_PAGE macros to page_types.h
- x86/asm: Add pud/pmd mask interfaces to handle large PAT bit
- x86/asm: Fix pud/pmd interfaces to handle large PAT bit
- x86/mm: Fix regression with huge pages on PAE
- SAUCE: x86/speculation/l1tf: Protect NUMA hinting PTEs against speculation
- Revert "UBUNTU: [Config] disable NUMA_BALANCING"

  * CVE-2018-15572
- x86/retpoline: Fill RSB on context switch for affected CPUs
- x86/speculation: Protect against userspace-userspace spectreRSB

  * CVE-2018-6555
- SAUCE: irda: Only insert new objects into the global database via 
setsockopt

  * CVE-2018-6554
- SAUCE: irda: Fix memory leak caused by repeated binds of irda socket

  * BUG: soft lockup - CPU#0 stuck for 23s! [kworker/0:1:1119] (LP: #1788817)
- drm/ast: Fixed system hanged if disable P2A

  * errors when scanning partition table of corrupted AIX disk (LP: #1787281)
- partitions/aix: fix usage of uninitialized lv_info and lvname structures
- partitions/aix: append null character to print data from disk

 -- Stefan Bader   Mon, 24 Sep 2018 19:38:31
+0200

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Released
Status in linux source package in Xenial:
  Fix Released
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially

[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-10-01 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 4.4.0-137.163

---
linux (4.4.0-137.163) xenial; urgency=medium

  * CVE-2018-14633
- iscsi target: Use hex2bin instead of a re-implementation

  * CVE-2018-17182
- mm: get rid of vmacache_flush_all() entirely

linux (4.4.0-136.162) xenial; urgency=medium

  * linux: 4.4.0-136.162 -proposed tracker (LP: #1791745)

  * CVE-2017-5753
- bpf: properly enforce index mask to prevent out-of-bounds speculation
- Revert "UBUNTU: SAUCE: bpf: Use barrier_nospec() instead of osb()"
- Revert "bpf: prevent speculative execution in eBPF interpreter"

  * L1TF mitigation not effective in some CPU and RAM combinations
(LP: #1788563) // CVE-2018-3620 // CVE-2018-3646
- x86/speculation/l1tf: Fix overflow in l1tf_pfn_limit() on 32bit
- x86/speculation/l1tf: Fix off-by-one error when warning that system has 
too
  much RAM
- x86/speculation/l1tf: Increase l1tf memory limit for Nehalem+

  * CVE-2018-15594
- x86/paravirt: Fix spectre-v2 mitigations for paravirt guests

  * Xenial update to 4.4.144 stable release (LP: #1791080)
- KVM/Eventfd: Avoid crash when assign and deassign specific eventfd in
  parallel.
- x86/MCE: Remove min interval polling limitation
- fat: fix memory allocation failure handling of match_strdup()
- ALSA: rawmidi: Change resized buffers atomically
- ARC: Fix CONFIG_SWAP
- ARC: mm: allow mprotect to make stack mappings executable
- mm: memcg: fix use after free in mem_cgroup_iter()
- ipv4: Return EINVAL when ping_group_range sysctl doesn't map to user ns
- ipv6: fix useless rol32 call on hash
- lib/rhashtable: consider param->min_size when setting initial table size
- net/ipv4: Set oif in fib_compute_spec_dst
- net: phy: fix flag masking in __set_phy_supported
- ptp: fix missing break in switch
- tg3: Add higher cpu clock for 5762.
- net: Don't copy pfmemalloc flag in __copy_skb_header()
- skbuff: Unconditionally copy pfmemalloc in __skb_clone()
- xhci: Fix perceived dead host due to runtime suspend race with event 
handler
- x86/paravirt: Make native_save_fl() extern inline
- SAUCE: Add missing CPUID_7_EDX defines
- SAUCE: x86/speculation: Expose indirect_branch_prediction_barrier()
- x86/pti: Mark constant arrays as __initconst
- x86/asm/entry/32: Simplify pushes of zeroed pt_regs->REGs
- x86/entry/64/compat: Clear registers for compat syscalls, to reduce
  speculation attack surface
- x86/speculation: Clean up various Spectre related details
- x86/speculation: Fix up array_index_nospec_mask() asm constraint
- x86/xen: Zero MSR_IA32_SPEC_CTRL before suspend
- x86/mm: Factor out LDT init from context init
- x86/mm: Give each mm TLB flush generation a unique ID
- SAUCE: x86/speculation: Use Indirect Branch Prediction Barrier in context
  switch
- x86/speculation: Use IBRS if available before calling into firmware
- x86/speculation: Move firmware_restrict_branch_speculation_*() from C to 
CPP
- selftest/seccomp: Fix the seccomp(2) signature
- xen: set cpu capabilities from xen_start_kernel()
- x86/amd: don't set X86_BUG_SYSRET_SS_ATTRS when running under Xen
- SAUCE: Preserve SPEC_CTRL MSR in new inlines
- SAUCE: Add Knights Mill to NO SSB list
- x86/process: Correct and optimize TIF_BLOCKSTEP switch
- x86/process: Optimize TIF_NOTSC switch
- Revert "x86/cpufeatures: Add FEATURE_ZEN"
- Revert "x86/cpu/AMD: Fix erratum 1076 (CPB bit)"
- x86/cpu/AMD: Fix erratum 1076 (CPB bit)
- x86/cpufeatures: Add FEATURE_ZEN
- x86/xen: Add call of speculative_store_bypass_ht_init() to PV paths
- x86/cpu: Re-apply forced caps every time CPU caps are re-read
- block: do not use interruptible wait anywhere
- clk: tegra: Fix PLL_U post divider and initial rate on Tegra30
- ubi: Introduce vol_ignored()
- ubi: Rework Fastmap attach base code
- ubi: Be more paranoid while seaching for the most recent Fastmap
- ubi: Fix races around ubi_refill_pools()
- ubi: Fix Fastmap's update_vol()
- ubi: fastmap: Erase outdated anchor PEBs during attach
- Linux 4.4.144

  * CVE-2017-5715 (Spectre v2 s390x)
- s390: detect etoken facility
- s390/lib: use expoline for all bcr instructions
- SAUCE: s390: use expoline thunks for all branches generated by the BPF JIT

  * Xenial update to 4.4.143 stable release (LP: #1790884)
- compiler, clang: suppress warning for unused static inline functions
- compiler, clang: properly override 'inline' for clang
- compiler, clang: always inline when CONFIG_OPTIMIZE_INLINING is disabled
- compiler-gcc.h: Add __attribute__((gnu_inline)) to all inline declarations
- x86/asm: Add _ASM_ARG* constants for argument registers to 
- ocfs2: subsystem.su_mutex is required while accessing the item->ci_parent
- bcm63xx_enet: correct clock usage
- bcm63xx_enet: do 

[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-09-17 Thread Mauricio Faria de Oliveira
SRU Verification: all 3 stable releases successfully survive the disk
probe/partition scan of the disk image with corrupted AIX partition
table.

Updating verification tags.  Thanks!

Trusty:
---

$ uname -a
Linux trusty 3.13.0-159-generic #209-Ubuntu SMP Wed Sep 12 10:01:59 UTC 2018 
x86_64 x86_64 x86_64 GNU/Linux

$ sudo losetup --find --show --partscan disk.img 
[  691.319981]  loop0: unsupported AIX LVM version 30 found
[  691.319981]  unknown partition table
/dev/loop0


Xenial:
---

$ uname -a
Linux trusty 4.4.0-136-generic #162-Ubuntu SMP Tue Sep 11 15:12:40 UTC 2018 
x86_64 x86_64 x86_64 GNU/Linux

$ sudo losetup --find --show --partscan disk.img
/dev/loop0


Bionic:
---

$ uname -a
Linux trusty 4.15.0-35-generic #38-Ubuntu SMP Wed Sep 12 10:38:24 UTC 2018 
x86_64 x86_64 x86_64 GNU/Linux

$ sudo losetup --find --show --partscan disk.img
/dev/loop0


** Tags removed: verification-needed-bionic verification-needed-trusty 
verification-needed-xenial
** Tags added: verification-done-bionic verification-done-trusty 
verification-done-xenial

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Committed
Status in linux source package in Xenial:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-09-14 Thread Brad Figg
This bug is awaiting verification that the kernel in -proposed solves
the problem. Please test the kernel and update this bug with the
results. If the problem is solved, change the tag 'verification-needed-
bionic' to 'verification-done-bionic'. If the problem still exists,
change the tag 'verification-needed-bionic' to 'verification-failed-
bionic'.

If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: verification-needed-bionic

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Committed
Status in linux source package in Xenial:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-09-13 Thread Brad Figg
This bug is awaiting verification that the kernel in -proposed solves
the problem. Please test the kernel and update this bug with the
results. If the problem is solved, change the tag 'verification-needed-
trusty' to 'verification-done-trusty'. If the problem still exists,
change the tag 'verification-needed-trusty' to 'verification-failed-
trusty'.

If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: verification-needed-trusty

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Committed
Status in linux source package in Xenial:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-09-12 Thread Brad Figg
This bug is awaiting verification that the kernel in -proposed solves
the problem. Please test the kernel and update this bug with the
results. If the problem is solved, change the tag 'verification-needed-
xenial' to 'verification-done-xenial'. If the problem still exists,
change the tag 'verification-needed-xenial' to 'verification-failed-
xenial'.

If verification is not done by 5 working days from today, this fix will
be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how
to enable and use -proposed. Thank you!


** Tags added: verification-needed-xenial

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Committed
Status in linux source package in Xenial:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-09-04 Thread Kleber Sacilotto de Souza
** Changed in: linux (Ubuntu Bionic)
   Status: New => Fix Committed

** Changed in: linux (Ubuntu Xenial)
   Status: New => Fix Committed

** Changed in: linux (Ubuntu Trusty)
   Status: New => Fix Committed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  Fix Committed
Status in linux source package in Xenial:
  Fix Committed
Status in linux source package in Bionic:
  Fix Committed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-27 Thread Stefan Bader
Replaced nomination for Precise by one for Trusty which appears to be
the intended series.

** Also affects: linux (Ubuntu Trusty)
   Importance: Undecided
   Status: New

** No longer affects: linux (Ubuntu Precise)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Trusty:
  New
Status in linux source package in Xenial:
  New
Status in linux source package in Bionic:
  New

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-24 Thread Rohit
Hey,

I am so sorry, I accidentally changed the status of this bug. I
apologize for the inconvenience. please rollback my status change.

** Changed in: linux (Ubuntu)
   Status: Fix Committed => Fix Released

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Precise:
  New
Status in linux source package in Xenial:
  New
Status in linux source package in Bionic:
  New

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-23 Thread Seth Forshee
** Also affects: linux (Ubuntu Bionic)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Xenial)
   Importance: Undecided
   Status: New

** Also affects: linux (Ubuntu Precise)
   Importance: Undecided
   Status: New

** Changed in: linux (Ubuntu)
   Status: Triaged => Fix Committed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Fix Committed
Status in linux source package in Precise:
  New
Status in linux source package in Xenial:
  New
Status in linux source package in Bionic:
  New

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-16 Thread Joseph Salisbury
** Changed in: linux (Ubuntu)
   Importance: Undecided => Medium

** Changed in: linux (Ubuntu)
   Status: Confirmed => Triaged

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Triaged

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-15 Thread Mauricio Faria de Oliveira
The logs and kernel version provided in the test comments should be sufficient 
to understand/confirm the problem and affected versions.
Changing the status of the bug to Confirmed.

** Changed in: linux (Ubuntu)
   Status: Incomplete => Confirmed

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-15 Thread Mauricio Faria de Oliveira
Test on Trusty

Before:

$ uname -a
Linux bionic 3.13.0-155-generic #205-Ubuntu SMP Fri Aug 10 15:53:26 UTC 2018 
x86_64 x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk 
 
9
$ sudo losetup --find --show --partscan rlv_grkgld.1mb  

[  270.506420] partition (null) (3 pp's found) is not contiguous
[  270.510221] partition (null) (1 pp's found) is not contiguous
[  270.513952] partition (null) (68 pp's found) is not contiguous
...
[  270.593589] partition (null) (3 pp's found) is not contiguous
[  270.595603] partition (null) (2 pp's found) is not contiguous
[  270.597428] BUG: unable to handle kernel paging request at 1000
[  270.599525] IP: [] strnlen+0xd/0x40
[  270.601404] PGD 0
[  270.601404] Oops:  [#1] SMP
[  270.601404] Modules linked in: squashfs isofs nls_iso8859_1 kvm_intel kvm 
serio_raw sch_fq_codel iscsi
_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 
btrfs libcrc32c raid10 raid456
 async_memcpy async_raid6_recov async_pq async_xor async_tx xor raid6_pq raid1 
raid0 multipath linear psm
ouse floppy
[  270.601404] CPU: 1 PID: 972 Comm: losetup Not tainted 3.13.0-155-generic 
#205-Ubuntu
[  270.601404] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 
1.10.2-1ubuntu1 04/01/2014
[  270.601404] task: 88003998e000 ti: 88003b1b6000 task.ti: 
88003b1b6000
[  270.601404] RIP: 0010:[]  [] 
strnlen+0xd/0x40
[  270.601404] RSP: 0018:88003b1b7888  EFLAGS: 00010086
[  270.601404] RAX: 81a674a1 RBX: 81ecbdec RCX: fffe
[  270.601404] RDX: 1000 RSI:  RDI: 1000
[  270.601404] RBP: 88003b1b7888 R08:  R09: 
[  270.601404] R10: 813e27f0 R11: 88003b1b773e R12: 1000
[  270.601404] R13: 81ecc1c0 R14:  R15: 
[  270.601404] FS:  7fbcbba18740() GS:88003ee8() 
knlGS:
[  270.601404] CS:  0010 DS:  ES:  CR0: 80050033
[  270.601404] CR2: 1000 CR3: 3a9fe000 CR4: 0670
[  270.601404] Stack:
[  270.601404]  88003b1b78c0 8137c0ab 81ecbdec 
81ecc1c0
[  270.601404]  88003b1b79c0 81a939c6 81a939c6 
88003b1b7928
[  270.601404]  8137d521 0086 88003b1b773e 
000c
[  270.601404] Call Trace:
[  270.601404]  [] string.isra.5+0x3b/0xf0
[  270.601404]  [] vsnprintf+0x1c1/0x610
[  270.601404]  [] vscnprintf+0xd/0x30
[  270.601404]  [] vprintk_emit+0x111/0x530
[  270.601404]  [] printk+0x67/0x69
[  270.601404]  [] aix_partition+0x613/0x620
[  270.601404]  [] ? radix_tree_lookup_slot+0xe/0x10
[  270.601404]  [] msdos_partition+0x870/0x890
[  270.601404]  [] ? read_cache_page+0x21/0x30
[  270.601404]  [] ? read_dev_sector+0x2d/0x90
[  270.601404]  [] ? snprintf+0x39/0x40
[  270.601404]  [] ? parse_solaris_x86+0x230/0x230
[  270.601404]  [] check_partition+0x10a/0x240
[  270.601404]  [] rescan_partitions+0xb7/0x2c0
[  270.601404]  [] blkdev_ioctl+0xef/0x7d0
[  270.601404]  [] ? schedule_timeout+0x279/0x310
[  270.601404]  [] ioctl_by_bdev+0x33/0x40
[  270.601404]  [] loop_set_status+0x39a/0x3b0
[  270.601404]  [] loop_set_status64+0x50/0x70
[  270.601404]  [] lo_ioctl+0x1e8/0x730
[  270.601404]  [] blkdev_ioctl+0x21f/0x7d0
[  270.601404]  [] ? system_call_after_swapgs+0x156/0x170
[  270.601404]  [] block_ioctl+0x41/0x50
[  270.601404]  [] do_vfs_ioctl+0x2e3/0x4d0
[  270.601404]  [] ? system_call_after_swapgs+0x117/0x170
[  270.601404]  [] ? system_call_after_swapgs+0x110/0x170
[  270.601404]  [] ? system_call_after_swapgs+0x109/0x170
[  270.601404]  [] ? system_call_after_swapgs+0x102/0x170
[  270.601404]  [] ? system_call_after_swapgs+0xfb/0x170
[  270.601404]  [] ? system_call_after_swapgs+0xf4/0x170
[  270.601404]  [] ? system_call_after_swapgs+0xed/0x170
[  270.601404]  [] ? system_call_after_swapgs+0xe6/0x170
[  270.601404]  [] SyS_ioctl+0x81/0xa0
[  270.601404]  [] ? system_call_after_swapgs+0xb5/0x170
[  270.601404]  [] system_call_fastpath+0x1a/0x1f
[  270.601404] Code: c0 01 80 38 00 75 f7 48 29 f8 5d c3 31 c0 5d c3 66 66 66 
66 66 2e 0f 1f 84 00 00 00
00 00 55 48 85 f6 48 8d 4e ff 48 89 e5 74 2a <80> 3f 00 74 25 48 89 f8 31 d2 eb 
10 0f 1f 80 00 00 00 00 48 83
[  270.601404] RIP  [] strnlen+0xd/0x40
[  270.601404]  RSP 
[  270.601404] CR2: 1000
[  270.732715] ---[ end trace c22abe83af8ab594 ]---


After:

$  uname -a
Linux bionic 3.13.0-155-generic #205+sf181954.1 SMP Wed Aug 15 17:05:33 -03 
2018 x86_64 x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk 
 
9
$ sudo losetup --find --show --partscan rlv_grkgld.1mb  
/dev/loop0

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  e

[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-15 Thread Mauricio Faria de Oliveira
Test on Cosmic:


Before:

$ uname -a
Linux bionic 4.17.0-7-generic #8-Ubuntu SMP Fri Aug 3 15:55:36 UTC 2018 x86_64 
x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk
9
$ sudo losetup --find --show --partscan rlv_grkgld.1mb

[   37.274451] partition (null) (20 pp's found) is not contiguous
[   37.278839] partition (null) (8 pp's found) is not contiguous
[   37.281066] partition (null) (8 pp's found) is not contiguous
...
[   37.330987] partition (null) (1 pp's found) is not contiguous
[   37.334092] partition (null) (63 pp's found) is not contiguous
[   37.336130] partition (null) (1 pp's found) is not contiguous

After:

$ uname -a
Linux bionic 4.17.0-7-generic #8+sf181954.1 SMP Wed Aug 15 13:40:48 -03 2018 
x86_64 x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk
9
$ sudo losetup --find --show --partscan rlv_grkgld.1mb
/dev/loop0

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-15 Thread Mauricio Faria de Oliveira
Test on Bionic

Before:

$ uname -a
Linux bionic 4.15.0-32-generic #35-Ubuntu SMP Fri Aug 10 17:58:07 UTC 2018 
x86_64 x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk
9
$ sudo losetup --find --show --partscan rlv_grkgld.1mb

[   48.192538] partition (null) (5 pp's found) is not contiguous
[   48.195061] partition (null) (3 pp's found) is not contiguous
[   48.198784] partition (null) (2 pp's found) is not contiguous
..
[   48.368026] partition (null) (2 pp's found) is not contiguous
[   48.370136] partition (null) (2 pp's found) is not contiguous
[   48.373262] partition (null) (3 pp's found) is not contiguous

After:

$ uname -a
Linux bionic 4.15.0-32-generic #35+sf181954.1 SMP Wed Aug 15 12:52:37 -03 2018 
x86_64 x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk
9
$ sudo losetup --find --show --partscan rlv_grkgld.1mb
/dev/loop0

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1787281

Title:
  errors when scanning partition table of corrupted AIX disk

Status in linux package in Ubuntu:
  Incomplete

Bug description:
  [Impact]

   * Users with disks/LUNs used for AIX operating system installations
 previously, which possibly undergone overwrites/corruption on the
 partition table, might hit kernel failures during partition scan
 of such disk/LUN, and possibly hang the system (seen with retries).

   * The Linux kernel should be robust to corrupted disk data, performing
 a better sanitization/checks and not failing.

   * The fix are a couple of simple logic changes to make the code
 of the AIX partition table parser more robust.

  [Test Case]

   * Run the partition scan on the (trimmed) disk image of the AIX lun.
 (It's not provided here since it contains customer data), with this
 command:

 $ sudo losetup --find --show --partscan rlv_grkgld.1mb

   * On failure, the command hangs, and messages like these are printed
 to the console, depending on the kernel version (see tests below)

 [  270.506420] partition (null) (3 pp's found) is not contiguous

 [  270.597428] BUG: unable to handle kernel paging request at 
1000
 [  270.599525] IP: [] strnlen+0xd/0x40

   * On success, the command prints a loop device name, for example:

 /dev/loop0
   
  [Regression Potential] 

   * Low. Both changes are simple improvements in logic.

   * This affects users which mount disks/LUNs from the AIX OS;
 it should only change behavior for users which relied on a
 uninitialized variables to work correctly during partition
 scan of those disks/LUNs which should be rare as the code
 is likely to fail as we observe in this scenario.
 
   * This has been tested on Cosmic, Bionic, Xenial, and Trusty.

  
  [Other Info]

   * Patches will be sent to the kernel-team mailing list.

  Bug Description:
  ---

  We've recently received a disk image from an AIX LUN that when
  attached on Linux displayed errors on console, then eventually
  hung the system (specially if the SCSI bus was re-scanned, and
  leading to another partition scan).

  Apparently the LUN was originally installed with AIX and later
  exercised with some I/O stress/overwrites which caused certain
  bits to be wrong in just the right way for Linux to get a NULL
  pointer and invalid data.

  This is the test-case used ('--partscan' is the important bit).
$ sudo losetup --show --find --partscan aix-lun.img

  Since the original code is old, it affects several releases.
  It's interesting to fix this on 14.04 and up, on which IBM
  Power servers were initially supported (since they can run
  AIX too, and possibly hit this due to an already used disk/LUN).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787281/+subscriptions

-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp


[Kernel-packages] [Bug 1787281] Re: errors when scanning partition table of corrupted AIX disk

2018-08-15 Thread Mauricio Faria de Oliveira
Test on Xenial

Before:

Last login: Wed Aug 15 19:29:38 2018 from 192.168.122.1
$ uname -a
Linux bionic 4.4.0-133-generic #159-Ubuntu SMP Fri Aug 10 07:31:43 UTC 2018 
x86_64 x86_64 x86_64 GNU/Linux
$ echo 9 | sudo tee /proc/sys/kernel/printk 
 
9   
 
$ sudo losetup --find --show --partscan rlv_grkgld.1mb 

[  309.251832] partition (null) (21 pp's found) is not contiguous
[  309.254557] partition (null) (2 pp's found) is not contiguous  
[  309.256893] partition (null) (1 pp's found) is not contiguous
...
[  309.335870] partition (null) (2 pp's found) is not contiguous
[  309.338133] partition (null) (64 pp's found) is not contiguous
[  309.339719] partition (null) (1 pp's found) is not contiguous
[  309.345218] BUG: unable to handle kernel paging request at 1040
[  309.347776] IP: [] strnlen+0x9/0x40
[  309.349813] PGD 0 
[  309.350719] Oops:  [#1] SMP  
[  309.351987] Modules linked in: isofs kvm_intel kvm irqbypass input_leds 
joydev serio_raw sch_fq_codel
ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp 
libiscsi scsi_transport_i
scsi ip_tables x_tables autofs4 btrfs raid10 raid456 async_raid6_recov 
async_memcpy async_pq async_xor as
ync_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear psmouse floppy
[  309.367248] CPU: 1 PID: 979 Comm: losetup Not tainted 4.4.0-133-generic 
#159-Ubuntu
[  309.369461] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 
1.10.2-1ubuntu1 04/01/2014
[  309.372403] task: 880039a78000 ti: 88003b2f task.ti: 
88003b2f
[  309.375267] RIP: 0010:[]  [] 
strnlen+0x9/0x40
[  309.377806] RSP: 0018:88003b2f3830  EFLAGS: 00010086 
[  309.379278] RAX: 81cd2b01 RBX: 8211eb6c RCX: 
[  309.381360] RDX: 1040 RSI:  RDI: 1040
[  309.384074] RBP: 88003b2f3830 R08:  R09: 
[  309.385894] R10: 88003b2f39e8 R11: 0251 R12: 1040
[  309.387906] R13: 8211ef40 R14:  R15: 
[  309.389937] FS:  7f7e0163c740() GS:88003fc8() 
knlGS:
[  309.392496] CS:  0010 DS:  ES:  CR0: 80050033
[  309.394040] CR2: 1040 CR3: 39a48000 CR4: 0670
[  309.396505] Stack:   
[  309.397306]  88003b2f3868 8140d8cb 8211eb6c 
8211ef40
[  309.400370]  88003b2f3980 81d00816 81d00816 
88003b2f38d8
[  309.403210]  8140f385 0092 0092 
000c
[  309.406231] Call Trace:   
[  309.407114]  [] string.isra.4+0x3b/0xd0
  
[  309.408818]  [] vsnprintf+0x135/0x530  
 
[  309.410869]  [] vscnprintf+0xd/0x30

[  309.412283]  [] vprintk_emit+0x123/0x520   

[  309.413983]  [] vprintk_default+0x29/0x40  
[  309.415495]  [] printk+0x5a/0x76   
[  309.417463]  [] aix_partition+0x5d1/0x600  
[  309.419595]  [] ? pagecache_get_page+0x2d/0x1c0
[  309.421218]  [] msdos_partition+0x86b/0x880
[  309.423173]  [] ? blkdev_readpages+0x20/0x20   
[  309.424963]  [] ? read_cache_page+0x19/0x20
   
[  309.426515]  [] ? read_dev_sector+0x2d/0x90
[  309.428280]  [] ? snprintf+0x49/0x60   
[  309.429707]  [] ? parse_solaris_x86+0x210/0x210
[  309.432081]  [] check_partition+0x140/0x220
   
[  309.433845]  [] rescan_partitions+0xbb/0x2b0   
   
[  309.435421]  [] ? security_capable+0x4e/0x70   
   
[  309.437006]  [] __blkdev_reread_part+0x65/0x70
[  309.438598]  [] blkdev_reread_part+0x23/0x40
[  309.440173]  [] loop_reread_partitions+0x28/0x50
[  309.442811]  [] loop_set_status+0x39a/0x3d0
[  309.444381]  [] loop_set_status64+0x50/0x70
[  309.446146]  [] lo_ioctl+0xf1/0x8b0
[  309.447547]  [] blkdev_ioctl+0x25e/0x910
[  309.449056]  [] ? __fd_install+0x33/0xe0   
[  309.451395]  [] block_ioctl+0x3d/0x50  
[  309.453275]  [] do_vfs_ioctl+0x2af/0x4b0
[  309.455019]  [] ? putname+0x54/0x60 
[  309.456434]  [] ? do_sys_open+0x1bf/0x2a0  
[  309.457943]  [] SyS_ioctl+0x79/0x90
[  309.459886]  [] entry_SYSCALL_64_fastpath+0x22/0xc1
[  309.462722] Code: 00 00 80 3f 00 55 48 89 e5 74 11 48 89 f8 48 83 c0 01 80 
38 00 75 f7 48 29 f8 5d c3
31 c0 5d c3 66 90 55 48 85 f6 48 89 e5 74 2e <80> 3f 00 74 29 48 8d 57 01 48 8d 
04 37 eb 0d 48 8d 4a 01 8
0 79