Re: [s390x] New regression was found on kernel-4.16

2018-04-09 Thread Ming Lei
On Mon, Apr 09, 2018 at 06:18:04PM +0800, Li Wang wrote:
> Hi,
> 
> I got this BUG_ON() on s390x platform with kernel-v4.16.0.
> 
> [1.200196] [ cut here ]
> [1.200201] kernel BUG at block/bio.c:1798!
> [1.200228] illegal operation: 0001 ilc:1 [#1] SMP
> [1.200230] Modules linked in: dasd_eckd_mod(+) dasd_mod qeth(+)
> qdio lcs ctcm ccwgroup fsm dm_mirror dm_region_hash dm_log dm_mod
> [1.200236] CPU: 1 PID: 16 Comm: kworker/1:0 Not tainted 4.16.0 #1
> [1.200237] Hardware name: IBM 2827 H43 400 (z/VM 6.4.0)
> [1.200243] Workqueue: events do_kick_device [dasd_mod]
> [1.200245] Krnl PSW : 8cb7d53b 7073145d
> (bio_split+0xcc/0xd0)
> [1.200250]R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3
> CC:2 PM:0 RI: 0 EA:3
> [1.200252] Krnl GPRS:  
> 01cc4400 000 0
> [1.200253]0140 7ccdb400
>  000 8
> [1.200254]01cc4400 03d101df1640
> 1000 000 001cc4400
> [1.200255] 0077f088
> 7ccbb590 000 07ccbb560
> [1.200262] Krnl Code: 004311e8: a7f4ffecbrc
>  15,4311c 0
> [1.200262]004311ec: a7f40001brc
>  15,4311e e
> [1.200262]   #004311f0: a7f40001brc
>  15,4311f 2
> [1.200262]   >004311f4: 0707bcr
>  0,%r7
> [1.200262]004311f6: 0707bcr
>  0,%r7
> [1.200262]004311f8: c004brcl
>  0,4311f8
> [1.200262]004311fe: eb6ff0480024stmg
>  %r6,%r15 ,72(%r15)
> [1.200262]00431204: a7f13f80tmll
>  %r15,162 56
> [1.200275] Call Trace:
> [1.200278] ([<01088020>] 0x1088020)
> [1.200281]  [<00443d54>] blk_queue_split+0x4a4/0x608
> [1.200283]  [<0044ab6c>] blk_mq_make_request+0x7c/0x640
> [1.200286]  [<0043b148>] generic_make_request+0x108/0x2b0
> [1.200288]  [<0043b384>] submit_bio+0x94/0x158
> [1.200290]  [<0034eede>] submit_bh_wbc+0x1b6/0x200
> [1.200292]  [<0034fcc4>] block_read_full_page+0x3d4/0x3f0
> [1.200294]  [<0026bdde>] do_read_cache_page+0x1ae/0x380
> [1.200296]  [<0026bfe0>] read_cache_page+0x30/0x40
> [1.200298]  [<00455490>] read_dev_sector+0x58/0xe8
> [1.200300]  [<00459f4e>] read_lba+0xfe/0x1b0
> [1.200301]  [<0045a5be>] find_valid_gpt+0xe6/0x618
> [1.200303]  [<0045add8>] efi_partition+0x2e8/0x358
> [1.200304]  [<00457720>] check_partition+0x158/0x288
> [1.200306]  [<00455bd8>] rescan_partitions+0xd8/0x3e0
> [1.200307]  [<00450870>] blkdev_reread_part+0x40/0x60
> [1.200312]  [<03ff80165384>] dasd_scan_partitions+0x64/0x140
> [dasd_mod]
> [1.200317]  [<03ff8015fb86>] dasd_change_state+0xb6e/0xc18
> [dasd_mod]
> [1.200322]  [<03ff8015fc78>] do_kick_device+0x48/0x98
> [dasd_mod]
> [1.200325]  [<0015f50e>] process_one_work+0x19e/0x420
> [1.200327]  [<0015f7e2>] worker_thread+0x52/0x458
> [1.200328]  [<00165bb4>] kthread+0x134/0x168
> [1.200333]  [<00716722>] kernel_thread_starter+0x6/0xc
> [1.200335]  [<0071671c>] kernel_thread_starter+0x0/0xc
> [1.200335] Last Breaking-Event-Address:
> [1.200337]  [<004311f0>] bio_split+0xc8/0xd0
> [1.200337]
> [1.200338] Kernel panic - not syncing: Fatal exception:
> panic_on_oops

Turns out one fundamental issue, please test: 

e9092d0d9796 ("Fix subtle macro variable shadowing in min_not_zero()")

and see discussion in the thread:

https://marc.info/?t=15232891724&r=1&w=2

-- 
Ming


Re: [s390x] New regression was found on kernel-4.16

2018-04-09 Thread Ming Lei
Hi Li Wang,

On Mon, Apr 09, 2018 at 06:18:04PM +0800, Li Wang wrote:
> Hi,
> 
> I got this BUG_ON() on s390x platform with kernel-v4.16.0.

Today I saw this bug too, from my first look, seems it is because
that get_max_io_size() returns zero in blk_bio_segment_split().

And I trigger that in one MD queue, and Xiao Ni is looking at it.

Thanks,
Ming