** Tags added: verification-needed
** Tags added: tpp
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with incorrect Scatter Gather Buffer
Status in
** Description changed:
-
+ [Impact]
We have observed that OS is sending the IO's (SCSI Read/Write) with incorrect
Scatter Gather Buffer address .
i.e
- OS is sending the IO with 64 bit Scatter Gather Buffer address , such
+ OS is sending the IO with 64 bit Scatter Gather Buffer
How to Recreate
1. Install Ubuntu 18.04.01 on a separate storage disk.
2. Create VDs on H840/H740 using 9 physical disks
a. 16 sliced RAID5; 10 GB each
b. 16 sliced RAID1; 10 GB each
c. 16 sliced RAID6; 10 GB each
3. Install JAVA
4. Download and copy vdbench to system
a.
Broadcom submitted the fix upstream:
https://marc.info/?l=linux-scsi=154514161019901=2
https://marc.info/?l=linux-scsi=154514161319908=2
https://marc.info/?l=linux-scsi=154514161619911=2
and the commit IDs of these patches from upstream:
894169db1246 scsi: megaraid_sas: Use 63-bit DMA addressing
63 bit masking will cause double buffering of data ?
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with incorrect Scatter Gather Buffer
Status in linux
The 63bit dma_mask was just a test to understand the adress-wraparound by the
fw. Nothing specific.
PERC didn't go offline. Always ended up with OOM btw.
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
Just would like to understand more about the system behavior when we
change the DMA Mask from 64 bit to 63 bit
do we expect any double buffering ?
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
@sasikumarpc -
We observed this scenario where the PERC goes offline:
sgl_ptr->Address = f000
sgl_ptr->Length = 1000
We do not see a roll-over here..
Kernel is indicating to read 0x1000 bytes starting from address
0xf000 i.e till 0x which very well aligns
Yes , It seems that bug is still not fixed
FYI - We also observed page fault in 4.18 kernel
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with incorrect
With the 4.18.0-8-generic kernel, I see AMD IOMMU IO_PAGE_FAULTs for invalid
addresses
AMD-Vi: Event logged [IO_PAGE_FAULT device=e1:00.0 domain=0x
address=0xfffd14c0 flags=0x0030]
PERC controller is still present btw, but the vdbench stress crashed due
to this page faults.
**
I see the disks going offline during stress in mainline
4.19.0-041900rc7-generic as well.
Although the symptoms look different in 4.15 and 4.18 kernels, I feel
the bug is not fixed; address being accessed is still out-of-range in
this 16GB server.
--
You received this bug notification because
With latest 4.15 LTS kernels, I see the vdbench process crash and the
PERC controller and the related disks go offline..
** Attachment added: "4.15 - vdbench - disks go offline"
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1795453/+attachment/5204195/+files/4.15-stress_fails.txt
--
Yes below 4.18 kernel (which is 4.15) fixes only with increased memory
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with incorrect Scatter Gather Buffer
@sasikumarpc - Ok. So when you say below v4.18 kernel fixes it, does it
mean with increased memory or is it with the same setup where the issue
was originally seen?
Issue resolved Kernel Version
-
4.18.0-8-generic
--
You received this bug notification because you are
> Also another version (other than default version of Ubuntu 18.04.01)
of 4.15 kernel fixes the issue
Can you share this kernel version as well?
Actually adding additional memory to the system which has
4.15.0-34-generic kernel fixes the issue , not the different version of
4.15 kernel fixes the
apport information
** Tags added: apport-collected uec-images
** Description changed:
We have observed that OS is sending the IO's (SCSI Read/Write) with incorrect
Scatter Gather Buffer address .
i.e
OS is sending the IO with 64 bit Scatter Gather Buffer address , such
that
> Also another version (other than default version of Ubuntu 18.04.01)
of 4.15 kernel fixes the issue
Can you share this kernel version as well?
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
Issue reported Kernel Version
-
4.15.0-34-generic
Issue resolved Kernel Version
-
4.18.0-8-generic
Linux user 4.18.0-8-generic #9~18.04.1-Ubuntu SMP Mon Sep 17
12:49:27 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
It looks
sasikumarpc -
Can you share the specific kernel versions that you are using, where the issue
is fixed?
Can you also share the logs of pass & failure cases?
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
It will be good if you could provide us the exact commit (Patch) which
fixes the issue
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with incorrect
Also another version (other than default version of Ubuntu 18.04.01) of
4.15 kernel fixes the issue
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with
Could you also test the latest upstream 4.15 stable kernel? It is available
from:
http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.15.18/
We should be able to perform a reverse bisect to identify the commits in
4.18 that resolve this bug.
** Changed in: linux (Ubuntu)
Importance: Undecided
Looks like the issue is not seen with 4.18 kernel , please let me know
still Ubuntu requires the log with 4.15 kernel ?
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
we are in the process of collecting logs
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1795453
Title:
IO's are issued with incorrect Scatter Gather Buffer
Status in linux package in
moving to kernel itself rather than the hwe package
** Package changed: linux-signed-hwe (Ubuntu) => linux (Ubuntu)
** Changed in: linux (Ubuntu)
Status: Confirmed => New
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in
25 matches
Mail list logo