On 19 January 2016 at 17:02, Mike Belopuhov <[email protected]> wrote:
> On 18 January 2016 at 23:20, Jonathon Sisson <[email protected]> wrote:
>> On Mon, Jan 18, 2016 at 08:30:21PM +0100, Mike Belopuhov wrote:
>>> On Mon, Jan 18, 2016 at 20:25 +0100, Mike Belopuhov wrote:
>>> > On Sun, Jan 17, 2016 at 20:46 -0800, Jonathon Sisson wrote:
>>> > > Hi,
>>> > >
>>> > > First off, thank you for OpenBSD in general, and thank you specifically
>>> > > for the PV drivers on OpenBSD =)  The day of migrating workloads to AWS
>>> > > gets ever closer for me, and I appreciate everything the OpenBSD dev
>>> > > team does.
>>> > >
>>> > > I've found what appears to be a repeatable crash that results in this:
>>> > >
>>> > > panic: xnf0: save vs spell: 214
>>> > >
>>> > > Stopped at      Debugger+0x9:   leave
>>> > >    TID    PID    UID     PRFLAGS     PFLAGS  CPU  COMMAND
>>> > >  14532   9243      0         0x3  0x4000000    1  python2.7
>>> > > * 7215   9243      0         0x3  0x4000000    0  python2.7
>>> > >
>>> > > Debugger() at Debugger+0x9
>>> > > panic() at panic+0xfe
>>> > > xnf_encap() at xnf_encap+0x1a9
>>> > > xnf_start() at xnf_start+0x7f
>>> > > ifq_serialize() at ifq_serialize+0xd9
>>> > > if_enqueue() at if_enqueue+0x71
>>> > > ether_output() at ether_output+0x166
>>> > > ip_output() at ip_output+0x6d3
>>> > > tcp_output() at tcp_output+0x87e
>>> > > tcp_usrreq() at tcp_usrreq+0x3fc
>>> > > sosend() at sosend+0x3d8
>>> > > dofilewritev() at dofilewritev+0x205
>>> > > sys_write() at sys_write+0x89
>>> > > syscall() at syscall+0x368
>>> > > --- syscall (number 4) ---
>>> > > end of kernel
>>> > > end trace frame: 0x9a8c96a2800, count: 1
>>> > > 0x9a91790279a:
>>> > > --db_more--
>>> > >
>>> > > I'm unable to run further commands at the console, as AWS does not
>>> > > provide console.
>>> > >
>>> > > I'm using this test machine to build CURRENT and upload it to an s3
>>> > > bucket that I've been using for STABLE builds.  The python code is
>>> > > the awscli installed via py-pip running on Python 2.7.11.  The precise
>>> > > command is:
>>> > >
>>> > > aws s3 sync /usr/rel/ s3://$AWS_BUCKET_NAME/path/
>>> > >
>>> > > If there is any further testing I can provide, I am more than happy
>>> > > to provide any details you need.
>>> > >
>>> > > -Jonathon
>>> > >
>>> >
>>> > Can you please try the diff below on top of a -current kernel
>>> > (I've pushed some additional Xen fixes just now).
>>> >
>>> > You should be able to copy the kernel into the AWS instance.
>>> >
>>> > My math wasn't correct here and txeof would unload a chain before
>>> > we would've processed all descriptors/fragments.
>>> >
>>>
>>> A slight amendment to the diff (forgot one chunk).
>>>
>>>
>> Of course, 2 minutes after I sent that last email I get this:
>>
>> panic: xnf0: save vs spell: 129
>>
>> Stopped at      Debugger+0x9:   leave
>>    TID    PID    UID     PRFLAGS     PFLAGS  CPU  COMMAND
>> *13456  16606      0         0x3  0x4000000    1  python2.7
>>  10910  10910      0     0x14000      0x210    0  softnet
>>
>> Debugger() at Debugger+0x9
>> panic() at panic+0xfe
>> xnf_encap() at xnf_encap+0x1b7
>> xnf_start() at xnf_start+0x7f
>> ifq_serialize() at ifq_serialize+0xd9
>> if_enqueue() at if_enqueue+0x71
>> ether_output() at ether_output+0x166
>> ip_output() at ip_output+0x6d3
>> tcp_output() at tcp_output+0x87e
>> tcp_usrreq() at tcp_usrreq+0x3fc
>> sosend() at sosend+0x3d8
>> dofilewritev() at dofilewritev+0x205
>> sys_write() at sys_write+0x89
>> syscall() at syscall+0x368
>> --- syscall (number 4) ---
>> end of kernel
>> end trace frame: 0x2f508906d00, count: 1
>> 0x2f4e69f079a:
>> --db_more--
>>
>> This time took considerably longer, as multiple uploads were successful
>> prior to the panic.  This time I had to download install58.iso and then
>> upload it to a test bucket to get it to panic.
>>
>> My apologies for the false confirmation.
>>
>> -Jonathon
>
> That's OK.  Thank you for taking your time to test it.
> I can reproduce the problem with tcpbench as well and
> hopefully will have a solution soon.

OK, this should be fixed in current.  Please update and test :-)

Reply via email to