On Mon, Jan 18, 2016 at 08:30:21PM +0100, Mike Belopuhov wrote:
> On Mon, Jan 18, 2016 at 20:25 +0100, Mike Belopuhov wrote:
> > On Sun, Jan 17, 2016 at 20:46 -0800, Jonathon Sisson wrote:
> > > Hi,
> > > 
> > > First off, thank you for OpenBSD in general, and thank you specifically
> > > for the PV drivers on OpenBSD =)  The day of migrating workloads to AWS
> > > gets ever closer for me, and I appreciate everything the OpenBSD dev
> > > team does.
> > > 
> > > I've found what appears to be a repeatable crash that results in this:
> > > 
> > > panic: xnf0: save vs spell: 214
> > > 
> > > Stopped at      Debugger+0x9:   leave
> > >    TID    PID    UID     PRFLAGS     PFLAGS  CPU  COMMAND
> > >  14532   9243      0         0x3  0x4000000    1  python2.7
> > > * 7215   9243      0         0x3  0x4000000    0  python2.7
> > > 
> > > Debugger() at Debugger+0x9
> > > panic() at panic+0xfe
> > > xnf_encap() at xnf_encap+0x1a9
> > > xnf_start() at xnf_start+0x7f
> > > ifq_serialize() at ifq_serialize+0xd9
> > > if_enqueue() at if_enqueue+0x71
> > > ether_output() at ether_output+0x166
> > > ip_output() at ip_output+0x6d3
> > > tcp_output() at tcp_output+0x87e
> > > tcp_usrreq() at tcp_usrreq+0x3fc
> > > sosend() at sosend+0x3d8
> > > dofilewritev() at dofilewritev+0x205
> > > sys_write() at sys_write+0x89
> > > syscall() at syscall+0x368
> > > --- syscall (number 4) ---
> > > end of kernel
> > > end trace frame: 0x9a8c96a2800, count: 1
> > > 0x9a91790279a:
> > > --db_more--
> > > 
> > > I'm unable to run further commands at the console, as AWS does not
> > > provide console.
> > > 
> > > I'm using this test machine to build CURRENT and upload it to an s3
> > > bucket that I've been using for STABLE builds.  The python code is 
> > > the awscli installed via py-pip running on Python 2.7.11.  The precise 
> > > command is:
> > > 
> > > aws s3 sync /usr/rel/ s3://$AWS_BUCKET_NAME/path/
> > > 
> > > If there is any further testing I can provide, I am more than happy
> > > to provide any details you need.
> > > 
> > > -Jonathon
> > > 
> > 
> > Can you please try the diff below on top of a -current kernel
> > (I've pushed some additional Xen fixes just now).
> > 
> > You should be able to copy the kernel into the AWS instance.
> > 
> > My math wasn't correct here and txeof would unload a chain before
> > we would've processed all descriptors/fragments.
> > 
> 
> A slight amendment to the diff (forgot one chunk).
> 
Updated sources, applied the patch and recompiled kernel.

I can't reproduce now with the same upload commands.

Even with a bit heavier testing I've been unable to get the instance
to panic again.

Thanks!  I really appreciate the PV drivers =)

-Jonathon

Reply via email to