Hello Daniel, Thanks for the response! I will try both of these on a test server. It might take a few days but I'll update this thread when I have an update.
Have a great weekend! Brian On Fri, Jan 9, 2015 at 6:32 PM, Daniel Farina <[email protected]> wrote: > On Fri, Jan 9, 2015 at 1:31 PM, <[email protected]> wrote: > > Hello! > > > > First of all, this is my first post to this user group. If I'm in the > wrong > > place please don't hesitate to point me in a different direction. > > You got it right :) > > > Starting around mid-December I've been unable to complete a backup-push. > > After running for an hour or so the server stops responding to network > > requests. The only thing I can do is wait until backup-push finishes and > > then I can ssh back in to the server. > > Maybe it's swamping everything. Try the I/O rate limiting option (see > readme). > > > Once back online I can find the following problems: > > > > dmesg repeats this error: [1107575.808936] xen_netfront: xennet: skb > rides > > the rocket: 19 slots > > Wal-e complains about HTTP 500 when pushing files to S3 (sorry, I don't > have > > a copy of this error handy) > > That's potentially important. Can you make it handy? > > > My server is configured as follows (let me know if more info is helpful): > > > > amazon ec2 i2.4xlarge > > ubuntu 14.04 lts > > postgres 9.3 > > wal-e 7.3 > > database size is ~2.4TB > > > > From what I've been able to find so far there may be a bug in the xennet > > driver that is causing the "rides the rocket" error, see here and here. > > I've tried turning some of the suggested features off with ethtool as > > suggested in the links and it seems to have prevented the "rides to the > > rocket" errors but backup-push still doesn't complete. > > > > I've since used an older backup-push to get another server going for > testing > > and it too has the same problem. > > > > Has anyone else seen this? If so, were you able to resolve it? > > Nope. > > Also, try the current WAL-E master. Compared to 0.7.3, I have > drastically optimized the buffer management. Performance is perhaps > even ten times better, which matters for an instance of your size. > -- You received this message because you are subscribed to the Google Groups "wal-e" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
