Peter Xu <pet...@redhat.com> writes: > On Thu, Sep 14, 2023 at 12:57:08PM -0300, Fabiano Rosas wrote: >> I managed to reproduce it. It's not the return path error. In hindsight >> that's obvious because that error happens in the 'recovery' test and this >> one in the 'plain' one. Sorry about the noise. > > No worry. It's good to finally identify that. > >> >> This one reproduced with just 4 iterations of preempt/plain. I'll >> investigate.
It seems that we're getting a tcp disconnect (ECONNRESET) on when doing that shutdown() on postcopy_qemufile_src. The one from commit 6621883f93 ("migration: Fix potential race on postcopy_qemufile_src"). I'm trying to determine why that happens when other times it just returns 0 as expected. Could this mean that we're kicking the dest too soon while it is still receiving valid data?