ctubbsii edited a comment on issue #1791: URL: https://github.com/apache/accumulo/issues/1791#issuecomment-763108854
> This tells me that either the assert in this test is making an incorrect assumption about the nature of tablets on a suspended server, or the test is behaving correctly and catching a valid error. Any thoughts would be helpful.

That is a correct interpretation of the reported error. The intent of the suspending feature is to ensure that tablets get reassigned to the same server as before, if the outage is brief enough. One would need to investigate to determine why they are migrating... but here are a few possible scenarios:

* the suspend time elapsed and the tablets are free to migrate before we are able to check (possibly because the tserver took too long to restart or WAL recovery on tablet load is causing the suspend time to elapse before we check the tablet states)
* the tserver fully recovered and then participated in subsequent migrations
* we're splitting tablets and creating new migrations (possibly because splits weren't stabilized before killing the tserver)
* the balancer is misbehaving, and rebalancing when it isn't supposed to
* they migrated before we killed the tserver, but we didn't record it correctly in the test

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]
