ctubbsii commented on issue #1791:
URL: https://github.com/apache/accumulo/issues/1791#issuecomment-763108854


   > This tells me that either the assert in this test is making an incorrect 
assumption about the nature of tablets on a suspended server, or the test is 
behaving correctly and catching a valid error. Any thoughts would be helpful.
   
   That is a correct interpretation of the reported error. The intent of the 
tablet suspension feature is that, if a tserver's outage is brief enough, its 
tablets get reassigned to the same server they were on before. Someone would 
need to investigate why they are migrating instead, but here are a few 
possible scenarios:
   
   * the suspend time elapsed, leaving the tablets free to migrate, before we 
were able to check the tablet states (possibly because the tserver took too 
long to restart, or because WAL recovery on tablet load took long enough for 
the suspend time to run out)
   * the tserver fully recovered and then participated in subsequent migrations
   * we're splitting tablets and creating new migrations (possibly because 
splits weren't stabilized before killing the tserver)
   * the balancer is misbehaving, and rebalancing when it isn't supposed to
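   
   To make the first scenario concrete, here is a minimal sketch of the 
intended decision, as I understand it. This is not Accumulo's actual code: the 
class, enum, and method names are hypothetical, and the real logic lives in 
the manager's assignment handling. It only shows why a slow restart or slow 
recovery can push a tablet past the suspend window before the test checks it.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical illustration only; none of these names exist in Accumulo.
class SuspensionSketch {

    enum Decision { PREVIOUS_SERVER, WAIT, FREE_TO_MIGRATE }

    // Decide what should happen to a suspended tablet at time `now`.
    // `maxSuspend` stands in for the configured suspend time (an assumption
    // about how the setting is applied, not a copy of the real check).
    static Decision decide(Instant suspendedAt, Instant now,
                           Duration maxSuspend, boolean previousServerOnline) {
        Duration elapsed = Duration.between(suspendedAt, now);
        if (elapsed.compareTo(maxSuspend) >= 0) {
            // Suspend time elapsed: the tablet may be assigned anywhere.
            return Decision.FREE_TO_MIGRATE;
        }
        // Still inside the window: only the previous server is acceptable.
        return previousServerOnline ? Decision.PREVIOUS_SERVER : Decision.WAIT;
    }

    public static void main(String[] args) {
        Instant t0 = Instant.parse("2021-01-01T00:00:00Z");
        Duration max = Duration.ofSeconds(30);
        // Quick restart: the server is back inside the window.
        System.out.println(decide(t0, t0.plusSeconds(5), max, true));
        // Slow restart or long WAL recovery: the window has already closed.
        System.out.println(decide(t0, t0.plusSeconds(45), max, true));
    }
}
```

   If the test only asserts "same server as before" without accounting for 
the second case, it could fail even when the feature behaves as designed.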
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

