No, by design, FATE operations wouldn't be removed on shutdown as it would leave the system in an inconsistent state.

Think back to the common problem we're trying to solve with FATE: what happens to a partial table creation during unexpected shutdown. We the FATE runner comes back up (the master), it needs to be able to rollback to the last known "good" state of the FATE op, and then proceed. The FATE op _should_ just proceed on its own after a restart.

However, it sounds like you might have run into a bug with FATE which is causing this situation to happen. Have you done any more digging into why these operations were "stuck" and timing out? If you are confident that these are orphaned and the system isn't properly managing them, you would have to kill them yourself.

Jeff Kubina wrote:
I had three fates in the accumulo fate queue that had no tasks assigned
to them and could not be deleted or failed. All create/delete/clone etc
commands were timing out. I did a hard shutdown of all services. I had
planned to do "accumulo org.apache.accumulo.server.fate.Admin kill
<txid>" but that command is deprecated and replaced with the fate shell
command. Since the fate shell command failed does a hard shutdown ensure
that all the fates in zookeeper are removed or do I need to delete them
manually?

Reply via email to