Andrew Wong created KUDU-2136:
---------------------------------

             Summary: Add a "crashed"/"failed" mode to tablets
                 Key: KUDU-2136
                 URL: https://issues.apache.org/jira/browse/KUDU-2136
             Project: Kudu
          Issue Type: Improvement
            Reporter: Andrew Wong


There are a number of errors that currently crash Kudu tablet servers (e.g. 
disk failure). In the push to keep tablet servers alive in spite of these 
failures, the affected tablets should no longer service any type of request. 
Writes should not proceed, scans should be bounced to another tablet server, 
flushes and compactions should exit early, etc. The tablet should act as though 
it were deleted, with the exception that its data is not yet deleted, for the 
sake of durability in case it is the last remaining replica.

This mode may be need to be entered from a myriad of tablet operations: from 
corruptions in reading a cfile, from disk failures when flushing to disk (while 
locks may be held), etc.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to