andrewglowacki opened a new issue #2331:
URL: https://github.com/apache/accumulo/issues/2331


   **Problem**
   The first step in flushing mutations from the Batch Writer to the Tablet 
Servers is binning the mutations by destination tablet server. If a tablet 
server throws an OutOfMemoryError, or any other Throwable that is not an 
Exception, the error is rethrown on the client side. The bin thread does not 
catch it, so the thread dies without reporting that an error occurred. 
This is unrecoverable and leaves the batch writer stuck waiting for a flush of 
mutations that will never complete.
   
   **Affected versions**
   1.10.x, main
   
   **To Reproduce**
   Not easily reproducible.
   One could write a rogue iterator that throws an OutOfMemoryError when the 
metadata table is being scanned.
   
   **Bug Location**
   
https://github.com/apache/accumulo/blob/5f20e38628abd6385a5ee652634f4d748ecd013f/core/src/main/java/org/apache/accumulo/core/clientImpl/TabletServerBatchWriter.java#L706
   
   this:
   `catch (Exception e) {`
   should be changed to:
   `catch (Throwable e) {`
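
   To illustrate why the narrower catch misses this case, here is a minimal, self-contained sketch. It is not the Accumulo code: `BinThreadSketch`, `binMutations`, and `runWorker` are hypothetical names, and the OutOfMemoryError is simulated. It only shows that `catch (Exception e)` lets an `Error` escape a worker thread unrecorded, while `catch (Throwable e)` captures it so it can be reported.

```java
import java.util.concurrent.atomic.AtomicReference;

public class BinThreadSketch {

    // Hypothetical stand-in for the binning step. It simulates a failure
    // that is a Throwable but not an Exception.
    static void binMutations() {
        throw new OutOfMemoryError("simulated");
    }

    // Runs the task on a worker thread and returns the failure the worker
    // recorded, or null if the worker died without recording anything.
    static Throwable runWorker(boolean catchThrowable) {
        AtomicReference<Throwable> recorded = new AtomicReference<>();
        Thread worker = new Thread(() -> {
            if (catchThrowable) {
                try {
                    binMutations();
                } catch (Throwable t) {
                    // Errors are caught here, so the failure can be reported.
                    recorded.set(t);
                }
            } else {
                try {
                    binMutations();
                } catch (Exception e) {
                    // Never reached for an Error: it propagates out of run().
                    recorded.set(e);
                }
            }
        });
        // Silence the default stack trace printed for uncaught throwables.
        worker.setUncaughtExceptionHandler((t, e) -> { });
        worker.start();
        try {
            worker.join();
        } catch (InterruptedException ie) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(ie);
        }
        return recorded.get();
    }

    public static void main(String[] args) {
        System.out.println("catch (Exception): recorded = " + runWorker(false));
        System.out.println("catch (Throwable): recorded = " + runWorker(true));
    }
}
```

   With the narrower catch the worker records nothing, which corresponds to the silent thread death described above. With `catch (Throwable e)` the error is available to be passed back to whoever is waiting on the flush.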
   


