alerman opened a new pull request, #3140:
URL: https://github.com/apache/accumulo/pull/3140

   This MR adds an option to optionally skip trash for Import files only.
   On our clusters, we find that we have to skip trash due to pressure on the 
namenodes and the scale that we deal with.
   
   We have seen a few instances of files going missing. We are still in the 
process of investigating (we thought it would be fixed by #2792 but was not.). 
When this happens because of our use case we can typically track down the data 
that contributed to Import files. However, data that has been compacted by one 
or more tablet servers is more difficult to tack down. By enabling this new 
option, we will have time to catch this issue and recover the data before the 
trash is emptied while we determine the cause of the files going missing.
   
   Intention is to backport this to 1.10 once the approach is agreed upon


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to