alerman opened a new pull request, #3140: URL: https://github.com/apache/accumulo/pull/3140
This MR adds an option to optionally skip trash for Import files only. On our clusters, we find that we have to skip trash due to pressure on the namenodes and the scale that we deal with. We have seen a few instances of files going missing. We are still in the process of investigating (we thought it would be fixed by #2792 but was not.). When this happens because of our use case we can typically track down the data that contributed to Import files. However, data that has been compacted by one or more tablet servers is more difficult to tack down. By enabling this new option, we will have time to catch this issue and recover the data before the trash is emptied while we determine the cause of the files going missing. Intention is to backport this to 1.10 once the approach is agreed upon -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
