kevinrr888 opened a new pull request, #4452:
URL: https://github.com/apache/accumulo/pull/4452

   closes #4422
   
   This fixes external compactions reporting invalid (>100%) progress when the 
compactions include bulk imported files. This occurs because the estimated 
number of entries for bulk imported files is 0, so the expected total that 
progress is measured against is too small. Following Keith's suggestion, 
instead of modifying the bulk import code to compute the estimated entries, I 
modified the Compactor code to compute the estimated number of entries for 
files that report 0 entries. This value is computed by opening the RFile and 
summing the number of entries in each IndexEntry that overlaps the extent of 
the external compaction.
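   
   To illustrate the idea for reviewers, here is a minimal, self-contained 
sketch. It is not the code in this PR and does not use Accumulo's actual 
classes: `IndexBlock`, `estimateEntries`, and `progressPercent` are 
hypothetical names, rows are simplified to plain strings, and I assume each 
index entry records the last row of its block together with that block's entry 
count. A block is counted when its row range overlaps the extent 
`(prevEndRow, endRow]`.

```java
import java.util.List;

// Hypothetical stand-ins for illustration only; these are NOT Accumulo classes.
final class EstimatedEntriesSketch {

  /** One index entry: the last row of a data block and that block's entry count. */
  static final class IndexBlock {
    final String lastRow;
    final long numEntries;

    IndexBlock(String lastRow, long numEntries) {
      this.lastRow = lastRow;
      this.numEntries = numEntries;
    }
  }

  /**
   * Sums the entry counts of blocks whose row range (previous block's last row, lastRow]
   * overlaps the extent (prevEndRow, endRow]. A null bound means unbounded.
   * Assumes blocks are sorted by lastRow.
   */
  static long estimateEntries(List<IndexBlock> blocks, String prevEndRow, String endRow) {
    long total = 0;
    String prevLast = null; // null: the first block has no lower bound
    for (IndexBlock b : blocks) {
      boolean endsAfterExtentStart = prevEndRow == null || b.lastRow.compareTo(prevEndRow) > 0;
      boolean startsBeforeExtentEnd =
          endRow == null || prevLast == null || prevLast.compareTo(endRow) < 0;
      if (endsAfterExtentStart && startsBeforeExtentEnd) {
        total += b.numEntries;
      }
      prevLast = b.lastRow;
    }
    return total;
  }

  /** Progress as a percentage; returns 0 when no estimate is available. */
  static double progressPercent(long entriesRead, long estimatedTotal) {
    return estimatedTotal <= 0 ? 0.0 : 100.0 * entriesRead / estimatedTotal;
  }
}
```

   Because whole blocks are counted even when they only partially overlap the 
extent, the result is an estimate and may overcount slightly, which keeps the 
reported progress at or below 100% rather than above it.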
   
   I also added a test to ExternalCompactionProgressIT to ensure that this fix 
works as intended. It checks that the Compactor and CompactionCoordinator 
report valid progress.
   
   Unfortunately, I was not able to verify that these changes are also 
reflected on the Accumulo Monitor page. Large compactions would only briefly 
show up in the Running Compactions table, always with a progress of 0. I also 
could not use something like a SlowIterator to slow the compaction down (I 
believe that since SlowIterator is part of a test package, it isn't available). 
If anyone is able to verify that these changes are reflected on the Monitor 
page, or can provide guidance on how I can better test this against the Monitor 
page, that would be helpful. The environment I used to run an external 
compaction was jshell with uno.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.