Matt Peterson created ACCUMULO-4657:
---------------------------------------

             Summary: BulkImport Performance Bottleneck
                 Key: ACCUMULO-4657
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4657
             Project: Accumulo
          Issue Type: Improvement
            Reporter: Matt Peterson
            Priority: Minor


Logging every "loaded" entry in the table is excessive, especially for 
tables with multiple simultaneous bulk imports and multiple references to the 
same file, and it can cause performance problems.  Even when the log level was 
reduced, there was still blocking within log4j.  Checking the log level once, 
outside the loop, and logging the entries only at trace level improves bulk 
import performance for such usages.
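
The idea can be illustrated with a minimal sketch. This is not the actual 
Accumulo patch: the class name, method name, and file names below are 
hypothetical, and java.util.logging (with Level.FINEST standing in for log4j's 
trace level) is used only so the example runs without extra dependencies.

```java
import java.util.Arrays;
import java.util.List;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical illustration; not code from the Accumulo source tree.
public class LoadedEntryLogging {

    /**
     * Walks the "loaded" entries and returns how many log calls were made.
     * The level check happens once, before the loop, so a disabled trace
     * level costs a single boolean test per entry and never touches the
     * logging framework inside the loop.
     */
    static int processLoadedEntries(Logger log, List<String> loadedEntries) {
        // Assumption: FINEST plays the role of log4j's TRACE level here.
        final boolean traceEnabled = log.isLoggable(Level.FINEST);
        int logCalls = 0;
        for (String entry : loadedEntries) {
            // ... the real per-entry bulk import work would go here ...
            if (traceEnabled) {
                log.finest("loaded entry: " + entry);
                logCalls++;
            }
        }
        return logCalls;
    }

    public static void main(String[] args) {
        Logger log = Logger.getLogger(LoadedEntryLogging.class.getName());
        // Hypothetical file names; the same file is referenced twice.
        List<String> entries = Arrays.asList("file1.rf", "file2.rf", "file1.rf");

        log.setLevel(Level.INFO);   // trace disabled: no log calls in the loop
        System.out.println("INFO level log calls: " + processLoadedEntries(log, entries));

        log.setLevel(Level.FINEST); // trace enabled: one log call per entry
        System.out.println("FINEST level log calls: " + processLoadedEntries(log, entries));
    }
}
```

One caveat of hoisting the check: a level change made while the loop is 
running is not picked up until the next call, which seems an acceptable 
trade-off for a short-lived loop.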



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)