[GitHub] [accumulo] keith-turner commented on pull request #3336: fixes #473 avoids scanning entire table metadata for bulk import

via GitHub Mon, 01 May 2023 07:10:24 -0700


keith-turner commented on PR #3336:
URL: https://github.com/apache/accumulo/pull/3336#issuecomment-1529753812


   Looked at bulk import V1 in 2.1 branch and I don't think this optimization 
could apply to bulk V1 because there is no good way to know what files are 
imported to what tablets.  In bulk V1 intermediate tservers do all of the work 
of figuring out what files go where and these tservers know something about the 
metadata.  However the manager does not know enough to limit the clean up 
scans.  
   
   We could make this change in 2.1, but it would only benefit bulk import v2.  
Bulk import V1 would continue to scan the entire table for each bulk import 
when doing cleanup.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [accumulo] keith-turner commented on pull request #3336: fixes #473 avoids scanning entire table metadata for bulk import

Reply via email to