keith-turner commented on PR #3336: URL: https://github.com/apache/accumulo/pull/3336#issuecomment-1529753812
Looked at bulk import V1 in 2.1 branch and I don't think this optimization could apply to bulk V1 because there is no good way to know what files are imported to what tablets. In bulk V1 intermediate tservers do all of the work of figuring out what files go where and these tservers know something about the metadata. However the manager does not know enough to limit the clean up scans. We could make this change in 2.1, but it would only benefit bulk import v2. Bulk import V1 would continue to scan the entire table for each bulk import when doing cleanup. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
