[I] Add backpressure option for bulk import [accumulo]

via GitHub Wed, 30 Oct 2024 08:46:21 -0700


ddanielr opened a new issue, #5023:
URL: https://github.com/apache/accumulo/issues/5023


   **Is your feature request related to a problem? Please describe.**
   Bulk import will continue to push files into tablets to a point where user 
scan performance can degrade. 
   A user can monitor the number of queued compactions before submitting new 
bulk import operations. 
   However, this approach blocks all bulk import operations even if the data 
was going to tablets which would not degrade scan performance. 
   
   **Describe the solution you'd like**
   A bulk import limit threshold property based off of the `table.max.file` 
property value should be added. 
   This would allow bulk import to continue importing into tablets that are not 
exceeding this new property value and either wait indefinitely or block the 
fate on those specific tablets. 
   
   The bulk import operation should then export the tablets information in 
order for the user or  a separate process can take that input and schedule 
higher priority compaction jobs to unblock the bulk import operation. 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[I] Add backpressure option for bulk import [accumulo]

Reply via email to