keith-turner commented on a change in pull request #1614:
URL: https://github.com/apache/accumulo/pull/1614#discussion_r428104619



##########
File path: core/src/main/java/org/apache/accumulo/core/clientImpl/bulk/BulkImport.java
##########
@@ -149,6 +150,11 @@ public void load()
       if (mappings.isEmpty())
         throw new IllegalArgumentException("Attempted to import zero files from " + srcPath);
 
+      long tabletMaxSize = conf.getCount(Property.MASTER_BULK_MAX_TABLETS);
+      if (tabletMaxSize > 0 && mappings.keySet().size() > tabletMaxSize)

Review comment:
       I just looked at #1559 and saw the sentence quoted below. I would be in favor of preventing a single file from going to too many tablets. If someone has a large number of tablets and each file goes to only a few of them, they are probably OK with that, like you said.
   
   > It would be nice if we could establish a threshold in the bulk import process to abort when encountering a rfile that maps to more than a specified number of extents.
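
To make the per-file idea concrete, here is a minimal sketch of what such a check could look like. It is not the code in this PR: the class `PerFileTabletLimit`, both method names, and the use of plain strings for tablet extents and file names are hypothetical stand-ins for the real mapping types in `BulkImport`. It also assumes each file is listed at most once per tablet.

```java
import java.util.Collection;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: limits how many tablets a single file may map to,
// rather than limiting the total number of tablets in the import.
final class PerFileTabletLimit {

  // Input: tablet extent -> files destined for that tablet (simplified to strings).
  // Output: file -> number of tablets it maps to.
  // Assumes a file appears at most once in each tablet's collection.
  public static Map<String,Integer>
      countTabletsPerFile(Map<String,? extends Collection<String>> mappings) {
    Map<String,Integer> counts = new HashMap<>();
    for (Collection<String> files : mappings.values()) {
      for (String file : files) {
        counts.merge(file, 1, Integer::sum);
      }
    }
    return counts;
  }

  // Throws if any single file maps to more than maxTabletsPerFile tablets.
  // A limit of zero or less disables the check, mirroring the "> 0" guard in the diff.
  public static void checkPerFileLimit(Map<String,? extends Collection<String>> mappings,
      long maxTabletsPerFile) {
    if (maxTabletsPerFile <= 0) {
      return;
    }
    for (Map.Entry<String,Integer> e : countTabletsPerFile(mappings).entrySet()) {
      if (e.getValue() > maxTabletsPerFile) {
        throw new IllegalArgumentException("File " + e.getKey() + " maps to " + e.getValue()
            + " tablets, which exceeds the limit of " + maxTabletsPerFile);
      }
    }
  }
}
```

With a check shaped like this, an import touching thousands of tablets would still pass as long as each file lands in only a few of them, while a single file spanning too many tablets would abort the import.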



