pgaref commented on a change in pull request #1823:
URL: https://github.com/apache/hive/pull/1823#discussion_r567807669



##########
File path: common/src/java/org/apache/hadoop/hive/conf/HiveConf.java
##########
@@ -4509,7 +4509,7 @@ private static void populateLlapDaemonVarsSet(Set<String> 
llapDaemonVarsSetLocal
         "Minimum allocation possible from LLAP buddy allocator. Allocations 
below that are\n" +
         "padded to minimum allocation. For ORC, should generally be the same 
as the expected\n" +
         "compression buffer size, or next lowest power of 2. Must be a power 
of 2."),
-    LLAP_ALLOCATOR_MAX_ALLOC("hive.llap.io.allocator.alloc.max", "16Mb", new 
SizeValidator(),
+    LLAP_ALLOCATOR_MAX_ALLOC("hive.llap.io.allocator.alloc.max", "4Mb", new 
SizeValidator(),

Review comment:
       The issue here is that LLAP_ALLOCATOR_MAX_ALLOC is also used as the ORC 
Writer buffer size (thus the change).
   
   Initial buffer size check was introduced in 
[ORC-238](https://github.com/apache/orc/pull/171/files) even though it was only 
applied when buffer size was enforced from table properties. Later, on ORC-1.6 
this was enforced for the [Writer buffer size in 
general](https://github.com/apache/orc/blob/0128f817b0ab28fa2d0660737234ac966f0f5c50/java/core/src/java/org/apache/orc/impl/WriterImpl.java#L171).
   
   The max bufferSize argument can be up to 2^(3*8 - 1) -- meaning less than 
8Mb and since we enforce the size to be power of 2 the next available is 4Mb.
   
   The main question here is if there is a reason for the upper limit to be < 8 
Mb (cc @prasanthj that might know more here) -- or if we should decouple the 
two configuration (LLAP alloc and ORC Writer buffer size).
   
   I believe the best thing to do for now is open a new Ticket to track this 
(as this will either require more work on LLAP, or a new release on ORC) -- and 
I do not expect this to cause any major issues until then. @mustafaiman what do 
you think?




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to