szlta opened a new pull request #2750:
URL: https://github.com/apache/hive/pull/2750


   Since HiveIcebergInputFormat doesn't implement any of the LLAP marker 
interfaces, cache affinity is never attempted, so any split containing ORC 
file parts may go to an arbitrary LLAP daemon, lowering the cache hit ratio later.
   
   So we should:
   
   - let HS2 know that cache affinity is required for this input format
   - prevent Iceberg from grouping separate files together into one combined 
     split when executing on LLAP
   - provide a proper getPath() result for HiveIcebergSplit, so that 
     HostAffinitySplitLocationProvider calculates different hashes for 
     different files (right now getPath() returns only the table location)
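
   To illustrate why the getPath() result matters, here is a minimal sketch 
of hash-based placement. It is not the actual HostAffinitySplitLocationProvider 
code: the class AffinitySketch, its helper methods, the daemon names and the 
file paths are all hypothetical, and String.hashCode() with a modulo stands in 
for whatever hashing the real provider uses. It only shows that if every split 
reports the same path (the table location), every split maps to the same 
daemon, whereas per-file paths give the hash something to distinguish.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Illustrative only: a simplified stand-in for hash-based split placement.
// The real provider's hash function and inputs are not reproduced here.
class AffinitySketch {

    // Hypothetical helper: map a split's path to one daemon by hashing the path.
    static String pickDaemon(String splitPath, List<String> daemons) {
        int index = Math.floorMod(splitPath.hashCode(), daemons.size());
        return daemons.get(index);
    }

    // Count how many distinct daemons a list of split paths would be spread over.
    static int distinctDaemons(List<String> splitPaths, List<String> daemons) {
        Set<String> chosen = new HashSet<>();
        for (String path : splitPaths) {
            chosen.add(pickDaemon(path, daemons));
        }
        return chosen.size();
    }

    public static void main(String[] args) {
        // Hypothetical daemon names and paths, for demonstration only.
        List<String> daemons = Arrays.asList("llap-0", "llap-1", "llap-2", "llap-3");

        // Every split reports the table location: all splits share one hash.
        List<String> tableLocationOnly = Arrays.asList(
            "/warehouse/db/tbl", "/warehouse/db/tbl", "/warehouse/db/tbl");

        // Every split reports its own data file: hashes can differ per file.
        List<String> perFile = Arrays.asList(
            "/warehouse/db/tbl/data/file-a.orc",
            "/warehouse/db/tbl/data/file-b.orc",
            "/warehouse/db/tbl/data/file-c.orc");

        System.out.println("table location only: "
            + distinctDaemons(tableLocationOnly, daemons) + " daemon(s)");
        System.out.println("per-file paths: "
            + distinctDaemons(perFile, daemons) + " daemon(s)");
    }
}
```

   With identical paths the first count is always 1, regardless of how many 
daemons exist. With per-file paths the same file still maps to the same daemon 
on every run, which is what lets the cache be reused.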


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: gitbox-unsubscr...@hive.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


