avijayanhwx opened a new pull request #540: Hdds 2914 master
URL: https://github.com/apache/hadoop-ozone/pull/540
 
 
   ## What changes were proposed in this pull request?
   While running the TPCDS hive benchmark, certain queries are failing with the 
following error.
   
   > Caused by: java.io.IOException: File 
o3fs://hive.warehouse.fqdn:9862/warehouse/tablespace/managed/hive/100/inventory/delta_0000001_0000001_0000/bucket_00000
 should have had overlap on block starting at 0
   
   The problem was the wrong usage of the offset field while populating Ozone's 
block location class. The offset field in BlockLocation refers to the block 
offset in an OzoneKey and not the byte offset inside the datanode file. In our 
case, the latter is always 0, and when we use 0 in the Block Locations, the 
hive ORC file splitter overwrites the blocks at '0' when there are more than 1 
blocks. 
   
   ## What is the link to the Apache JIRA
   https://issues.apache.org/jira/browse/HDDS-2914
   
   ## How was this patch tested?
   Manually tested by running the failed TPCDS queries again.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: ozone-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: ozone-issues-h...@hadoop.apache.org

Reply via email to