Re: [I] [SUPPORT] OOM errors while creating a table using Bulk Insert operation [hudi]

via GitHub Wed, 23 Oct 2024 16:59:02 -0700


dataproblems commented on issue #12116:
URL: https://github.com/apache/hudi/issues/12116#issuecomment-2433045490


   @ad1happy2go - Since this job is creating the table, only a single commit 
has been requested. Both the commit.requested and commit.inflight objects are 
0 B in size, and the .commit file is never written because the job fails 
before all of the data is written. 
   
   On our end, the Spark job merely reads from S3 and writes the data back in 
Hudi format. We perform no operations that would collect the dataset on the 
driver, so I would defer to you on that front. The failure usually occurs in 
the mapToPair operation in the HoodieJavaRDD file or in the save operation, as 
seen in the previous screenshots. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
