thomasg19930417 commented on issue #8882:
URL: https://github.com/apache/hudi/issues/8882#issuecomment-1578523112

   
![image](https://github.com/apache/hudi/assets/20243868/cef75b74-0a49-448f-9e8e-661423a16783)
   Why is this implemented here? mapreduce.input.fileinputformat.split.maxsize 
(this unit should be bytes) is usually very large. According to this logic, all 
inputs may end up with only one split. I did a test and it is indeed In this 
way, if the default is 256m, there will only be one map task. When I set this 
parameter to 1, n map tasks will start. Is there any limit to this?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to