Github user HyukjinKwon commented on the issue:
https://github.com/apache/spark/pull/23014
> The reason is that each bucket file is too big
Can you elaborate please? Is it because we don't chunk each file into
multiple splits when we read bucketed table?--- --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
