FelixYBW commented on issue #5718: URL: https://github.com/apache/incubator-gluten/issues/5718#issuecomment-2254871073
The root cause is large batch size of R2C after parquet scan, when row size is too large like too many columns are scanned, or one column has very large complex datatype. The solution is to decrease the batch size. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
