jinchengchenghh opened a new pull request, #10934: URL: https://github.com/apache/incubator-gluten/pull/10934
The lock in WholeStageResultIterator restrict the cpu task produce batch, move the lock to here can let threads produce first batch with 1 GB in advance. Maybe the threads should prepare more data and let the GPU consume, this depends on the GPU operator time. If the number of batch is too much, prefetch the first batch should have less optimization. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
