AnandInguva commented on PR #29542: URL: https://github.com/apache/beam/pull/29542#issuecomment-1834199277
I will create a GitHub issue to track this.

> I wonder if performance and pipeline cost would improve if we can find a way to pass through columns that do not need to be processed to tft, converting them to bytes if necessary, avoiding the shuffle step. Performance testing MLTransform should help here.
