kennknowles opened a new issue, #18439: URL: https://github.com/apache/beam/issues/18439
Reading compressed text files with textio.ReadFromText() is about five times slower than the corresponding transform in the Java SDK. This is especially noticeable when using DirectRunner. We should look into how to improve its efficiency so that the Python SDK is more comparable to the Java SDK in this regard.

Imported from Jira [BEAM-2531](https://issues.apache.org/jira/browse/BEAM-2531). Original Jira may contain additional context. Reported by: chamikara.
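For anyone who wants to measure this locally, the following is a minimal timing sketch, not a benchmark from the original report. It assumes a synthetic gzip file and a read-and-count pipeline; the helper names (`write_gzip_lines`, `count_lines_stdlib`, `time_beam_read`) and the line count are hypothetical and chosen only for illustration. It compares against a plain `gzip` read in Python, not against the Java SDK.

```python
import gzip
import os
import tempfile
import time


def write_gzip_lines(path, num_lines):
    """Write num_lines short text lines to a gzip file (synthetic data)."""
    with gzip.open(path, "wt", encoding="utf-8") as f:
        for i in range(num_lines):
            f.write(f"line {i}\n")


def count_lines_stdlib(path):
    """Baseline: count lines using only the standard library gzip reader."""
    with gzip.open(path, "rt", encoding="utf-8") as f:
        return sum(1 for _ in f)


def time_beam_read(path):
    """Time a DirectRunner pipeline that only reads and counts the lines."""
    # Imported lazily so the stdlib helpers work without Beam installed.
    import apache_beam as beam
    from apache_beam.io.filesystem import CompressionTypes

    start = time.perf_counter()
    with beam.Pipeline(runner="DirectRunner") as p:
        _ = (
            p
            | beam.io.ReadFromText(path, compression_type=CompressionTypes.GZIP)
            | beam.combiners.Count.Globally()
        )
    # The pipeline has finished running once the with-block exits.
    return time.perf_counter() - start


if __name__ == "__main__":
    tmp_dir = tempfile.mkdtemp()
    gz_path = os.path.join(tmp_dir, "sample.txt.gz")
    write_gzip_lines(gz_path, 1_000_000)  # size is an arbitrary assumption

    start = time.perf_counter()
    count_lines_stdlib(gz_path)
    print(f"stdlib gzip read: {time.perf_counter() - start:.2f}s")
    print(f"Beam ReadFromText: {time_beam_read(gz_path):.2f}s")
```

The Beam timing includes pipeline construction and runner startup, so a larger input gives a more meaningful ratio than a small one.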
