shunping commented on issue #31040: URL: https://github.com/apache/beam/issues/31040#issuecomment-2569771603
Notice that when `content-type=application/gzip` and `content-encoding=gzip`, GCS considers the file is doubly compressed(https://cloud.google.com/storage/docs/transcoding#gzip-gzip), which is against the actual content we store here (aka. a gzipped text file). Therefore, an exception is thrown there. For any `UnicodeDecodeError`, most of them are due to the fact that users specifies "UNCOMPRESSED" on a gzip file. We can let it be this way, or we can give a better and more informative error if "content-type" and/or "content-encoding" does not match the specified "compression type". @kennknowles WDYT? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
