dongjoon-hyun commented on a change in pull request #33064:
URL: https://github.com/apache/spark/pull/33064#discussion_r658440661
##########
File path: core/src/main/scala/org/apache/spark/deploy/SparkHadoopUtil.scala
##########
@@ -487,6 +490,18 @@ private[spark] object SparkHadoopUtil extends Logging {
if
(conf.getOption("spark.hadoop.fs.s3a.downgrade.syncable.exceptions").isEmpty) {
hadoopConf.set("fs.s3a.downgrade.syncable.exceptions", "true")
}
+ // In Hadoop 3.3.1, AWS region handling with the default "" endpoint only
works
+ // in EC2 deployments or when the AWS CLI is installed.
+ // The workaround is to set the name of the S3 endpoint explicitly,
+ // if not already set. See HADOOP-17771.
+ // This change is harmless on older versions and compatible with
+ // later Hadoop releases
+ if (hadoopConf.get("fs.s3a.endpoint", "").isEmpty &&
+ hadoopConf.get("fs.s3a.endpoint.region") == null) {
Review comment:
Two more spaces.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]