LuciferYang commented on pull request #31294:
URL: https://github.com/apache/spark/pull/31294#issuecomment-765818993


   Yes, the `tryDownloadSpark` method also tries to access the Internet, but it 
doesn't seem necessary to change it, for the following reasons:
   
   1.
   
https://github.com/apache/spark/blob/d5d1c84bf44bf64d2c0e118e5c9fa05e2018ddba/sql/hive/src/test/scala/org/apache/spark/sql/hive/HiveExternalCatalogVersionsSuite.scala#L83-L91
   
   will return `None` because the Internet can't be accessed.
   
   2. `sites` will be 
`"https://archive.apache.org/dist" :+ PROCESS_TABLES.releaseMirror`, so only 
one more download failure is printed because the Internet can't be accessed:
   
   ```
   19:50:35.123 WARN org.apache.spark.sql.hive.HiveExternalCatalogVersionsSuite: Failed to download Spark 3.0.1 from https://archive.apache.org/dist/spark/spark-3.0.1/spark-3.0.1-bin-hadoop3.2.tgz: Network is unreachable (connect failed)
   ```
   
   The test then downloads the Spark release package from 
`PROCESS_TABLES.releaseMirror` (`SPARK_RELEASE_MIRROR`) without accessing the 
Internet, so the Internet access attempted in `tryDownloadSpark` does not 
affect the test results.
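   
   The fallback order described above can be sketched roughly as follows. 
This is only an illustration, not the code in 
`HiveExternalCatalogVersionsSuite`: `DownloadFallbackSketch`, `lookupMirror`, 
`candidateSites`, `firstWorkingSite`, and the `http://mirror.internal/dist` 
URL are hypothetical names, and the network calls are replaced by plain 
functions so the sketch runs offline.
   
   ```scala
   import scala.util.Try

   object DownloadFallbackSketch {
     // Hypothetical stand-in for the mirror lookup: any failure becomes None.
     def lookupMirror(fetch: () => String): Option[String] =
       Try(fetch()).toOption

     // Candidate sites in the order the comment describes: discovered mirrors
     // (none when offline), then the Apache archive, then the release mirror.
     def candidateSites(fetch: () => String, releaseMirror: String): Seq[String] = {
       val mirrors = (0 until 2).flatMap(_ => lookupMirror(fetch))
       mirrors.distinct :+ "https://archive.apache.org/dist" :+ releaseMirror
     }

     // Tries each site in order; returns the first one where `download` succeeds.
     def firstWorkingSite(sites: Seq[String], download: String => Boolean): Option[String] =
       sites.find(site => Try(download(site)).getOrElse(false))

     def main(args: Array[String]): Unit = {
       // Simulates a machine without Internet access (assumption for the demo).
       val offline: () => String =
         () => throw new java.io.IOException("Network is unreachable")
       val sites = candidateSites(offline, "http://mirror.internal/dist")
       // Only the internal mirror is reachable in this simulated setup.
       val used = firstWorkingSite(sites, _.startsWith("http://mirror.internal"))
       println(s"sites = $sites, used = $used")
     }
   }
   ```
   
   With the lookup failing, only the archive URL and the release mirror 
remain in the list, so a single failed attempt against the archive precedes 
the download from the mirror.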
   
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


