jiayuasu opened a new pull request, #2528:
URL: https://github.com/apache/sedona/pull/2528

   ## Did you read the Contributor Guide?
   
   - Yes, I have read the [Contributor 
Rules](https://sedona.apache.org/latest/community/rule/) and [Contributor 
Development Guide](https://sedona.apache.org/latest/community/develop/)
   
   ## Is this PR related to a ticket?
   
   - Yes, and the PR name follows the format `[GH-XXX] my subject`. Closes 
#2489 
   
   
   ## What changes were proposed in this PR?
   
   This pull request improves the reliability and user experience of 
downloading large dependencies in the Docker installation scripts for Spark, 
Zeppelin, Sedona, and related tools. The main changes introduce a helper 
function to show throttled progress updates during downloads, switch some 
downloads to faster mirrors with checksum verification, and enhance error 
handling and feedback for users.
   
   **Enhanced download feedback and reliability:**
   
   * Added a `download_with_progress` helper function to 
`docker/install-spark.sh` and `docker/install-zeppelin.sh` that downloads files 
in the background and prints progress updates every 5 seconds, including final 
size and error messages. This improves user visibility into long downloads and 
makes troubleshooting easier. 
[[1]](diffhunk://#diff-750c969ccb006f6f5c4553cad3487d236f71976672d55a4c6e34e26c82a691f4R27-R89)
 
[[2]](diffhunk://#diff-4cbc7691181a9715b9e7685b9ad33fc81b0f99bcae7a17a8fa79172c6ccef626L26-R69)
   * Replaced direct `curl` calls with the new progress-enabled function for 
downloading Spark, Zeppelin, Hadoop AWS, and AWS SDK jars, ensuring consistent 
feedback and error handling across installations. 
[[1]](diffhunk://#diff-750c969ccb006f6f5c4553cad3487d236f71976672d55a4c6e34e26c82a691f4R27-R89)
 
[[2]](diffhunk://#diff-4cbc7691181a9715b9e7685b9ad33fc81b0f99bcae7a17a8fa79172c6ccef626L26-R69)
   
   **Performance and integrity improvements:**
   
   * Changed Spark download in `docker/install-spark.sh` to use the Lyra 
Hosting mirror for faster downloads, while verifying the archive's integrity 
using a SHA-512 checksum from the official Apache archive. This ensures both 
speed and security.
   
   **User experience improvements:**
   
   * Added progress bar and error display options to `curl` commands for 
Sedona, gresearch spark extension, and GeoTools jar downloads in 
`docker/install-sedona.sh`, providing clearer feedback during installations.
   
   ## How was this patch tested?
   
   Passed local build
   
   ## Did this PR include necessary documentation updates?
   
   - No, this PR does not affect any public API so no need to change the 
documentation.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to