[ 
https://issues.apache.org/jira/browse/HADOOP-18786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17846044#comment-17846044
 ] 

ASF GitHub Bot commented on HADOOP-18786:
-----------------------------------------

ctubbsii commented on PR #5789:
URL: https://github.com/apache/hadoop/pull/5789#issuecomment-2108580048

   > The main thing I want to be sure is from this build, what gets into the 
distro? only stuff from the maven repo, right? that is: this PR MUST NOT force 
updates in the binaries we ship.
   
   I don't quite understand the question. The premise seems to be that the 
current build is only grabbing stuff from the Maven repo. However, that's not 
true currently, and that's part of the problem. The build currently grabs stuff 
from the archives, and not just from the Maven repo. Those are the URLs that 
this PR changes... to use the ASF CDN instead of the archives. The only change 
that might affect the distro is a couple of tools do not have that version 
available in the CDN anymore, so a version bump was necessary to be able to 
grab it from the CDN instead of from the archives. However, I don't know if 
those affect the binaries in the distro either, or if those are only used as 
unshipped build tools. But even if it does change the binaries in some way, the 
current situation of automatically going to the ASF archives cannot continue... 
it makes offline builds very hard, and the download of things from the archives 
causes frequent builds to trigger automated bans of ASF services, because the 
archives aren't meant to be used this way (for routine builds).




> Hadoop build depends on archives.apache.org
> -------------------------------------------
>
>                 Key: HADOOP-18786
>                 URL: https://issues.apache.org/jira/browse/HADOOP-18786
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: build
>    Affects Versions: 3.3.6
>            Reporter: Christopher Tubbs
>            Priority: Critical
>              Labels: pull-request-available
>
> Several times throughout Hadoop's source, the ASF archive is referenced, 
> including part of the build that downloads Yetus.
> Building a release from source should not require access to the ASF archives, 
> as that contributes to end users being subject to throttling and blocking by 
> INFRA, for "abuse" of the archives, even though they are merely building a 
> current ASF release from source. This is particularly problematic for 
> downstream packagers who must build from Hadoop's source, or for CI/CD 
> situations that depend on Hadoop's source, and particularly problematic for 
> those end users behind a NAT gateway, because even if Hadoop's use of the 
> archive is modest, it adds up for multiple users.
> The build should be modified, so that it does not require access to fixed 
> versions in the archives (or should work with the upstream of those dependent 
> projects to publish their releases elsewhere, for routine consumptions). In 
> the interim, the source could be updated to point to the current dependency 
> versions available on downloads.apache.org.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to