Tsz-wo Sze created HADOOP-19206:
-----------------------------------
Summary: Hadoop release contains a 530MB bundle-2.23.19.jar
Key: HADOOP-19206
URL: https://issues.apache.org/jira/browse/HADOOP-19206
Project: Hadoop Common
Issue Type: Improvement
Components: build
Reporter: Tsz-wo Sze
The Hadoop binary release (v3.4.0) is 1.7 GB in size.
{code:java}
hadoop-3.4.0$du -h -d 1 .
2.0M ./bin
260K ./libexec
72K ./include
212K ./sbin
184K ./etc
232K ./licenses-binary
316M ./lib
1.4G ./share
1.7G .
{code}
A large part of that is bundle-2.23.19.jar (530 MB), which is the [AWS Java SDK :: Bundle|https://mvnrepository.com/artifact/software.amazon.awssdk/bundle/2.23.19]:
{code:java}
hadoop-3.4.0$ls -lh share/hadoop/tools/lib/bundle-2.23.19.jar
-rw-r--r--@ 1 szetszwo staff 530M Mar 4 15:41 share/hadoop/tools/lib/bundle-2.23.19.jar
{code}
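To see which other jars contribute most to the release size, something like the following could be run from the unpacked release directory. This is only a rough sketch: the helper name largest_files, the ".jar" suffix filter, and the "share/hadoop" starting directory are illustrative assumptions, not part of the Hadoop build or any existing tooling.

```python
import heapq
import os


def largest_files(root, n=10, suffix=".jar"):
    """Return up to n (size_in_bytes, path) pairs under root, largest first.

    Hypothetical helper for illustration only.
    """
    sizes = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if not name.endswith(suffix):
                continue
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue  # skip symlinks so a target is not counted twice
            sizes.append((os.path.getsize(path), path))
    # nlargest returns the entries sorted from largest to smallest
    return heapq.nlargest(n, sizes)


if __name__ == "__main__":
    # Assumed starting directory, relative to the unpacked release.
    for size, path in largest_files("share/hadoop"):
        print(f"{size / 2**20:8.1f}M  {path}")
```

The sizes reported here are file lengths in bytes, so they can differ slightly from the disk usage that du reports.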
We should revisit whether such a large jar really needs to be included in the
release.