[
https://issues.apache.org/jira/browse/HADOOP-15739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16612151#comment-16612151
]
Steve Loughran commented on HADOOP-15739:
-----------------------------------------
mvn dependency analysis isn't 100% reliable as it doesn't notice transitive use
or reflection
I prefer to dump the tree to a file and review by hand, which is what I see you
doing too. That one diffs nicely.
{code}
mvn dependency:tree -Dverbose > target/dependencies.txt
{code}
# given junit is pulling in hamcrest-core, don't worry about it. Hamcrest
library gives you more assertions (like on the state of collections)...its
worth knowing what it offers and picking it up if you ever need them.
# jax-b & jsr311: implies something things its doing XML & REST calls. If
nothing is doing that, fine.
# jackson-databind is part of the jackson set, which does need careful
management, so if it is being used, yes, make it explicit.
regarding hadoop-cloud-storage/pom.xml , it's goal is to be the "lean" edition
of all the cloud store artifacts, one which cuts out hadoop-common and expects
apps to have imported it elsewhere (and excluded stuff they don't want).
See: [https://github.com/apache/spark/blob/master/hadoop-cloud/pom.xml#L219]
for an example, noting that we still have to purge stuff.
The goal I have is that
* if you depend on hadoop-azure direct, you get everything needed for wasb and
abfs to work, including hadoop-common, hadoop-auth and other dependencies.
* if you depend on hadoop-cloud-storage, you get everything wasb and abfs to
work, *excluding hadoop-common/hadoop-auth*, and excluding any other cruft
which has got in
* That someone, some time will produce a hadoop-cloud-storage-shaded which also
shades the store stuff, depends on the other shaded hadoop artifacts, and
selectively uses shaded external JARs. (not the AWS-SDK, even though its
brittle, as that's already a 50MB shaded artifact. For the azure SDKs it'd
depend on how big it was and how traumatic updates proved to be)
For now, regarding those maven warnings, I'd recommend working out where those
dependencies were being used, and make decisions about provided/compile/test
based on those.
> ABFS: remove unused maven dependencies and add used undeclared dependencies
> ---------------------------------------------------------------------------
>
> Key: HADOOP-15739
> URL: https://issues.apache.org/jira/browse/HADOOP-15739
> Project: Hadoop Common
> Issue Type: Sub-task
> Components: build, fs/azure
> Affects Versions: HADOOP-15407
> Reporter: Da Zhou
> Assignee: Da Zhou
> Priority: Major
> Attachments: HADOOP-15739-HADOOP-15407-001.patch
>
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]