Hi!

This is a discussion about altering the way we handle dependencies and
shading in Flink.
I ran into quite a view problems trying to adjust / fix some shading issues
during release validation.

The issue is tracked under: https://issues.apache.org/jira/browse/FLINK-6529
Bring this discussion thread up because it is a bigger issue

*Problem*

Currently, Flink shades dependencies like ASM and Guava into all jars of
projects that reference it and relocate the classes.

There are some drawbacks to that approach, let's discuss them at the
example of ASM:

  - The ASM classes are for example in flink-core, flink-java, flink-scala,
flink-runtime, etc.

  - Users that reference these dependencies have the classes multiple times
in the classpath. That is unclean (works, through, because the classes are
identical). The same happens when building the final dist. jar.

  - Some of these dependencies require to include license files in the
shaded jar. It is hard to impossible to build a good automatic solution for
that, partly due to Maven's very poor cross-project path support

  - Most importantly: Scala does not support shading really well. Scala
classes have references to classes in more places than just the class names
(apparently for Scala reflect support). Referencing a Scala project with
shaded ASM still requires to add a reference to unshaded ASM (at least as a
compile dependency).

*Proposal*

I propose that we build and deploy a asm-flink-shaded version of ASM and
directly program against the relocated namespaces. Since we never use
classes that we relocate in public interfaces, Flink users will never see
the relocated class names. Internally, it does not hurt to use them.

  - Proper maven dependency management, no hidden (shaded) dependencies

  - One copy of each class for shaded dependencies

  - Proper Scala interoperability

  - Natural License management (license is part of deployed
asm-flink-shaded jar)


Happy to hear thoughts!

Stephan

Reply via email to