>
> ... but I'm not sure that running different code with --jar and --packages
> is a good idea, even if the differences are only in references to shaded
> classes.

If one works without shading, why can the other not work without shading?


I agreed that it should work for both consistently. That's what PR 1908 is
trying to resolve. Before PR 1908, `--packages` will pull the lib
iceberg-core and its transitive dependencies, which takes a different path
then the `--jar` option, as with the `--jar` option, the Polaris Spark
client only needs the Iceberg-spark-runtime dependency. With PR 1908, we
removed the dependency of iceberg-core, and only depends on
Iceberg-spark-runtime[1], which is consistent with the `--jar` option.

1. https://github.com/apache/polaris/pull/1908/files#r2159414955

Yufei


On Fri, Jun 20, 2025 at 10:08 AM Dmitri Bourlatchkov <di...@apache.org>
wrote:

> I definitely agree that we should resolve this issue for 1.0.
>
> ... but I'm not sure that running different code with --jar and --packages
> is a good idea, even if the differences are only in references to shaded
> classes.
>
> If one works without shading, why can the other not work without shading?
>
> Thanks,
> Dmitri.
>
> On Fri, Jun 20, 2025 at 12:52 PM yun zou <yunzou.colost...@gmail.com>
> wrote:
>
> > As for the following point
> > I believe that regardless of the method of including the Client into
> Spark
> > runtime, the code has to be exactly the same.... and I doubt it is the
> same
> > now. WDYT?
> >
> > The code included in the jar for Spark Client is different now with the
> > change, because it
> > now uses a class in a different package, even though they do the same
> > thing. However,
> > I think it is a good change, it simplifies our dependency and avoids
> > potential compatibility issue
> > due to the shading of iceberg-spark-runtime. I definitely agree we should
> > also include this also in 1.0.
> >
> > Best Regards,
> > Yun
> >
> > On Fri, Jun 20, 2025 at 9:47 AM yun zou <yunzou.colost...@gmail.com>
> > wrote:
> >
> > >
> > > *-- What is the maven artifact that Spark can automatically pull
> > > (via--packages)*
> > >
> > > Our spark client pulls the following:
> > >
> > > org.apache.polaris#polaris-spark-3.5_2.12
> > >
> > > org.apache.polaris#polaris-core
> > >
> > > org.apache.polaris#polaris-api-management-model
> > >
> > > org.apache.iceberg#iceberg-spark-runtime-3.5_2.12
> > >
> > >
> > > Prior to the change, it also pulled iceberg-core and avro 1.20.0.
> > >
> > >
> > > *-- Does that artifact use shaded dependencies*
> > >
> > > Any usage of classes from iceberg-spark-runtime uses the shaded
> libraries
> > > shipped along with the artifacts.
> > >
> > >
> > >
> > > *-- Does that artifact depend on the Iceberg Spark bundle?*
> > >
> > > If you are referring to our spark client, it depends on
> > iceberg-spark-runtime,
> > > not other bundles.
> > >
> > >
> > >
> > > *-- Is the _code_ running in Spark the same when the Polaris Spark
> Client
> > > ispulled via --packages and via --jars?*
> > >
> > >
> > > yes, the jar and package will use the same code, where the jar simply
> > > packs everything
> > >
> > > for the user and there is no need to download any other dependency.
> > >
> > >
> > > Best Regards,
> > >
> > > Yun
> > >
> > >
> > >
> > > On Fri, Jun 20, 2025 at 9:18 AM Dmitri Bourlatchkov <di...@apache.org>
> > > wrote:
> > >
> > >> Some questions for clarification:
> > >>
> > >> * What is the maven artifact that Spark can automatically pull (via
> > >> --packages)?
> > >> * Does that artifact use shaded dependencies?
> > >> * Does that artifact depend on the Iceberg Spark bundle?
> > >> * Is the _code_ running in Spark the same when the Polaris Spark
> Client
> > is
> > >> pulled via --packages and via --jars?
> > >>
> > >> I know I could have figured that out from code, but I'm asking here
> > >> because
> > >> I think we may need to review our approach to publishing these
> > artifacts.
> > >>
> > >> I believe that regardless of the method of including the Client into
> > Spark
> > >> runtime, the code has to be exactly the same.... and I doubt it is the
> > >> same
> > >> now. WDYT?
> > >>
> > >> Thanks,
> > >> Dmitri.
> > >>
> > >>
> > >> On Fri, Jun 20, 2025 at 10:15 AM Dmitri Bourlatchkov <
> di...@apache.org>
> > >> wrote:
> > >>
> > >> > Hi All,
> > >> >
> > >> > Re: PR [1908] let's use this thread to clarify the problems we're
> > trying
> > >> > to solve and options for solutions.
> > >> >
> > >> > As for me, it looks like some refactoring in the way the Spark
> Client
> > is
> > >> > built and published may be needed.
> > >> >
> > >> > I think it makes sense to clarify this before 1.0 to avoid changes
> to
> > >> > Maven coordinates right after 1.0
> > >> >
> > >> > [1908] https://github.com/apache/polaris/pull/1908
> > >> >
> > >> > Thanks,
> > >> > Dmitri.
> > >> >
> > >> >
> > >>
> > >
> >
>

Reply via email to