Yes. I think we have agreed that we will make sure things are described
clearly in both the spec and website for
the critical fields added.

We are currently trying to get a webpage out for the Generic Table support
in Polaris.

Best Regards,
Yun

On Wed, Jun 11, 2025 at 3:09 PM Dmitri Bourlatchkov <di...@apache.org>
wrote:

> As for the evolution, I do think it is a good strage to evolve step by
> step, instead of trying to standardize
> everything in one shot.
>
>
> This approach makes sense to me, but we need to be explicit about it in the
> spec.
>
> Cheers,
> Dmitri.
>
> On Wed, Jun 11, 2025 at 5:45 PM yun zou <yunzou.colost...@gmail.com>
> wrote:
>
> > > I mean a doc page similar to [1] that explains what Generic Tables are,
> > how
> > to use them in Spark, how to use them is some other query engine, and
> most
> > importantly the planned evolution for the Generic Tables API and
> > specification.
> >
> > Yes, we can definitely add a webpage to describe the current guarantee of
> > generic table
> > support, and we can mention that it is currently a beta version. I am
> > currently working on this.
> >
> >
> > As for the evolution, I do think it is a good strage to evolve step by
> > step, instead of trying to standardize
> > everything in one shot.
> > As we have mentioned during design discussion, standardization of some
> > fields across different
> > engines and different formats are very challenging, such as schema where
> > different engines support different
> > data types.  So we will need more thoughts when adding those fields, the
> > base location is just one of the easy fields.
> >
> >
> > Best Regards,
> > Yun
> >
> >
> >
> >
> > On Wed, Jun 11, 2025 at 2:17 PM Dmitri Bourlatchkov <di...@apache.org>
> > wrote:
> >
> > > > Can you explain what is a proper plain English spec for this feature?
> > >
> > > I mean a doc page similar to [1] that explains what Generic Tables are,
> > how
> > > to use them in Spark, how to use them is some other query engine, and
> > most
> > > importantly the planned evolution for the Generic Tables API and
> > > specification.
> > >
> > > IMHO, given this discussion thread, we can only offer a "beta" in 1.0.
> > > Meaning the spec and API are subject to change without backward
> > > compatibility guarantees.
> > >
> > > [1] https://polaris.apache.org/in-dev/unreleased/policy/
> > >
> > > Cheers,
> > > Dmitri.
> > >
> > > On Wed, Jun 11, 2025 at 4:46 PM Yufei Gu <flyrain...@gmail.com> wrote:
> > >
> > > > There are solid use cases for adding generic-table support with the
> > Spark
> > > > plugin:
> > > >
> > > >    - Single Catalog, Many Formats – Keep Delta, CSV, Parquet (and
> > future
> > > >    formats) side-by-side in one place instead of juggling separate
> > > > catalogs.
> > > >    - Seamless Migrations – Let teams move data from one format to
> > another
> > > >    without breaking queries or governance workflows.
> > > >
> > > > Happy to brainstorm more improvements and next steps!
> > > >
> > > > Now that  [1543] is merged and adds some concrete specialization to
> > > Generic
> > > > > Tables API, I believe it is even more important to make a proper
> > plain
> > > > > English spec for this feature before 1.0.
> > > >
> > > > We've cut the branch for 1.0 release already, and PR 1543 won't be a
> > part
> > > > of 1.0 release. Can you explain what is a proper plain
> > > > English spec for this feature? I am glad to review it if you propose
> > one.
> > > >
> > > >
> > > > Yufei
> > > >
> > > >
> > > > On Wed, Jun 11, 2025 at 11:53 AM Dmitri Bourlatchkov <
> di...@apache.org
> > >
> > > > wrote:
> > > >
> > > > > Thanks, Laurent, for bringing up spec "readiness" and, I guess, by
> > > > > extension backward compatibility concerns.
> > > > >
> > > > > Regardless of how deep current spec is in Polaris, I believe it is
> > > > > important to have it written down as an artifact in the Polaris
> > repo. I
> > > > > know we had a design doc at some point, but the project is defined
> by
> > > > what
> > > > > is in the repository, plus discussion docs can quickly get out of
> > sync
> > > > with
> > > > > actual code. I believe I raised this point before.
> > > > >
> > > > > The API change merged under [1543] is not sufficient to inform
> users
> > of
> > > > > Polaris about the Generic Tables feature. I tend to regard comments
> > in
> > > > Open
> > > > > API yaml files as similar to javadoc. They are good for developers
> > > > working
> > > > > with that specific aspect of the system, but do not provide a
> > holistic
> > > > > view.
> > > > >
> > > > > Now that  [1543] is merged and adds some concrete specialization to
> > > > Generic
> > > > > Tables API, I believe it is even more important to make a proper
> > plain
> > > > > English spec for this feature before 1.0.
> > > > >
> > > > > [1543] https://github.com/apache/polaris/pull/1543
> > > > >
> > > > > Cheers,
> > > > > Dmitri.
> > > > >
> > > > > On Wed, Jun 11, 2025 at 10:56 AM Laurent Goujon
> > > > <laur...@dremio.com.invalid
> > > > > >
> > > > > wrote:
> > > > >
> > > > > > What I was trying to say is that i'm sure there's plenty of value
> > for
> > > > > > spark, but in it's current state the value is little from a
> Polaris
> > > > point
> > > > > > of view as an open catalog service?
> > > > > >
> > > > > > Of course we can follow-up on that but is the current spec still
> > > > > considered
> > > > > > wip or when 1.0 will be released, we would have to keep
> supporting
> > it
> > > > > even
> > > > > > if we come up with something more comprehensive?
> > > > > >
> > > > > > On Wed, Jun 11, 2025, 00:22 Eric Maynard <
> eric.w.mayn...@gmail.com
> > >
> > > > > wrote:
> > > > > >
> > > > > > > > I don't think there's a lot of value where the specification
> > of a
> > > > > table
> > > > > > > format is left to the client
> > > > > > > Considering that you currently can use non-Iceberg tables in
> > > Polaris
> > > > > with
> > > > > > > the Spark client and it works end-to-end, I'd have a hard time
> > > > agreeing
> > > > > > > that there is no value.
> > > > > > >
> > > > > > > But I think this discussion is maybe best moved to another
> > thread.
> > > > The
> > > > > > > incremental change to add a location may make sense for the
> > > existing
> > > > > > > generic table implementation, even if later we reach a
> consensus
> > to
> > > > rip
> > > > > > it
> > > > > > > out and replace it with something more "comprehensive".
> > > > > > >
> > > > > > > --EM
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Reply via email to