OK, let's move forward then

I will prepare the https://github.com/apache/polaris-catalogs-migrator
repository as a transitional step.

> I also support inviting Ajantha as a committer.
That will be a separate discussion with PPMC.

> Still really prefer a separate repository, at least for now. We can always 
> merge later. It also makes the "migration to ASF" easier.
>
> The "Nessie Iceberg-catalog-migrator tool" has been built to support 
> migrations from any catalog to any other catalog. This means, that the test 
> matrix is quite complex and I expect it to become even more complex and time 
> consuming. IMHO there's no need to "bother" Polaris "main CI" for every PR 
> with catalog-migrator tests. I would also expect a different release cycle - 
> no need to tie both together.

Understood, let's use a specific repo for now.

> Better use org.apache.polaris.catalogmigrator as the base group ID.

Yes, that was the intent indeed.

> This requires https://github.com/apache/polaris/pull/785. I'd also prefer to 
> keep the ITs against Nessie and add Polaris.

The itests will probably evolve a lot depending on the new use of
catalogs-migrator.
The itests bring a bunch of dependencies (Nessie, Hive, ...). As we
are talking about test dependencies, that's OK.

>
> We don't know yet what these (and other features) will look like, nor how 
> they'll relate to the donated tool.
>

I'm very enthusiastic about catalogs-migrator: I see a lot of potential :)
For instance, the first "obvious" move is probably that catalogs-migrator
will evolve from a "standalone tool" to a library that we can use in
the Polaris server (federated catalog, etc).

>
> Please coordinate the migration from the source repo with me. We need to 
> clean some things up on the projectnessie Github org side first.
>

Ack, I will ping you :)

For the rest of the community, no objection to starting with
https://github.com/apache/polaris-catalogs-migrator ?

Regards
JB

>
> Robert
>
> On 20.02.25 09:57, Jean-Baptiste Onofré wrote:
> > Hi Dmitri
> >
> > About "evolution plan", I see the catalog migrator tool evolving as a
> > set of beans/providers that will be used in both CLI, and some server
> > features (like federated catalogs or "foreign catalogs").
> > We should not focus too much on catalog migrator as it is today but
> > more how it will be tomorrow.
> >
> > That's why I'm more in favor of preparing the field and donating as a
> > module in the Polaris repo.
> >
> > Regards
> > JB
> >
> > On Thu, Feb 20, 2025 at 6:38 AM Dmitri Bourlatchkov <di...@apache.org> 
> > wrote:
> >> +1 to accept the catalog migrator tool.
> >>
> >> I support inviting Ajantha as a committer.
> >>
> >> As to the source location, I tend to think that a separate repo makes sense
> >> with the current state of the code, but I also agree that the overhead of
> >> that may be too much, given that the codebase is small. I'm fine with
> >> either a separate repo or a new module in the current Polaris repo.
> >>
> >> What is the general plan for the evolution of the migrator tool? Are we
> >> talking about integrating it into Polaris Servers or will it remain a
> >> standalone tool as it is now?
> >>
> >> Thanks,
> >> Dmitri.
> >>
> >> On Wed, Feb 19, 2025 at 11:39 AM Jean-Baptiste Onofré <j...@nanthrax.net>
> >> wrote:
> >>
> >>> Hi folks,
> >>>
> >>> Let me try to sum up this topic.
> >>>
> >>> 1. Catalog Migration landing
> >>> It seems we have a preference to land catalog-migrator as a module on
> >>> the main polaris repo.
> >>> Robert expressed comments about CI, release cycle, dependencies.
> >>>
> >>> My view on that is that the purpose of the catalog-migrator is to
> >>> evolve, and could become a key component for features like federated
> >>> catalogs.
> >>> Due to that, I think we can consider catalog-migrator as a module,
> >>> integrated in the Polaris CLI, or in the Polaris server.
> >>>
> >>> Robert, does it work for you ?
> >>>
> >>> 2. Code/PR prep
> >>> I propose to work directly with Ajantha (main contributor of the
> >>> catalog-migrator) to prepare the code heading to a PR. We need:
> >>> - integrate in Polaris repo and gradle
> >>> - rename all packages to use org.apache.polaris
> >>> - add ASF header in all files
> >>> - refactor cli to use polaris style/naming
> >>> - refactor intTest to use Polaris instead of Nessie
> >>> - check the dependencies in the cli uber jar (hadoop, hive, ...) and
> >>> cleanup LICENSE/NOTICE there
> >>> - update README and cleanup other files
> >>> It should be pretty fast and we should be able to create a PR for
> >>> review/donation.
> >>>
> >>> Thoughts ?
> >>>
> >>> Regards
> >>> JB
> >>>
> >>> On Tue, Feb 11, 2025 at 7:45 PM Jean-Baptiste Onofré <j...@nanthrax.net>
> >>> wrote:
> >>>> Hi folks,
> >>>>
> >>>> As discussed with some of you, we would like to propose donating the
> >>>> Nessie Iceberg Catalog migrator tool
> >>>> (https://github.com/projectnessie/iceberg-catalog-migrator) to Apache
> >>>> Polaris.
> >>>>
> >>>> An SGA has already been signed in case we accept the donation.
> >>>>
> >>>> In terms of donation, I propose the following:
> >>>> - the iceberg-catalog-migration can land in a separate Polaris repo
> >>>> (as it is today for Nessie):
> >>>> https://github.com/apache/polaris-catalog-migration or land as a
> >>>> module in polaris repo directly
> >>>> - we need to prepare the donation by changing the package names, etc
> >>>> - I would suggest inviting one of the main contributors of
> >>>> Catalog Migrator (ajantha-bhat) as a Polaris committer
> >>>>
> >>>> WDYT ? Do we accept the iceberg-catalog-migrator tool in Polaris (we
> >>>> can do a formal vote if we don't have obvious consensus) ?
> >>>>
> >>>> Regards
> >>>> JB
>
> --
> Robert Stupp
> @snazy
>
