Re: [PROPOSAL][VOTE] Removing `build-chain`, the custom Jenkins framework, and structuring the codebase in a duo-repo setup for 10.2 onwards

Francisco Javier Tirado Sarti Tue, 18 Feb 2025 02:50:09 -0800

Hi Tiago
Thanks for the proposal, which I think is one that deserves careful
consideration and some discussion before getting voted, but I can
anticipate it is workable from my perspective, so I'm not going to oppose
it as I did with the previous one (which is public I considered not
feasible)
Since the root cause behind all these discussion is the mono/multi repo
question, let me share my thought about it
When Kogito was initially designed ( I was not here, but doing some
historical inference ;))  we were in the era of "Microservice for
everything" (a reflection of Netflix success on using this kind of
architecture) or, in other terms, "Kill the monolith" (a reflection on how
science computer guys tend to adopt fashion in a black-white way)
Although the repo structure of a project does not necessarily resemble the
architecture view, it tends to do so, and Kogito was no exception...
partially.
The runtimes microservice has its own repo (kogito-runtimes), but it was
kind of split (for historical reasons) into one additional repo, containing
drools stuff (which acts as a library for runtimes)
At the same time, we have the kogito-app repo, which depends on runtime
(and hence in drools too)  that  holds several microservices: data-index,
data-audit, job-services... and trusty (which I feel should be removed
eventually). If we were being strict about mapping one microservice to one
repo, every one of them will be in their own repo (with another repo
containing the common libraries shared by all of them)
In a sense, the backend repositories kind of follow the philosophy you are
now proposing: one repo for the core microservice (runtimes), another repo
for the rest of the stuff (apps) and drools (because of historical reason)
with a repo of its own. Over that, there were some additional repos: kie
docs (which is actually divided into at least two repos holding
documentation ;)) and kie examples. And of course, the repo you are more
familiar with, the tools one.
The current situation is good (although not optimal) for a developer
willing to contribute to a particular area. Currently, drools guys just
work on drools (and sometime take a look at runtimes it something gets
broken), runtimes guys like me hardly might  look into drools repo and
eventually contribute is some areas of apps (in may case, data-index), so
the stuff that we currently check out in our local machines is reasonable.
That won't really change if we have a backend repo containing everything (I
guess most of us have checked out drools, runtimes and apps in our local
envs) .  However, I feel we lost the proper separation of concerns which,
in any case, more than in the repo structure depends on us, which are
supposed  to be smart guys aware of the different responsibilities of the
different components, not mixing them in the code (Though, this not always
happened, for example with the Quarkus and springboot stuff)
After this quick historical survey, since it is clear that we are
architecturally tending to a monolith (for example, we are promoting
embedded data index and data index in same db than runtimes), it is logical
that you tend also to coalesce repos (which, again, are already partially
coalesced) into two: backend (which you called Java) and frontend (which
you called tools) and I do not think there are strong arguments to prevent
that move (it won't be my preferred option, I think we should have one repo
for common stuff to all engines and one repo per engine, but this is the
architect inside me talking ;))
 However, I still feel that docs and examples should be kept in separate
repos.
Why keep them separate? Because I feel there might be a lot of guys out
there that will be happy just reading the docs and testing the examples,
without having to check out the backend or frontend in their computers.
Which is true for users is also true for contributors. It is true that most
of the contribution to docs and examples is done by the developers that
actually wrote frontend and backend code, but it is also true that usually
there are (or should be)  "doc specialists" that just work on examples
readmes and in docs.
However, this example/docs thing is a minor concern (it would be an extra
step to the proposal, once you have kie and extras in place, it should not
be that hard to set up an example repo that depends on the other two) .
Anyway, as a summary of this long e-mail, I think having two repos storing
the source code is a consistent approach.
+0




On Tue, Feb 18, 2025 at 12:22 AM Tiago Bento <tiagobe...@apache.org> wrote:

> Hi everyone,
>
> As a final attempt to shift the focus of the Apache KIE community away
> from build systems and codebase structure and to allow us to move
> forward with a good release cadence while fostering innovation in the
> Open-Source Business Automation space, Alex and I wrote what we hope
> to be the last infrastructure-related proposal from us (for quite a
> while, at least).
>
> We sincerely believe that, if approved, this proposal will put Apache
> KIE in a very advantageous place to welcome new contributors,
> encourage current contributors to do their best work yet and multiply
> the reach of our technology to even more users. We hope everyone
> reading this feels heard as we try to incorporate all the feedback we
> gathered since our first day inside Apache.
>
> We opted to go for a [VOTE] directly to avoid risking an endless
> conversation again. While we understand this is not the preferred
> approach for contributing, this subject is still sensitive enough for
> us to have chosen this method, hopefully for the last time.
>
> Tiago and Alex.
>
> #
> # Problem scope
>
> Recently during the many threads that were raised in this mailing
> list, it became clear that there’s a general concern about the
> direction of the codebase structure and a continued feedback against
> the monorepo-ization of the Apache KIE codebase, which seems to be
> stealing our energy and making it impossible to focus on what we all
> do best: features and innovation.
>
> It’s also clear that the current infrastructure hasn’t been helpful.
> The level of complexity necessary to maintain the current repositories
> is holding us back from a more streamlined approach to PR checks and
> CI and, most importantly, to our releases, preventing us from
> maintaining a good cadence.
>
> This proposal brings a compromise to the table, between the current
> poly-repo and a monorepo: a duo-repo, with one repo named `kie`,
> hosting all Java runtime code, and another repo named `kie-extras`,
> hosting everything else (including Docs, Examples, Websites, Container
> images, Editors, Web apps, Quakus Dev UIs, Jenkins release jobs etc).
>
> The duo-repo-ization of the Apache KIE codebase will allow us to
> dramatically simplify the automations we have for PR checks, CI and
> release jobs as well as the operation for daily contributions,
> stopping the dependence we have on SNAPSHOTs, for example. The
> ultimate goal is to replace the complicated `build-chain`-based PR
> checks and the custom Jenkins framework we have for CI/releases.
>
> #
> # What is not in scope
>
>   - Cross-repo PRs; as it leads to systems similar to `build-chain`
> and the custom Jenkins framework. We want to avoid going in that
> direction again.
>   - Sparse-checkouts mechanism for `kie`; Due to the complexity of
> setting up something like this, and the other mechanisms that exist to
> mitigate the increase in size of the repo, this won’t be pursued
> initially.
>   - A complete review of all repositories under `apache/incubator-kie-*`
>
> #
> # The proposal
>
> Changes to repositories:
>   - `drools` is:
>       - Renamed to `kie`, keeping the most-starred repository we have on
> GitHub.
>       - Updated to include `optaplanner`, `kogito-runtimes` and
> `kogito-apps`; preserving Git history.
>           - New top-level modules will be created for each repository,
> making things simpler at the beginning.
>           - Top-level modules prefixed with `drools-` go to a new
> folder called `drools`, and top-level modules prefixed with `kie-` go
> to a new folder called `kie`.
>   - `kie-tools` is:
>       - Renamed to `kie-extras`; reflecting its multi-purposed nature
> of hosting a variety of different package types
>   - `optaplanner`, `kogito-runtimes`, and `kogito-apps` are:
>       - Updated with READMEs/descriptions pointing to the new `kie` repo
>       - Archived by Apache INFRA once `kie` is fully operational
>   - `kogito-pipelines` is:
>       - Updated with a disclaimer on its README saying it’s no longer used.
>       - Archived by Apache INFRA once `kie` is fully operational
>
> Changes to automations:
>   - Jenkins is almost completely reset, except for a few jobs
> necessary for the Release Procedure [1].
>   - Timestamped SNAPSHOTs stop being published every Sunday.
>   - Regular SNAPSHOTs stop being published every day.
>   - `apache.snapshots` repository is cleaned up so that no SNAPSHOTs
> are available anymore.
>   - PRs sent to `kie` will be checked using a new GitHub Actions
> hosted inside it and WON’T build `kie-extras`
>   - PRs sent to `kie-extras` will be checked as they already are, with
> a new step at the beginning that will build (or fetch from a cached
> GitHub Actions artifact) `kie` at a versioned commit SHA.
>   - A new weekly bot is created to update the versioned `kie` SHA at
> `kie-extras` via PR. Contributors to `kie-extras` start their week
> taking a look at it and working together to get that merged ASAP.
>   - A new release job stored in `kie-extras` is created to build and
> publish `kie` on a Release Candidate.
>
> Communication:
>   - Apache KIE’s website and documentation are updated to reflect the
> new duo-repo structure.
>
> ### Impact on operation and contributions
>
>   - Contributors to `optaplanner`, `kogito-runtimes` and `kogito-apps`
> will move to `kie`
>   - PR checks will take the same time (or less, if we’re able to
> capitalize more on parallel builds, given every module is going to be
> part of the same build in Maven’s reactor).
>   - Unified Maven dependency management, less surface to manage
> external dependencies.
>   - Contributions to `kie-extras` that depend on `kie` will continue
> being bi-phased, the same way they are today with the timestamped
> SNAPSHOTs.
>
> ### Notes on SNAPSHOTs
>
>   - According to Apache’s guidelines on unreleased artifacts, we
> should be very conscious about our SNAPSHOT usage in regards to
> encouraging users to consume them or even to depend on them. See
> https://infra.apache.org/release-distribution.html#unreleased
>   - Users will always have access to the latest release and will be
> encouraged to step on the development itself to access the bleeding
> edge (Building 999-SNAPSHOT locally from `kie` @ `main`).
>   - Given the discrepancy between `kie` and `kie-extras`, to make CI
> simpler and not delve into cross-repo PR systems, it would be
> confusing to have some SNAPSHOTs published from `kie-extras` that are
> not aligned with SNAPSHOTs from `kie`.
>   - Given the simplified structure of the codebase, publishing
> SNAPSHOTs would encourage usage that’s either not aligned with Apache
> or with our strategy of not having cross-repo PRs. We can revisit that
> in the future if we find a justification for such. This proposal
> doesn’t want to get in the way of any future work, just establish a
> comprehensive baseline for us to evolve on top of.
>
> #
> # Actionable items (plan)
>
> Phase 1 (New structure)
>   - A new GitHub Actions workflow is created on `drools` for PR checks
> (without building any other repos)
>   - `optaplanner` is incorporated as a top-level folder on `drools`,
> preserving Git history
>   - `optaplanner`’s README is updated, pointing to `drools` as its new
> home. Contributions to it are frozen.
>   - `kogito-runtimes` is incorporated as a top-level folder on
> `drools`, preserving Git history
>   - `kogito-runtimes`’s README is updated, pointing to `drools` as its
> new home. Contributions to it are frozen.
>   - `kogito-apps` is incorporated as a top-level folder on `drools`,
> preserving Git history
>   - `kogito-apps`’s README is updated, pointing to `drools` as its new
> home. Contributions to it are frozen.
>   - `drools` is renamed to `kie` (GitHub redirects will make sure
> users accessing/cloning `drools` automatically go to `kie`)
>   - `kie`’s README is updated to reflect its new top-level packages.
>   - `kie-tools` is renamed to `kie-extras`.
>   - `kie-tools` has its reference to `kie` updated to a versioned
> commit SHA, at the same time as its PR checks and CI are updated to
> build `kie` prior to building itself.
>
> Phase 2 (Cleanup)
>   - `build-chain`-based GitHub Actions are deleted from all repositories.
>   - The custom Jenkins framework jobs are removed from Jenkins
>   - Update Documentation and Websites to reflect new codebase structure
>
> Phase 3 (Release)
>   - A new release job is created on Jenkins (hosted on `kie-extras`)
> to release Maven modules from `kie`.
>   - The Release Procedure for 10.2 is updated to accommodate the new
> codebase structure
>
> Phase 4 (Archive Repos)
>   - kogito-pipelines, optaplanner, kogito-runtimes, and kogito-apps
> are archived by INFRA
>
> #
> # Commitment and rough timeline
>
> Alex and Tiago are fully committed to executing this proposal with a
> start date right after 10.1.0 is fully released (approved by IPMC and
> published to all external registries).
>
> The rough estimate to execute is about 4 weeks; with [HEADS UP]s
> coming to the mailing list whenever a change approaches.
>
> 10.2.0 will be the first release already with the new structure, where
> the new release jobs will be validated.
>
>
> [1]
> https://docs.google.com/document/d/15V-1e62bmmWeOdFUnSJHymPYoeb0TTwp0bhxPuJn4zo
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@kie.apache.org
> For additional commands, e-mail: dev-h...@kie.apache.org
>
>

Re: [PROPOSAL][VOTE] Removing `build-chain`, the custom Jenkins framework, and structuring the codebase in a duo-repo setup for 10.2 onwards

Reply via email to