Hi, Just a quick update that the PR to add Cloudberry support in MADlib is now complete:
https://github.com/apache/madlib/pull/627 The implementation is ready, and I would really appreciate it if MADlib community members could help review it. Any feedback, suggestions, or approvals would be very welcome. Thanks a lot for your time and support! Best, Dianjin Wang On Tue, Oct 21, 2025 at 1:03 AM Ed Espino <[email protected]> wrote: > > Hi Dianjin, Max, and all, > > I believe bringing the Apache Cloudberry (Incubating) MADlib work into the > open by contributing directly to Apache MADlib is an excellent step toward > reviving the project. The project would very much welcome Pull Requests > introducing Cloudberry support in MADlib. Tentatively, we can plan for this > work to be included in an upcoming *Apache MADlib 2.2.0* release. > > I recommend basing the work on the *madlib2-master* branch ( > https://github.com/apache/madlib/tree/madlib2-master), which introduces > full Python 3 support and reflects the current direction of the MADlib 2 > code base. The older *master* branch ( > https://github.com/apache/madlib/tree/master) retains Python 2 support and > is considered deprecated. > > Starting from the madlib2-master branch would ensure long-term > maintainability and alignment with the modern Python ecosystem. It also > provides a solid foundation for integrating Cloudberry-specific > enhancements while remaining compatible with future Apache MADlib releases. > > Ideally, with this and future contributions, you and others may also want > to consider interest in becoming members of the Apache MADlib PMC to help > guide the project into the future. > > Best regards, > -=e > > -- > Ed Espino > Apache Cloudberry (Incubating) & MADlib > > On Mon, Oct 20, 2025 at 4:50 AM Dianjin Wang <[email protected]> wrote: > > > Hi all, > > > > Happy Monday! > > > > I want to move this discussion forward and share some recent progress > > from our side. > > > > Our team has implemented support for Apache Cloudberry based on the > > latest madlib-master branch in the internal repository. After a > > discussion with Max, we believe that contributing this Cloudberry > > support back to the MADlib upstream would be a better and more > > sustainable approach. > > > > We see this as a great opportunity to help bring new vitality to the > > Apache MADlib project, while also enabling Cloudberry to benefit from > > broader ecosystem integration — a potential win-win for both > > communities. Now interested in taking concrete steps to move this > > forward and contribute our work upstream. > > > > Before proceeding, we’d love to hear the community’s thoughts. > > > > If the community agrees to accept Cloudberry support upstream, would > > submitting a Pull Request be the appropriate way to start? We’d also > > appreciate feedback from Ed and other community members once the PR is > > ready. > > > > Best, > > Dianjin Wang > > > > On Thu, Sep 25, 2025 at 3:53 PM Dianjin Wang <[email protected]> > > wrote: > > > > > > Can count me in. I'm interested in exploring the opportunity to revive > > > the MADlib project as an individual on the marketing and community > > > side. I can provide some ideas. > > > > > > > > > Best, > > > Dianjin Wang > > > > > > On Thu, Sep 25, 2025 at 3:39 PM Ed Espino <[email protected]> wrote: > > > > > > > > Hello Max, > > > > > > > > Thanks for reaching out. The latest Apache MADlib 2 code base is in the > > > > madlib2-master branch. The project team made the decision to use this > > > > branch due to the significant rewrite needed to move from Python 2 to > > > > Python 3. > > > > > > > > I believe your interest may be in bringing the latest MADlib support to > > > > Apache Cloudberry (Incubating). With the Greenplum DB project having > > gone > > > > closed source, there are currently no active downstream users of > > MADlib. If > > > > there is interest in reviving the Apache MADlib community, it would be > > > > great to start that discussion. There may also be opportunities for > > synergy > > > > with the Cloudberry project that we can explore together. > > > > > > > > Additionally, as you may know, I also serve on the Cloudberry PPMC. I > > > > haven’t had the chance to bring these potential collaborative efforts > > to > > > > the wider Cloudberry community consistently, and this may be a good > > > > opportunity to do so. > > > > > > > > I’d be glad to provide some guidance if you’d like to pursue this > > further. > > > > > > > > Best regards, > > > > -=e > > > > > > > > -- > > > > Ed Espino > > > > Apache Cloudberry (Incubating) & MADlib > > > > > > > > > > > > On Thu, Sep 25, 2025 at 12:13 AM Max Yang <[email protected]> > > wrote: > > > > > > > > > Hi there, > > > > > > > > > > Do you know which MADlib branch is used by gp7 > > > > > Is this the branch? > > https://github.com/apache/madlib/commits/madlib2-master > > > > > , > > > > > or is it the master branch? > > > > > I see that the master branch is older than madlib2-master. > > > > > > > > > > Appreciated in advance. > > > > > > > > > > Best regards, Max Yang > > > > > > >
