We're making progress.
I can see that Viraj has just landed PHOENIX-7601, and Rajeshbabu has
released Omid 1.1.1.
Thank you!

At the moment, the following outstanding issues are on my radar:

https://issues.apache.org/jira/browse/PHOENIX-7191
https://issues.apache.org/jira/browse/PHOENIX-7193

These are bugs in my non-ZK registry implementation, which were found
during HBase 3 work.
I have some PRs up, but they may not be complete. I will push for reviews
once I have the HBase 3 tests passing, and possibly updated them based on
that.

We also have a number of very flakey tests, see:

https://ci-hadoop.apache.org/job/Phoenix/job/Phoenix-mulitbranch/job/master/test_results_analyzer/



On Tue, Jan 30, 2024 at 7:09 AM Istvan Toth <st...@cloudera.com> wrote:

> As Viraj wrote, those are just plans.
> If HBase 3 won't be released by the time the other features are ready,
> then it won't make it into 5.3.
> If other major features are ready by that time, then they will be
> included. (though we are not aware of any now)
>
> As for the new major version, in the past Phoenix didn't have a
> compatibility module system,
> so a new branch was required,  which didn't support older HBases. Also,
> the API changes between HBase 1.x and 2.x were much larger,
> The HBase 2 and 3 API are pretty similar, apart from the removal of
> deprecated 1.x APIs. (and the protobuf/protocol thing, which requires a
> rather ugly hack).
>
> I will start the discussion on how we can add HBase 3 support as soon as I
> have a working POC patch.
>
> We could call 5.3 6.0 instead, after all Phoenix isn't using a strict
> semantic versioning, but then 6.0 would also support HBase 2.
> If we do not come to a consensus on the version name, we can always have a
> vote on it.
>
> I think that the main motivation is that the community wants to maintain
> as few branches as possible.
>
> Istvan
>
> On Mon, Jan 29, 2024 at 10:02 PM Stephen Jiang <syuanjiang...@gmail.com>
> wrote:
>
>> I am not sure how close HBase 3.0 is.  Even if it is only less than one
>> year away, the adoption would be low at the beginning.  I don't think 5.3
>> should wait for that.  And traditionally,  Phoenix would have a major
>> release to support the HBase major release (4.x for HBase 1.x and 5.x for
>> HBase 2.x), in this case, we are talking about Phoenix 6.0 for HBase 3.0.
>>
>> Maybe we should adopt the HBase release model: master branch for next
>> major
>> release (6.0) and branch-5.x branch for next 5.x minor release and
>> branch-5.2 for 5.2 minor release.
>>
>> Thanks
>> Stephen
>>
>>
>>
>> On Thu, Jan 25, 2024 at 11:50 PM Viraj Jasani <vjas...@apache.org> wrote:
>>
>> > Sounds good.
>> >
>> > Planned major changes for 5.3.0:
>> >
>> > 1. JSON support.
>> > 2. HBase 3.0 support.
>> > 3. CDC feature (leveraging uncovered global index framework and JSON
>> > support).
>> >
>> >
>> > On Wed, Jan 24, 2024 at 8:22 PM Kadir Ozdemir
>> > <kozde...@salesforce.com.invalid> wrote:
>> >
>> > > I suggest including another major change for 5.3, Phoenix CDC,
>> > > https://issues.apache.org/jira/browse/PHOENIX-7001. The PR for it
>> will
>> > be
>> > > posted soon.
>> > >
>> > > On Thu, Jan 25, 2024 at 5:35 AM Viraj Jasani <vjas...@apache.org>
>> wrote:
>> > >
>> > > > Thanks Istvan! I agree with your points. It’s really been a while we
>> > are
>> > > > talking about releasing 5.2.0 and yet due to bandwidth issues,
>> unable
>> > to
>> > > do
>> > > > so.
>> > > >
>> > > > I agree to getting these fixes out, cut 5.2 and start the release
>> work
>> > > and
>> > > > in the meantime I also need to prepare 5.1 backport.
>> > > >
>> > > > For 5.3, let's plan HBase 3.0 support and JSON as major changes.
>> > > >
>> > > >
>> > > > On Mon, Jan 22, 2024 at 9:17 PM Istvan Toth <st...@apache.org>
>> wrote:
>> > > >
>> > > > > Hi!
>> > > > >
>> > > > > In my opinion cutting 5.2 now only makes sense IF we DO NOT plan
>> to
>> > > > release
>> > > > > the outstanding big features (like JSON) in 5.2. , otherwise it's
>> > just
>> > > > > extra work to maintain more  branches.
>> > > > >
>> > > > > Having said that, releasing a 5.2 and 5.1.4 with the data
>> integrity
>> > > fixes
>> > > > > real soon, and then releasing 5.3 in a few months with JSON, and
>> any
>> > > > other
>> > > > > outstanding big features
>> > > > > that are close to being finished (and HBase 3.0 support, if it's
>> > ready
>> > > by
>> > > > > then) would not be a bad idea.
>> > > > >
>> > > > > On the CLDR side the only outstanding big feature which could
>> impact
>> > > > > Viraj's integrity work is HBase 3.0 support, and even that is only
>> > > > because
>> > > > > it may require some larger refactors of existing code, not
>> because it
>> > > > would
>> > > > > change the actual behaviour or algorithms.
>> > > > >
>> > > > > Phoenix used to have several minor releases per year, the current
>> > state
>> > > > of
>> > > > > extreme longevity of 5.1 and several big new features being added
>> to
>> > it
>> > > > > (like uncovered indexes) is not ideal.
>> > > > > Releasing 5.2 and 5.3 relatively close together could be a return
>> to
>> > a
>> > > > > quicker cadence for minor releases, which could also help with the
>> > > public
>> > > > > image and adoption of Phoenix.
>> > > > >
>> > > > > We were talking about releasing 5.2 at least a year ago, and I
>> have
>> > > > started
>> > > > > working on that then, but then emergencies have come up at
>> $dayjob,
>> > > and I
>> > > > > could not see that through.
>> > > > > (So I am in part responsible for the lack of minor releases)
>> > > > >
>> > > > > regards
>> > > > > Istvan
>> > > > >
>> > > > >
>> > > > > On Mon, Jan 22, 2024 at 11:35 PM Viraj Jasani <vjas...@apache.org
>> >
>> > > > wrote:
>> > > > >
>> > > > > > Sorry for the late reply.
>> > > > > >
>> > > > > > > Do you think cutting 5.2.0 now makes sense?
>> > > > > >
>> > > > > > No problem with that. I can cut 5.2 branch by the end of this
>> week
>> > or
>> > > > at
>> > > > > > the start of next week.
>> > > > > >
>> > > > > > If there is any very big change or feature ready for merge to
>> > master
>> > > > > branch
>> > > > > > with PR approvals already in place, please do let me know so
>> that I
>> > > can
>> > > > > > help collaborate on how best we can get it merged without
>> impacting
>> > > 5.2
>> > > > > > release if required. My main motivation was for any big change
>> to
>> > go
>> > > > > > through newly introduced tests so that we know that anything
>> > > additional
>> > > > > is
>> > > > > > not broken, and also to prioritize for upcoming 5.2.0 and 5.1.4
>> > > > releases.
>> > > > > > Moreover, there are several PRs getting merged on the master
>> > branch,
>> > > we
>> > > > > can
>> > > > > > continue that as long as they are not very big changes, which
>> might
>> > > > > require
>> > > > > > significant time to understand any correlation with data
>> integrity
>> > > > > issues.
>> > > > > >
>> > > > > > The PR is also ready for review with some additional cases fixed
>> > last
>> > > > > week:
>> > > > > >
>> > > >
>> > >
>> >
>> https://urldefense.com/v3/__https://github.com/apache/phoenix/pull/1736__;!!DCbAVzZNrAf4!D4OVjUp2EWW2BqhGnBxsapDX_AHsibRphIpoFBWfgRsd3dsAikrFLo6PGxdTzGbSXJJ2fJ0j9mcz3asXMXo$
>> > > > > >
>> > > > > > Depending on the review bandwidth, I am hopeful we should be
>> good
>> > to
>> > > > land
>> > > > > > them sooner.
>> > > > > >
>> > > > > >
>> > > > > > On Wed, Jan 17, 2024 at 11:31 AM Rushabh Shah
>> > > > > > <rushabh.s...@salesforce.com.invalid> wrote:
>> > > > > >
>> > > > > > > Thank you Viraj for initiating this thread.
>> > > > > > >
>> > > > > > > > Given the critical nature of these issues, I would like to
>> > > propose
>> > > > > that
>> > > > > > > we
>> > > > > > > treat this as a high priority for the upcoming 5.2.0 release,
>> and
>> > > not
>> > > > > > > include any other feature or big change to master branch
>> until we
>> > > > merge
>> > > > > > > this.
>> > > > > > >
>> > > > > > > Do you think cutting 5.2.0 now makes sense? This will enable
>> > other
>> > > > > > > developers to merge features into master branch (5.3.0) and
>> you
>> > can
>> > > > > take
>> > > > > > > some more time to make sure we cover all the corner cases for
>> the
>> > > > data
>> > > > > > > integrity issues that you uncovered.
>> > > > > > >
>> > > > > > >
>> > > > > > > On Fri, Jan 5, 2024 at 6:38 PM Viraj Jasani <
>> vjas...@apache.org>
>> > > > > wrote:
>> > > > > > >
>> > > > > > > > Sounds good, thanks Rajeshbabu. I will try to get the first
>> PR
>> > > out
>> > > > > next
>> > > > > > > > week and while reviews happen in parallel, will try to get
>> 5.1
>> > PR
>> > > > > soon.
>> > > > > > > >
>> > > > > > > >
>> > > > > > > > On Wed, Jan 3, 2024 at 8:49 PM rajeshb...@apache.org <
>> > > > > > > > chrajeshbab...@gmail.com> wrote:
>> > > > > > > >
>> > > > > > > > > Hi Viraj,
>> > > > > > > > >
>> > > > > > > > > Would be better to include the changes in 5.1.4  as in any
>> > way
>> > > it
>> > > > > > will
>> > > > > > > > take
>> > > > > > > > > at least 3-4 days to complete the omid release.
>> > > > > > > > >
>> > > > > > > > > Thanks,
>> > > > > > > > > Rajeshbabu.
>> > > > > > > > >
>> > > > > > > > >
>> > > > > > > > > On Thu, Jan 4, 2024 at 5:06 AM Viraj Jasani <
>> > > vjas...@apache.org>
>> > > > > > > wrote:
>> > > > > > > > >
>> > > > > > > > > > Thank you Kadir and Geoffrey for your replies!!
>> > > > > > > > > >
>> > > > > > > > > > > How does this affect 5.1.4, which is also listed as a
>> Fix
>> > > > > Version
>> > > > > > > for
>> > > > > > > > > > > PHOENIX-7106?
>> > > > > > > > > >
>> > > > > > > > > > Yes, it also needs to be ported to 5.1. Once the master
>> PR
>> > is
>> > > > up
>> > > > > > for
>> > > > > > > > > final
>> > > > > > > > > > review, I would start working on the backport PR.
>> > > > > > > > > > We just need some more additional testing to ensure old
>> > > client
>> > > > > > (e.g.
>> > > > > > > > > 5.1.3)
>> > > > > > > > > > is compatible with the new server with the changes.
>> > > > > > > > > >
>> > > > > > > > > > Hence, yes it is now a blocker for upcoming 5.1.4 as
>> well
>> > > since
>> > > > > > 5.1.4
>> > > > > > > > RC
>> > > > > > > > > > preparation is still pending (while Omid release is in
>> > > > progress).
>> > > > > > > > > > Otherwise if 5.1.4 was ready for release, I would have
>> > > proposed
>> > > > > > > > immediate
>> > > > > > > > > > 5.1.5 release to include the changes proposed with
>> > > > PHOENIX-7106.
>> > > > > > > > > >
>> > > > > > > > > >
>> > > > > > > > > > On Wed, Jan 3, 2024 at 3:08 PM Geoffrey Jacoby <
>> > > > > gjac...@apache.org
>> > > > > > >
>> > > > > > > > > wrote:
>> > > > > > > > > >
>> > > > > > > > > > > I agree that data integrity issues are a higher
>> priority
>> > > than
>> > > > > > > feature
>> > > > > > > > > > > development, so I also support the decision. The fact
>> > that
>> > > > > > several
>> > > > > > > of
>> > > > > > > > > the
>> > > > > > > > > > > major remaining 5.2 features are currently being
>> > developed
>> > > in
>> > > > > > > > > > long-running
>> > > > > > > > > > > feature branches also helps, as work can continue
>> there
>> > at
>> > > > the
>> > > > > > cost
>> > > > > > > > of
>> > > > > > > > > a
>> > > > > > > > > > > rebase later.
>> > > > > > > > > > >
>> > > > > > > > > > > How does this affect 5.1.4, which is also listed as a
>> Fix
>> > > > > Version
>> > > > > > > for
>> > > > > > > > > > > PHOENIX-7106? From the bug description it also sounds
>> > like
>> > > > > 5.1.3
>> > > > > > > and
>> > > > > > > > > the
>> > > > > > > > > > > forthcoming .4 are affected, since we have server-side
>> > > paging
>> > > > > in
>> > > > > > > 5.1.
>> > > > > > > > > > (Feel
>> > > > > > > > > > > free to move that to a separate thread if you feel it
>> > > should
>> > > > > be a
>> > > > > > > > > > separate
>> > > > > > > > > > > discussion.) Should this be a blocker for releasing
>> > 5.1.4?
>> > > > > > > > > > >
>> > > > > > > > > > > Geoffrey
>> > > > > > > > > > >
>> > > > > > > > > > >
>> > > > > > > > > > > On Wed, Jan 3, 2024 at 5:06 PM Kadir Ozdemir <
>> > > > > > > > > > > ka...@gsuite.cloud.apache.org>
>> > > > > > > > > > > wrote:
>> > > > > > > > > > >
>> > > > > > > > > > > > Being a database, Phoenix has to make sure that the
>> > data
>> > > > > stays
>> > > > > > on
>> > > > > > > > > disk
>> > > > > > > > > > > > intact and its queries return correct data. In this
>> > case,
>> > > > > > Phoenix
>> > > > > > > > > fails
>> > > > > > > > > > > to
>> > > > > > > > > > > > return correct data for some queries if their scans
>> > > > > experience
>> > > > > > > > region
>> > > > > > > > > > > > movement. Now that we know these data integrity
>> issues
>> > > and
>> > > > > how
>> > > > > > to
>> > > > > > > > > > > reproduce
>> > > > > > > > > > > > them, fixing them should be our first priority. So,
>> I
>> > > fully
>> > > > > > > support
>> > > > > > > > > > this
>> > > > > > > > > > > > proposal.
>> > > > > > > > > > > >
>> > > > > > > > > > > > On Wed, Jan 3, 2024 at 10:58 PM Viraj Jasani <
>> > > > > > vjas...@apache.org
>> > > > > > > >
>> > > > > > > > > > wrote:
>> > > > > > > > > > > >
>> > > > > > > > > > > > > Hello,
>> > > > > > > > > > > > >
>> > > > > > > > > > > > > I would like to bring PHOENIX-7106
>> > > > > > > > > > > > > <
>> > > > > > > >
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>> https://urldefense.com/v3/__https://issues.apache.org/jira/browse/PHOENIX-7106__;!!DCbAVzZNrAf4!FZG5sv55IC1NqItQLY7GKWgUG2Do0gSta01gOiSdd36Dx3XHGtQx4M3c9visVXIt9DctPQzS-orob9vhzrCfVA$
>> > > > > > > > > to everyone's
>> > > > > > > > > > > > > attention here and brief about the data integrity
>> > > issues
>> > > > > that
>> > > > > > > we
>> > > > > > > > > have
>> > > > > > > > > > > in
>> > > > > > > > > > > > > various coprocessors. Majority of the issues are
>> > > related
>> > > > to
>> > > > > > the
>> > > > > > > > > fact
>> > > > > > > > > > > that
>> > > > > > > > > > > > > we do not return valid rowkey for certain
>> queries. If
>> > > any
>> > > > > > > region
>> > > > > > > > > > moves
>> > > > > > > > > > > in
>> > > > > > > > > > > > > the middle of the scan, the HBase client relies on
>> > the
>> > > > last
>> > > > > > > > > returned
>> > > > > > > > > > > > rowkey
>> > > > > > > > > > > > > and accordingly changes the scan boundaries while
>> the
>> > > > > scanner
>> > > > > > > is
>> > > > > > > > > > > getting
>> > > > > > > > > > > > > reset to continue the scan operation. If the
>> region
>> > > does
>> > > > > not
>> > > > > > > > move,
>> > > > > > > > > > scan
>> > > > > > > > > > > > is
>> > > > > > > > > > > > > not expected to return invalid data, however if
>> the
>> > > > region
>> > > > > > > moves
>> > > > > > > > in
>> > > > > > > > > > the
>> > > > > > > > > > > > > middle of ongoing scan operation, scan would
>> return
>> > > > > > > > > invalid/incorrect
>> > > > > > > > > > > > data
>> > > > > > > > > > > > > causing data integrity issues.
>> > > > > > > > > > > > >
>> > > > > > > > > > > > > Given the critical nature of these issues, I would
>> > like
>> > > > to
>> > > > > > > > propose
>> > > > > > > > > > that
>> > > > > > > > > > > > we
>> > > > > > > > > > > > > treat this as a high priority for the upcoming
>> 5.2.0
>> > > > > release,
>> > > > > > > and
>> > > > > > > > > not
>> > > > > > > > > > > > > include any other feature or big change to master
>> > > branch
>> > > > > > until
>> > > > > > > we
>> > > > > > > > > > merge
>> > > > > > > > > > > > > this. The PR is still not ready as additional
>> changes
>> > > are
>> > > > > > still
>> > > > > > > > in
>> > > > > > > > > my
>> > > > > > > > > > > > > local, requiring rebase with the current master.
>> > > > > > > > > > > > >
>> > > > > > > > > > > > > I would get back to this discuss thread as soon as
>> > the
>> > > PR
>> > > > > and
>> > > > > > > the
>> > > > > > > > > doc
>> > > > > > > > > > > are
>> > > > > > > > > > > > > updated with the latest findings so far. The
>> changes
>> > > > > include
>> > > > > > > many
>> > > > > > > > > of
>> > > > > > > > > > > our
>> > > > > > > > > > > > > coproc scanner implementations and hence it would
>> > > require
>> > > > > > > > > significant
>> > > > > > > > > > > > > review as well.
>> > > > > > > > > > > > > It would be great if we can hold on to merging any
>> > > > feature
>> > > > > or
>> > > > > > > big
>> > > > > > > > > > > change
>> > > > > > > > > > > > to
>> > > > > > > > > > > > > master branch until this gets in so as to not
>> > > complicate
>> > > > > > > > > > > > merging/rebasing.
>> > > > > > > > > > > > > Once this is merged to the master branch, I would
>> > like
>> > > to
>> > > > > cut
>> > > > > > > 5.2
>> > > > > > > > > > > branch
>> > > > > > > > > > > > > from master and we can move forward with 5.2.0
>> > release.
>> > > > > > > > > > > > >
>> > > > > > > > > > > > > Please let me know if this looks good or if you
>> have
>> > > any
>> > > > > > other
>> > > > > > > > high
>> > > > > > > > > > > > > priority work for 5.2.0.
>> > > > > > > > > > > > >
>> > > > > > > > > > > >
>> > > > > > > > > > >
>> > > > > > > > > >
>> > > > > > > > >
>> > > > > > > >
>> > > > > > >
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>>
>
>
> --
> *István Tóth* | Sr. Staff Software Engineer
> *Email*: st...@cloudera.com
> cloudera.com <https://www.cloudera.com>
> [image: Cloudera] <https://www.cloudera.com/>
> [image: Cloudera on Twitter] <https://twitter.com/cloudera> [image:
> Cloudera on Facebook] <https://www.facebook.com/cloudera> [image:
> Cloudera on LinkedIn] <https://www.linkedin.com/company/cloudera>
> ------------------------------
> ------------------------------
>

Reply via email to