Hi! In my opinion cutting 5.2 now only makes sense IF we DO NOT plan to release the outstanding big features (like JSON) in 5.2. , otherwise it's just extra work to maintain more branches.
Having said that, releasing a 5.2 and 5.1.4 with the data integrity fixes real soon, and then releasing 5.3 in a few months with JSON, and any other outstanding big features that are close to being finished (and HBase 3.0 support, if it's ready by then) would not be a bad idea. On the CLDR side the only outstanding big feature which could impact Viraj's integrity work is HBase 3.0 support, and even that is only because it may require some larger refactors of existing code, not because it would change the actual behaviour or algorithms. Phoenix used to have several minor releases per year, the current state of extreme longevity of 5.1 and several big new features being added to it (like uncovered indexes) is not ideal. Releasing 5.2 and 5.3 relatively close together could be a return to a quicker cadence for minor releases, which could also help with the public image and adoption of Phoenix. We were talking about releasing 5.2 at least a year ago, and I have started working on that then, but then emergencies have come up at $dayjob, and I could not see that through. (So I am in part responsible for the lack of minor releases) regards Istvan On Mon, Jan 22, 2024 at 11:35 PM Viraj Jasani <vjas...@apache.org> wrote: > Sorry for the late reply. > > > Do you think cutting 5.2.0 now makes sense? > > No problem with that. I can cut 5.2 branch by the end of this week or at > the start of next week. > > If there is any very big change or feature ready for merge to master branch > with PR approvals already in place, please do let me know so that I can > help collaborate on how best we can get it merged without impacting 5.2 > release if required. My main motivation was for any big change to go > through newly introduced tests so that we know that anything additional is > not broken, and also to prioritize for upcoming 5.2.0 and 5.1.4 releases. > Moreover, there are several PRs getting merged on the master branch, we can > continue that as long as they are not very big changes, which might require > significant time to understand any correlation with data integrity issues. > > The PR is also ready for review with some additional cases fixed last week: > https://github.com/apache/phoenix/pull/1736 > > Depending on the review bandwidth, I am hopeful we should be good to land > them sooner. > > > On Wed, Jan 17, 2024 at 11:31 AM Rushabh Shah > <rushabh.s...@salesforce.com.invalid> wrote: > > > Thank you Viraj for initiating this thread. > > > > > Given the critical nature of these issues, I would like to propose that > > we > > treat this as a high priority for the upcoming 5.2.0 release, and not > > include any other feature or big change to master branch until we merge > > this. > > > > Do you think cutting 5.2.0 now makes sense? This will enable other > > developers to merge features into master branch (5.3.0) and you can take > > some more time to make sure we cover all the corner cases for the data > > integrity issues that you uncovered. > > > > > > On Fri, Jan 5, 2024 at 6:38 PM Viraj Jasani <vjas...@apache.org> wrote: > > > > > Sounds good, thanks Rajeshbabu. I will try to get the first PR out next > > > week and while reviews happen in parallel, will try to get 5.1 PR soon. > > > > > > > > > On Wed, Jan 3, 2024 at 8:49 PM rajeshb...@apache.org < > > > chrajeshbab...@gmail.com> wrote: > > > > > > > Hi Viraj, > > > > > > > > Would be better to include the changes in 5.1.4 as in any way it > will > > > take > > > > at least 3-4 days to complete the omid release. > > > > > > > > Thanks, > > > > Rajeshbabu. > > > > > > > > > > > > On Thu, Jan 4, 2024 at 5:06 AM Viraj Jasani <vjas...@apache.org> > > wrote: > > > > > > > > > Thank you Kadir and Geoffrey for your replies!! > > > > > > > > > > > How does this affect 5.1.4, which is also listed as a Fix Version > > for > > > > > > PHOENIX-7106? > > > > > > > > > > Yes, it also needs to be ported to 5.1. Once the master PR is up > for > > > > final > > > > > review, I would start working on the backport PR. > > > > > We just need some more additional testing to ensure old client > (e.g. > > > > 5.1.3) > > > > > is compatible with the new server with the changes. > > > > > > > > > > Hence, yes it is now a blocker for upcoming 5.1.4 as well since > 5.1.4 > > > RC > > > > > preparation is still pending (while Omid release is in progress). > > > > > Otherwise if 5.1.4 was ready for release, I would have proposed > > > immediate > > > > > 5.1.5 release to include the changes proposed with PHOENIX-7106. > > > > > > > > > > > > > > > On Wed, Jan 3, 2024 at 3:08 PM Geoffrey Jacoby <gjac...@apache.org > > > > > > wrote: > > > > > > > > > > > I agree that data integrity issues are a higher priority than > > feature > > > > > > development, so I also support the decision. The fact that > several > > of > > > > the > > > > > > major remaining 5.2 features are currently being developed in > > > > > long-running > > > > > > feature branches also helps, as work can continue there at the > cost > > > of > > > > a > > > > > > rebase later. > > > > > > > > > > > > How does this affect 5.1.4, which is also listed as a Fix Version > > for > > > > > > PHOENIX-7106? From the bug description it also sounds like 5.1.3 > > and > > > > the > > > > > > forthcoming .4 are affected, since we have server-side paging in > > 5.1. > > > > > (Feel > > > > > > free to move that to a separate thread if you feel it should be a > > > > > separate > > > > > > discussion.) Should this be a blocker for releasing 5.1.4? > > > > > > > > > > > > Geoffrey > > > > > > > > > > > > > > > > > > On Wed, Jan 3, 2024 at 5:06 PM Kadir Ozdemir < > > > > > > ka...@gsuite.cloud.apache.org> > > > > > > wrote: > > > > > > > > > > > > > Being a database, Phoenix has to make sure that the data stays > on > > > > disk > > > > > > > intact and its queries return correct data. In this case, > Phoenix > > > > fails > > > > > > to > > > > > > > return correct data for some queries if their scans experience > > > region > > > > > > > movement. Now that we know these data integrity issues and how > to > > > > > > reproduce > > > > > > > them, fixing them should be our first priority. So, I fully > > support > > > > > this > > > > > > > proposal. > > > > > > > > > > > > > > On Wed, Jan 3, 2024 at 10:58 PM Viraj Jasani < > vjas...@apache.org > > > > > > > > wrote: > > > > > > > > > > > > > > > Hello, > > > > > > > > > > > > > > > > I would like to bring PHOENIX-7106 > > > > > > > > < > > > > > > https://urldefense.com/v3/__https://issues.apache.org/jira/browse/PHOENIX-7106__;!!DCbAVzZNrAf4!FZG5sv55IC1NqItQLY7GKWgUG2Do0gSta01gOiSdd36Dx3XHGtQx4M3c9visVXIt9DctPQzS-orob9vhzrCfVA$ > > > > to everyone's > > > > > > > > attention here and brief about the data integrity issues that > > we > > > > have > > > > > > in > > > > > > > > various coprocessors. Majority of the issues are related to > the > > > > fact > > > > > > that > > > > > > > > we do not return valid rowkey for certain queries. If any > > region > > > > > moves > > > > > > in > > > > > > > > the middle of the scan, the HBase client relies on the last > > > > returned > > > > > > > rowkey > > > > > > > > and accordingly changes the scan boundaries while the scanner > > is > > > > > > getting > > > > > > > > reset to continue the scan operation. If the region does not > > > move, > > > > > scan > > > > > > > is > > > > > > > > not expected to return invalid data, however if the region > > moves > > > in > > > > > the > > > > > > > > middle of ongoing scan operation, scan would return > > > > invalid/incorrect > > > > > > > data > > > > > > > > causing data integrity issues. > > > > > > > > > > > > > > > > Given the critical nature of these issues, I would like to > > > propose > > > > > that > > > > > > > we > > > > > > > > treat this as a high priority for the upcoming 5.2.0 release, > > and > > > > not > > > > > > > > include any other feature or big change to master branch > until > > we > > > > > merge > > > > > > > > this. The PR is still not ready as additional changes are > still > > > in > > > > my > > > > > > > > local, requiring rebase with the current master. > > > > > > > > > > > > > > > > I would get back to this discuss thread as soon as the PR and > > the > > > > doc > > > > > > are > > > > > > > > updated with the latest findings so far. The changes include > > many > > > > of > > > > > > our > > > > > > > > coproc scanner implementations and hence it would require > > > > significant > > > > > > > > review as well. > > > > > > > > It would be great if we can hold on to merging any feature or > > big > > > > > > change > > > > > > > to > > > > > > > > master branch until this gets in so as to not complicate > > > > > > > merging/rebasing. > > > > > > > > Once this is merged to the master branch, I would like to cut > > 5.2 > > > > > > branch > > > > > > > > from master and we can move forward with 5.2.0 release. > > > > > > > > > > > > > > > > Please let me know if this looks good or if you have any > other > > > high > > > > > > > > priority work for 5.2.0. > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >