Hi all,

We've moved to the next patch: https://gerrit.cloudera.org/c/12345/. It's
the first step to add LocalCatalog to branch-2.x. Anyone could give it
a +2? Or anyone has objections for it?

Thanks,
Quanlong

On Thu, Jan 31, 2019 at 9:00 PM Quanlong Huang <huangquanl...@gmail.com>
wrote:

> Sure. I think "fine-grained privileges" always introduce small changes in
> behaviors, i.e. unprivileged users used to be able to do something but they
> can't do so after an upgrade. We accept it since it's reasonable.
>
> There're incompatible changes too in the previous releases:
> https://www.cloudera.com/documentation/enterprise/release-notes/topics/impala_incompatible_changes.html.
> What we need is to document it well :)
>
> I just moved forward and start GVO job for the patch. Thanks!
>
>
>
> On Thu, Jan 31, 2019 at 2:00 AM Bharath Vissapragada <
> bhara...@cloudera.com> wrote:
>
>> On Wed, Jan 30, 2019 at 12:21 AM Quanlong Huang <huangquanl...@gmail.com>
>> wrote:
>>
>> > I'm afraid the difference between branch-2.x and Cloudera's branch is
>> > larger than the difference between branch-2.x and master branch.
>> Cloudera's
>> > branch already ignored lots of commits, which causes the gap. I've tried
>> > cherry-pick from master or Cloudera's branch and found it's much easier
>> to
>> > pick from master branch.
>> > If https://gerrit.cloudera.org/c/12292/ is merged, I can easily pick
>> 40+
>> > commits into branch-2.x with few conflicts to resolve!! See the first
>> > column in the sheet:
>> >
>> >
>> https://docs.google.com/spreadsheets/d/12h1rTAPS1gm0vhlDGxeOXjnRD7rrOcoqzX4rjRRCyBg
>> > The result is here:
>> > https://github.com/stiga-huang/incubator-impala/tree/future-2.x
>>
>>
>> Sure whatever works best for you. You are right that the Cloudera branch
>> was selective in cherry-picking stuff. We mostly focussed on "fetch
>> on-demand metadata" changes and "finer grained privileges" and ignored the
>> rest. If that is what you are looking for, it is easier to cherry-pick
>> from
>> Cloudera's branch. Otherwise probably better to replay commits from the
>> master branch.
>>
>>
>> >
>> > We can restart the cherrypick-2.x-and-test Jenkins job. Each time
>> > there're conflicts, I'll come to resolve it. If the job keeps running,
>> it's
>> > possible for branch-2.x to catch up the master branch!
>> >
>> > Besides, https://gerrit.cloudera.org/c/12292/ is about DESCRIBE
>> behavior
>> > in
>> > FGP(Fine-grained privileges). I think it's reasonable and not
>> > compatibility breaking. Does anyone have more thoughts about this?
>> >
>>
>> Hmm, I see what you are saying.  Definitely helps to include it to make
>> future cherry-picks easier.
>>
>> Technically it is still a behavioral change, especially if someone
>> upgrades
>> to a version with this fix and we typically try to avoid that (describe
>> that worked before doesn't work after upgrade). I can't speak about why we
>> included it in the Cloudera branch since that was an internal decision but
>> I don't know if we have any policies here around backporting such stuff
>> into older branches. Maybe good to know what others think.
>>
>> Anyway, I don't feel too strongly about this and since it is blocking your
>> work, I removed my -1 on the code review but this is something to keep in
>> mind when backporting such patches.
>>
>>
>> >
>> > On Wed, Jan 30, 2019 at 9:41 AM Fredy Wijaya <fwij...@cloudera.com>
>> wrote:
>> >
>> > > Due to the way we build 2.x where it can't use the pinned versions of
>> CDH
>> > > dependencies, it may be better to cherry-pick all commits in
>> > > https://github.com/cloudera/Impala/tree/cdh5-2.12.0_5.16.0, which
>> also
>> > > includes that DESCRIBE commit to avoid further integration issues
>> later
>> > on.
>> > >
>> > > On Tue, Jan 29, 2019 at 5:05 PM Quanlong Huang <
>> huangquanl...@gmail.com>
>> > > wrote:
>> > >
>> > > > Hi Bharath,
>> > > >
>> > > > Thank you a lot for your notice! However, I've gone through the
>> commits
>> > > of
>> > > > cdh branch before and found that this patch is also picked:
>> > > >
>> > > >
>> > >
>> >
>> https://github.com/cloudera/Impala/commit/f8a318d4f75e22a963b9cf4786ef07d2cd6bd93c
>> > > > .
>> > > > Is this really a compatibility breaking change?
>> > > >
>> > > > I'm also concern that the TestDescribeTableResults it introduced is
>> too
>> > > > strictly that may cause troubles. However, I found two later commits
>> > > > (IMPALA-7143 and IMPALA-7144) would fix this. I'm going to
>> cherry-pick
>> > > > these two and IMPALA-7676 (thanks Fredy's advise too!) right after
>> > > > https://gerrit.cloudera.org/c/12292/ is merged.
>> > > >
>> > > > Please let me know if this will go astray. Thanks!
>> > > >
>> > > >
>> > > > On Wed, Jan 30, 2019 at 12:36 AM Bharath Vissapragada <
>> > > > bhara...@cloudera.com>
>> > > > wrote:
>> > > >
>> > > > > On Mon, Jan 28, 2019 at 11:36 PM Quanlong Huang <
>> > > huangquanl...@gmail.com
>> > > > >
>> > > > > wrote:
>> > > > >
>> > > > > > >>For the first patch, "0b540b025 IMPALA-7128 (part 1) Refactor
>> > > > > interfaces
>> > > > > > for Db, View, Table, Partition", the cherry-pick conflicts is
>> due
>> > to
>> > > > the
>> > > > > > revert of IMPALA-6479 in 2.x. I'm testing branch-2.x with
>> > IMPALA-6479
>> > > > > being
>> > > > > > picked back. Does anyone know why we revert it? (I also comment
>> in
>> > > the
>> > > > > > JIRA).
>> > > > > > >
>> > > > > > >There are test failures. I guess it's the reason. Hopefully,
>> > > > > > cdh-5.16.1-release already picked up this patch, which provides
>> > some
>> > > > > > pointers :)
>> > > > > >
>> > > > > > I fix the test failures and create a review at
>> > > > > > https://gerrit.cloudera.org/c/12292/
>> > > > > > Waiting for Jenkins maintenance to finish and then run a GVO.
>> Hopes
>> > > > > someone
>> > > > > > can join and have a look!
>> > > > > >
>> > > > > > On Tue, Jan 29, 2019 at 7:39 AM Quanlong Huang <
>> > > > huangquanl...@gmail.com>
>> > > > > > wrote:
>> > > > > >
>> > > > > > > >For the first patch, "0b540b025 IMPALA-7128 (part 1) Refactor
>> > > > > interfaces
>> > > > > > > for Db, View, Table, Partition", the cherry-pick conflicts is
>> due
>> > > to
>> > > > > the
>> > > > > > > revert of IMPALA-6479 in 2.x. I'm testing branch-2.x with
>> > > IMPALA-6479
>> > > > > > being
>> > > > > > > picked back. Does anyone know why we revert it? (I also
>> comment
>> > in
>> > > > the
>> > > > > > > JIRA).
>> > > > > >
>> > > > >
>> > > > > It was reverted because it is a compatibility breaking change. We
>> > > > typically
>> > > > > try not to introduce such behavioral changes in the same major
>> > version
>> > > > line
>> > > > > as that can cause upgrade issues.
>> > > > >
>> > > > >
>> > > > > > >
>> > > > > > > There are test failures. I guess it's the reason. Hopefully,
>> > > > > > > cdh-5.16.1-release already picked up this patch, which
>> provides
>> > > some
>> > > > > > > pointers :)
>> > > > > >
>> > > > >
>> > > > > I work at Cloudera and we've gone through this exercise before.
>> It is
>> > > > > annoying to resolve the conflicts, so you can reuse our work and
>> save
>> > > > some
>> > > > > time.
>> > > > > https://github.com/cloudera/Impala/tree/cdh5-2.12.0_5.16.0
>> > > > >
>> > > > >
>> > > > > > >
>> > > > > > > On Mon, Jan 28, 2019 at 10:51 PM Quanlong Huang <
>> > > > > huangquanl...@gmail.com
>> > > > > > >
>> > > > > > > wrote:
>> > > > > > >
>> > > > > > >> Yes, there are two discussion threads before that are
>> relative
>> > to
>> > > > > this.
>> > > > > > >> One for stopping the cherrypick-2.x-and-test jenkins job:
>> > > > > > >>
>> > > > > > >>
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>> https://lists.apache.org/thread.html/2b4b62d4c07661b27a5203618cb0425a429f6460f2eb505acbcd26c6@%3Cdev.impala.apache.org%3E
>> > > > > > >>
>> > > > > > >> The other for removing support for hadoop 2 in master branch:
>> > > > > > >>
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>> https://lists.apache.org/thread.html/49f9b68ed3d6d2c0fdee16a877b259922545e4824e1233479227a657@%3Cdev.impala.apache.org%3E
>> > > > > > >>
>> > > > > > >> I'm +1 with the second thread that we only support Hadoop 2
>> in
>> > > > > > branch-2.x
>> > > > > > >> and support Hadoop 3 in the master branch to be more focused.
>> > I'm
>> > > > also
>> > > > > > >> agree with Paul's concern. It's such a dilemma that if we
>> skip
>> > > some
>> > > > > > >> commits, things will be harder and harder as we moving
>> forward;
>> > if
>> > > > we
>> > > > > > >> cherry-pick, review, and test the commits one by one,
>> branch-2.x
>> > > > will
>> > > > > > never
>> > > > > > >> catch up the master branch, which is an obstacle if someone
>> > (like
>> > > > me)
>> > > > > > wants
>> > > > > > >> to backport his/her new patch to branch-2.x but waits too
>> long
>> > and
>> > > > > > finally
>> > > > > > >> fogets details of the patch.
>> > > > > > >>
>> > > > > > >> I roughly investigated how other systems deal with multiple
>> > > > branches.
>> > > > > > The
>> > > > > > >> efforts to backport a patch could be the same for the
>> original
>> > > > patch.
>> > > > > > It's
>> > > > > > >> not a easy go, so the Hive community declares that
>> > > > > > >> "The decision to port a feature from master to branch-1 is at
>> > the
>> > > > > > >> discretion of the contributor and committer. However no
>> features
>> > > > that
>> > > > > > break
>> > > > > > >> backwards compatibility will be accepted on branch-1."
>> > > > > > >>
>> > > > > > >> I think it's a chance to understand more parts of Impala by
>> > > learning
>> > > > > and
>> > > > > > >> backporting the patches, since they have execellent commit
>> > > messages
>> > > > > and
>> > > > > > >> were strictly reviewed. So I volunteer for the job to move
>> > forward
>> > > > the
>> > > > > > >> branch-2.x. Hopes patch authors could give some pointers when
>> > I'm
>> > > > > > blocked!
>> > > > > > >> I'll try approach (b) first and switch to (a) when (b)
>> becomes
>> > > > > > impossible
>> > > > > > >> after too many commits are skipped. I'll confirm with the
>> author
>> > > if
>> > > > I
>> > > > > > think
>> > > > > > >> a patch should be skipped.
>> > > > > > >>
>> > > > > > >> For the first patch, "0b540b025 IMPALA-7128 (part 1) Refactor
>> > > > > interfaces
>> > > > > > >> for Db, View, Table, Partition", the cherry-pick conflicts is
>> > due
>> > > to
>> > > > > the
>> > > > > > >> revert of IMPALA-6479 in 2.x. I'm testing branch-2.x with
>> > > > IMPALA-6479
>> > > > > > being
>> > > > > > >> picked back. Does anyone know why we revert it? (I also
>> comment
>> > in
>> > > > the
>> > > > > > >> JIRA).
>> > > > > > >>
>> > > > > > >> On Mon, Jan 28, 2019 at 12:43 PM Philip Zeyliger <
>> > > > phi...@cloudera.com
>> > > > > >
>> > > > > > >> wrote:
>> > > > > > >>
>> > > > > > >>> As for Quanlong's question, I think the answer is however
>> the
>> > > folks
>> > > > > who
>> > > > > > >>> want to do the work prefer to do it. As you noticed in the
>> CDH
>> > > > > > >>> changelists,
>> > > > > > >>> Cloudera's distribution has opted for something more like
>> > > approach
>> > > > > (a),
>> > > > > > >>> choosing to backport individual features. For a while, we
>> were
>> > > > doing
>> > > > > > >>> automation for cherry-picking things automatically, and it
>> got
>> > > > > tedious
>> > > > > > >>> enough that we decided to turn it off.
>> > > > > > >>>
>> > > > > > >>> On Sun, Jan 27, 2019 at 7:37 PM Paul Rogers <
>> > > prog...@cloudera.com>
>> > > > > > >>> wrote:
>> > > > > > >>>
>> > > > > > >>> > Hi Quanlong,
>> > > > > > >>> >
>> > > > > > >>> > Thanks for the suggestion. I wonder if there is a third
>> > > strategy:
>> > > > > > >>> >
>> > > > > > >>> > c) Isolate the Hadoop 2.x/3.x differences into
>> > clearly-defined
>> > > > > driver
>> > > > > > >>> > layer so that basically all of 3.x can be applied to the
>> 2.x
>> > > > > branch.
>> > > > > > >>> Said
>> > > > > > >>> > another way, a single source base can work against either
>> > > Hadoop
>> > > > > 2.x
>> > > > > > or
>> > > > > > >>> > 3.x, with the build (C++) or runtime (Java) choosing the
>> > proper
>> > > > > > >>> “driver”
>> > > > > > >>> > classes.
>> > > > > > >>> >
>> > > > > > >>>
>> > > > > > >>> We had such a layer for a while, where Impala master could
>> be
>> > > built
>> > > > > > >>> against
>> > > > > > >>> either Hadoop3 or Hadoop2. We decided to clean it up in
>> commit
>> > > > > > >>> e4ae605b083ab536c68552e37ca3c46f6bff4c76.
>> > > > > > >>>
>> > > > > > >>> commit e4ae605b083ab536c68552e37ca3c46f6bff4c76
>> > > > > > >>> Author: Fredy Wijaya <fwij...@cloudera.com>
>> > > > > > >>> Date:   Thu Jul 12 17:01:13 2018 -0700
>> > > > > > >>>
>> > > > > > >>>     IMPALA-7295: Remove IMPALA_MINICLUSTER_PROFILE=2
>> > > > > > >>>
>> > > > > > >>>     This patch removes the use of
>> IMPALA_MINICLUSTER_PROFILE.
>> > The
>> > > > > code
>> > > > > > >>> that
>> > > > > > >>>     uses IMPALA_MINICLUSTER_PROFILE=2 is removed and it
>> > defaults
>> > > to
>> > > > > > code
>> > > > > > >>> from
>> > > > > > >>>     IMPALA_MINICLUSTER_PROFILE=3. In order to reduce having
>> too
>> > > > many
>> > > > > > code
>> > > > > > >>>     changes in this patch, there is no code change for the
>> > shims.
>> > > > The
>> > > > > > >>> shims
>> > > > > > >>>     for IMPALA_MINICLUSTER_PROFILE=3 automatically become
>> the
>> > > > default
>> > > > > > >>>     implementation.
>> > > > > > >>>
>> > > > > > >>>     Testing:
>> > > > > > >>>     - Ran core and exhaustive tests
>> > > > > > >>>
>> > > > > > >>>     Change-Id: Iba4a81165b3d2012dc04d4115454372c41e39f08
>> > > > > > >>>     Reviewed-on: http://gerrit.cloudera.org:8080/10940
>> > > > > > >>>     Reviewed-by: Impala Public Jenkins <
>> > > > > > >>> impala-public-jenk...@cloudera.com>
>> > > > > > >>>     Tested-by: Impala Public Jenkins <
>> > > > > > impala-public-jenk...@cloudera.com
>> > > > > > >>> >
>> > > > > > >>>
>> > > > > > >>
>> > > > > >
>> > > > >
>> > > >
>> > >
>> >
>>
>

Reply via email to