Thank Zihao! These are really helpful! I just backport them + IMPALA-9572 (to make IMPALA-10257 a clean cherry-pick):
- IMPALA-9952: Fix page index filtering for empty pages - IMPALA-9572: Fix DCHECK in nested Parquet scanning - IMPALA-10257: Relax check for page filtering - IMPALA-10310: Fix couldn't skip rows in parquet file on NextRowGroup - IMPALA-12565: Fix crash triggered by calling pmod() UDF - IMPALA-5476: Fix catalogd restart brings stale metadata Have ran CORE tests before merging them: https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/19977/ Another patch I want to add for each release is the package support (IMPALA-10262). It helps to build DEB/RPM packages to ease users to try Impala. There are some non-trivial conflicts that I resolved. Uploaded the patch for review: https://gerrit.cloudera.org/c/21129/ IMPALA-10057: Fix log spew by using jars in the classpath https://gerrit.cloudera.org/c/21130/ IMPALA-10262: RPM/DEB Packaging Support Note that IMPALA-10057 is needed to reduce some conflicts. Looking for reviewers for them. Thanks, Quanlong On Thu, Mar 7, 2024 at 5:17 PM Zihao Ye <eyiz...@163.com> wrote: > Thanks to Quanlong for the work on this, it's very valuable for those who > can't upgrade their Hive. > > > We backported the following patches to our internal Impala 3.4.0 cluster > and they have been running well in production. These patches are small and > don't seem to be in the 3.4.2 git log right now, so I suggest considering > including these fixes in the upcoming 3.4.2 release: > > > - IMPALA-12565: Fix crash triggered by calling pmod() UDF. > - IMPALA-10310: Fix inability to skip rows in a parquet file on > NextRowGroup. > - IMPALA-9952: Fix page index filtering for empty pages. > - IMPALA-10257: Relax check for page filtering. > - IMPALA-5476: Fix issue where a catalogd restart results in stale > metadata. > > > Thanks, > Zihao > > On 2024/02/08 00:11:19 Quanlong Huang wrote: > Hi all, > > I've created the 3.4.2 branch and submitted a patch for review to fix the > build issue: https://gerrit.cloudera.org/c/20995/ > > Thanks, > Quanlong > > On Mon, Jan 29, 2024 at 10:03 AM Guojing Feng <fg...@gmail.com> wrote: > > volunteer +1, is neccecary to maintain 3.x releases since its very > expensive to update Hive2 to Hive3 for legacy systems. > > Quanlong Huang <hu...@gmail.com> 于2024年1月27日周六 06:59写道: > > Hi all, > > We still have users on the 3.x branch since they can't upgrade their Hive > to newer versions. It's reported that the 3.4.1 release is not compilable > now (IMPALA-12755). It'd be nice to have a buildable release for 3.4. > > I propose doing a maintenance release of 3.4.2 to fix the build issue > and I > volunteer to manage the release of it. It can branch out based on the > HEAD > of 3.4.1. We can also add some fixes (some are already in 3.x): > > - IMPALA-11296: Fix infinite loop when reading orc files > - IMPALA-11444: Fix wrong results when reading wide rows from ORC > - IMPALA-12102: Avoid memory leaks in the handling of JNI exceptions > - IMPALA-11406: Fix incorrect duration log for authorization > > Optional items: > > - Backport RPM/DEB packaging support (IMPALA-10262) so it's easier to > distribute Impala. > - Backport script changes to support building on Ubuntu 18.04 and > 20.04 > (IMPALA-10007). > > I tried a branch like this: > https://github.com/stiga-huang/impala/commits/branch-3.4.2 > It passed the CORE tests: > https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/19973/ > > Please speak up and let the community know your ideas, especially any bug > fixes that you want to backport to 3.4.2. All feedback is welcome! > > PS This is not a VOTE thread but only a DISCUSS thread. > > Thanks, > Quanlong > > > > > > >