[
https://issues.apache.org/jira/browse/PARQUET-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17553554#comment-17553554
]
ASF GitHub Bot commented on PARQUET-2158:
-----------------------------------------
steveloughran commented on PR #976:
URL: https://github.com/apache/parquet-mr/pull/976#issuecomment-1153851509
thrift module doesn't compile: it is using a hadoop internal class tagged as
private, and that class made an incompatible change in hadoop 3. see HADOOP-12436
```
Error: Failed to execute goal org.apache.maven.plugins:maven-compiler-plugin:3.8.1:compile (default-compile) on project parquet-thrift: Compilation failure
Error: /home/runner/work/parquet-mr/parquet-mr/parquet-thrift/src/main/java/org/apache/parquet/thrift/projection/deprecated/PathGlobPattern.java:[55,49] incompatible types: com.google.re2j.Pattern cannot be converted to java.util.regex.Pattern
```
the good news: the class is deprecated, which explains why nobody has seen
this in the wild. Any attempt to use that class would fail with hadoop 3.x on the
classpath.
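for illustration only, one way to stop depending on the private hadoop class would be to build the `java.util.regex.Pattern` directly. this is a minimal sketch, not what the PR does: `SimpleGlob` is a hypothetical name, and it assumes a reduced glob syntax where only `*` (any run of characters) and `?` (any single character) are special; everything else is matched literally.

```java
import java.util.regex.Pattern;

/** Hypothetical helper; not part of parquet-mr or hadoop. */
public final class SimpleGlob {
  private SimpleGlob() {}

  /**
   * Translates a glob into a java.util.regex.Pattern.
   * Assumed syntax: '*' matches any run of characters, '?' matches one
   * character, all other characters are literals.
   */
  public static Pattern compile(String glob) {
    StringBuilder regex = new StringBuilder();
    StringBuilder literal = new StringBuilder();
    for (int i = 0; i < glob.length(); i++) {
      char c = glob.charAt(i);
      if (c == '*' || c == '?') {
        // Emit any pending literal text before the wildcard.
        flush(literal, regex);
        regex.append(c == '*' ? ".*" : ".");
      } else {
        literal.append(c);
      }
    }
    flush(literal, regex);
    return Pattern.compile(regex.toString());
  }

  /** Appends the buffered literal, quoted so regex metacharacters stay literal. */
  private static void flush(StringBuilder literal, StringBuilder regex) {
    if (literal.length() > 0) {
      regex.append(Pattern.quote(literal.toString()));
      literal.setLength(0);
    }
  }
}
```

usage would be along the lines of `SimpleGlob.compile("a/*/c").matcher("a/b/c").matches()`. the return type is the JDK `Pattern` whichever hadoop version is on the classpath, so the type mismatch in the log above cannot occur.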
> Upgrade Hadoop dependency to version 3.2.0
> ------------------------------------------
>
> Key: PARQUET-2158
> URL: https://issues.apache.org/jira/browse/PARQUET-2158
> Project: Parquet
> Issue Type: Improvement
> Components: parquet-mr
> Affects Versions: 1.13.0
> Reporter: Steve Loughran
> Priority: Major
>
> Parquet still builds against Hadoop 2.10. This is very out of date and does
> not work with java 11, let alone later releases.
> Upgrading the dependency to Hadoop 3.2.0 makes the release compatible with
> java 11, and lines up with active work on HADOOP-18287, _Provide a shim
> library for modern FS APIs_
> This will significantly speed up access to columnar data, especially in
> cloud stores.