[ 
https://issues.apache.org/jira/browse/PARQUET-2158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17553554#comment-17553554
 ] 

ASF GitHub Bot commented on PARQUET-2158:
-----------------------------------------

steveloughran commented on PR #976:
URL: https://github.com/apache/parquet-mr/pull/976#issuecomment-1153851509

   thrift module doesn't compile is using an hadoop internal class tagged as 
private & which made an incompatible change in hadoop 3. see  HADOOP-12436
   
   ```
   Error:  Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.8.1:compile (default-compile) 
on project parquet-thrift: Compilation failure
   Error:  
/home/runner/work/parquet-mr/parquet-mr/parquet-thrift/src/main/java/org/apache/parquet/thrift/projection/deprecated/PathGlobPattern.java:[55,49]
 incompatible types: com.google.re2j.Pattern cannot be converted to 
java.util.regex.Pattern
   ```
   
   the good news, the class is deprecated, which explains why nobody has seen 
it in the wild. Any attempt to use that class would fail with hadoop 3.x on the 
classpath. 




> Upgrade Hadoop dependency to version 3.2.0
> ------------------------------------------
>
>                 Key: PARQUET-2158
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2158
>             Project: Parquet
>          Issue Type: Improvement
>          Components: parquet-mr
>    Affects Versions: 1.13.0
>            Reporter: Steve Loughran
>            Priority: Major
>
> Parquet still builds against Hadoop 2.10. This is very out of date and does 
> not work with java 11, let alone later releases.
> Upgrading the dependency to Hadoop 3.2.0 makes the release compatible with 
> java 11, and lines up with active work on  HADOOP-18287,  _Provide a shim 
> library for modern FS APIs_ 
> This will significantly speed up access to columnar data, especially  in 
> cloud stores.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to