[ 
https://issues.apache.org/jira/browse/PARQUET-2277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17712323#comment-17712323
 ] 

Steve Loughran commented on PARQUET-2277:
-----------------------------------------

happy. Have you considered cutting hadoop 2 support entirely? Because with 
parquet building on 3.3.5 you are in a position to take up Mukund's vector IO 
patch PARQUET-2171 and see significant speedup in local IO reads (java nio at 
work) and on s3 through the s3a connector (parallel range requests)

targeting hadoop 3.3.x only gives you an openfile call where you can skip all 
HEAD probes and ask for random IO too. cloud speedup all round.

> Bump hadoop.version from 3.2.3 to 3.3.5
> ---------------------------------------
>
>                 Key: PARQUET-2277
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2277
>             Project: Parquet
>          Issue Type: Improvement
>    Affects Versions: 1.13.0
>            Reporter: Fokko Driesprong
>            Assignee: Fokko Driesprong
>            Priority: Major
>             Fix For: 1.14.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to