Mukund Thakur created PARQUET-2171:
--------------------------------------

             Summary: Implement vectored IO in parquet file format
                 Key: PARQUET-2171
                 URL: https://issues.apache.org/jira/browse/PARQUET-2171
             Project: Parquet
          Issue Type: New Feature
          Components: parquet-mr
            Reporter: Mukund Thakur


We recently added a new feature called vectored IO in Hadoop for improving read 
performance for seek heavy readers. Spark Jobs and others which uses parquet 
will greatly benefit from this api. Details can be found hereĀ 

[https://github.com/apache/hadoop/commit/e1842b2a749d79cbdc15c524515b9eda64c339d5]

https://issues.apache.org/jira/browse/HADOOP-18103

https://issues.apache.org/jira/browse/HADOOP-11867



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to