Steve Loughran created AVRO-3594:
------------------------------------
Summary: FsInput to use openFile() API for cloud storage read
performance
Key: AVRO-3594
URL: https://issues.apache.org/jira/browse/AVRO-3594
Project: Apache Avro
Issue Type: Improvement
Components: java
Affects Versions: 1.11.2
Reporter: Steve Loughran
avro can now use the FileSystem.openFile() API to open a file on a hadoop
filesystem connector (HADOOP-15229).
by setting the file length and fadvise policy through opt() calls, the clients
can
* skip a HEAD request when opening a file
* optimise the ranges of GET request for sequential access, even in clusters
where s3a has been configured to use random iO (which some hive clusters do)
filesystems/releases which don't recognise the options added in HADOOP-16202
will ignore them; the api will fall back to classic open(path) API call if the
connector doesn't have a custom implementation.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)