dzcxzl created ORC-1845:
---------------------------
Summary: Reduce read requests for ORC stored in S3
Key: ORC-1845
URL: https://issues.apache.org/jira/browse/ORC-1845
Project: ORC
Issue Type: Improvement
Reporter: dzcxzl
Read ORC file will have two requests, one is open, one is getFileStatus, the
logic of open will also call the getFileStatus request to get the file's status.
org.apache.orc.impl.ReaderImpl#extractFileTail
{code:java}
file = fs.open(path);
FileStatus fileStatus = fs.getFileStatus(path);
size = fileStatus.getLen(); {code}
org.apache.hadoop.fs.s3a.S3AFileSystem#executeOpen
{code:java}
final S3AFileStatus fileStatus =
trackDuration(inputStreamStats,
ACTION_FILE_OPENED.getSymbol(), () ->
extractOrFetchSimpleFileStatus(path, fileInformation)); {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)