dzcxzl created ORC-1845: --------------------------- Summary: Reduce read requests for ORC stored in S3 Key: ORC-1845 URL: https://issues.apache.org/jira/browse/ORC-1845 Project: ORC Issue Type: Improvement Reporter: dzcxzl
Read ORC file will have two requests, one is open, one is getFileStatus, the logic of open will also call the getFileStatus request to get the file's status. org.apache.orc.impl.ReaderImpl#extractFileTail {code:java} file = fs.open(path); FileStatus fileStatus = fs.getFileStatus(path); size = fileStatus.getLen(); {code} org.apache.hadoop.fs.s3a.S3AFileSystem#executeOpen {code:java} final S3AFileStatus fileStatus = trackDuration(inputStreamStats, ACTION_FILE_OPENED.getSymbol(), () -> extractOrFetchSimpleFileStatus(path, fileInformation)); {code} -- This message was sent by Atlassian Jira (v8.20.10#820010)