dzcxzl created ORC-1845:
---------------------------

             Summary: Reduce read requests for ORC stored in S3
                 Key: ORC-1845
                 URL: https://issues.apache.org/jira/browse/ORC-1845
             Project: ORC
          Issue Type: Improvement
            Reporter: dzcxzl


Reading an ORC file issues two requests: one open and one getFileStatus. 
The open logic itself also calls getFileStatus to fetch the file's status, 
so the status is requested twice for a single read.

 

org.apache.orc.impl.ReaderImpl#extractFileTail
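{code:java}
// Request 1: open the file (on S3A this also looks up the file status)
file = fs.open(path);

// Request 2: fetch the file status again, only to read the file length
FileStatus fileStatus = fs.getFileStatus(path);
size = fileStatus.getLen(); {code}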
 

org.apache.hadoop.fs.s3a.S3AFileSystem#executeOpen
{code:java}
final S3AFileStatus fileStatus =
    trackDuration(inputStreamStats,
        ACTION_FILE_OPENED.getSymbol(), () ->
        extractOrFetchSimpleFileStatus(path, fileInformation)); {code}
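
To make the request count concrete, here is a small toy model in plain Java. It does not use Hadoop or ORC classes: CountingStore, currentPattern and reusePattern are hypothetical names invented for this sketch, and the store only assumes the behavior described above, namely that open without a known status performs its own status lookup.

```java
import java.util.HashMap;
import java.util.Map;

public class StatusRequestModel {

    /** Hypothetical stand-in for an object store; counts status lookups. */
    static final class CountingStore {
        private final Map<String, Long> lengths = new HashMap<>();
        private int statusRequests = 0;

        void put(String path, long length) {
            lengths.put(path, length);
        }

        /** Models getFileStatus: every call costs one status request. */
        long getFileStatus(String path) {
            statusRequests++;
            Long length = lengths.get(path);
            if (length == null) {
                throw new IllegalArgumentException("No such file: " + path);
            }
            return length;
        }

        /** Models open(path): assumed to look up the status internally. */
        long open(String path) {
            return getFileStatus(path);
        }

        /** Models an open that is handed an already known length. */
        long open(String path, long knownLength) {
            return knownLength; // no status request needed
        }

        int statusRequests() {
            return statusRequests;
        }
    }

    /** Pattern from the issue: open, then a separate getFileStatus. */
    static int currentPattern(CountingStore store, String path) {
        int before = store.statusRequests();
        store.open(path);
        store.getFileStatus(path);
        return store.statusRequests() - before;
    }

    /** Alternative: fetch the status once and pass it to open. */
    static int reusePattern(CountingStore store, String path) {
        int before = store.statusRequests();
        long length = store.getFileStatus(path);
        store.open(path, length);
        return store.statusRequests() - before;
    }

    public static void main(String[] args) {
        CountingStore store = new CountingStore();
        store.put("data.orc", 1024L);
        System.out.println("current pattern: " + currentPattern(store, "data.orc"));
        System.out.println("reuse pattern:   " + reusePattern(store, "data.orc"));
    }
}
```

In this model the current pattern costs two status lookups and the reuse pattern costs one. In Hadoop 3.3 and later, the FileSystem#openFile(Path) builder accepts a known status through withFileStatus(...), which may be one way to get the second behavior; whether ORC adopts that approach is not stated in this issue.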



--
This message was sent by Atlassian Jira
(v8.20.10#820010)