Steve Loughran created HADOOP-18966:
---------------------------------------

             Summary: s3a file rename does double HEAD or LIST on source 
fiile/dir
                 Key: HADOOP-18966
                 URL: https://issues.apache.org/jira/browse/HADOOP-18966
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: fs/s3
    Affects Versions: 3.3.6
            Reporter: Steve Loughran


going to add this as a v2 issue, though really it's been latent.

when you do a rename() we do a getFileStatus call to get the list/file 
status..if this is a dir that's a single LIST; a file then it'll be LIST + HEAD.

when the actual copy is kicked off, 
* if the source is a dir, then a duplicate LIST is initiated straight 
afterwards.
* FIle: there's a second HEAD to get the metadata for the copy

Proposed.
* LIST initiates a full list, and if it returns a list of objects, that list 
iterator is passed in to rename.
* full result of HEAD preserved and passed down to copyFile()

will cut out one round trip regardless of source type





--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to