[jira] [Commented] (HADOOP-13560) S3ABlockOutputStream to support huge (many GB) file writes

Steve Loughran (JIRA) Tue, 04 Oct 2016 13:18:44 -0700

    [ 
https://issues.apache.org/jira/browse/HADOOP-13560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15546540#comment-15546540
 ]


Steve Loughran commented on HADOOP-13560:
-----------------------------------------

Latest patch: factored out semaphore queue in front of an ExecutorService into 
{{SemaphoredDelegatingExecutor}}. As well as being used to manage thread pool 
load, each block output stream gets its own submissions to the central thread 
pool limited by the property {{fs.s3a.block.output.active.limit}}. This will 
let us have a larger common pool, but still limit the amount of bandwidth a 
single stream can consume. Which I need as I want to use that same thread pool 
for parallel rename operations and async mkdir/delete dir operations. 

> S3ABlockOutputStream to support huge (many GB) file writes
> ----------------------------------------------------------
>
>                 Key: HADOOP-13560
>                 URL: https://issues.apache.org/jira/browse/HADOOP-13560
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/s3
>    Affects Versions: 2.9.0
>            Reporter: Steve Loughran
>            Assignee: Steve Loughran
>            Priority: Minor
>         Attachments: HADOOP-13560-branch-2-001.patch, 
> HADOOP-13560-branch-2-002.patch, HADOOP-13560-branch-2-003.patch, 
> HADOOP-13560-branch-2-004.patch
>
>
> An AWS SDK [issue|https://github.com/aws/aws-sdk-java/issues/367] highlights 
> that metadata isn't copied on large copies.
> 1. Add a test to do that large copy/rname and verify that the copy really 
> works
> 2. Verify that metadata makes it over.
> Verifying large file rename is important on its own, as it is needed for very 
> large commit operations for committers using rename



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (HADOOP-13560) S3ABlockOutputStream to support huge (many GB) file writes

Reply via email to