Steve Loughran created HADOOP-15944:
---------------------------------------

             Summary: S3AInputStream logging to make it easier to debug file 
leakage
                 Key: HADOOP-15944
                 URL: https://issues.apache.org/jira/browse/HADOOP-15944
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: fs/s3
    Affects Versions: 3.1.1
            Reporter: Steve Loughran


Problem: if an app opens too many input streams, then all the http connections 
in the S3A pool can be used up; all attempts to do other FS operations fail 
timing out for http pool access

Proposed simple solution: log better what's going on with input stream 
lifecyce, specifically

# include URL of file in open, reopen & close events
# maybe: Separate logger for these events, though S3A Input stream should be 
enough as it doesn't do much else.
# maybe: have some prefix in the events like "Lifecycle", so that you could use 
the existing log @ debug, grep for that phrase and look at the printed URLs to 
identify what's going on
# stream metrics: expose some of the state of the http connection pool and/or 
active input and output streams

Idle output streams don't use up http connections, as they only connect during 
block upload.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org

Reply via email to