[jira] [Commented] (HADOOP-10203) Connection leak in Jets3tNativeFileSystemStore#retrieveMetadata

2014-01-04 Thread Steve Loughran (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862284#comment-13862284
 ] 

Steve Loughran commented on HADOOP-10203:
-

why the move from the S3 classes to the Storage ones? I don't see it compiling 
without imports and some other changes -but I'll wait to see what Jenkins says 
there

 Connection leak in Jets3tNativeFileSystemStore#retrieveMetadata 
 

 Key: HADOOP-10203
 URL: https://issues.apache.org/jira/browse/HADOOP-10203
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs/s3
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
 Attachments: HADOOP-10203-trunk.patch, HADOOP-10203.patch


 Jets3tNativeFileSystemStore#retrieveMetadata  is leaking connections. 
 This affects any client that tries to read many small files very quickly 
 (e.g. distcp from s3 to hdfs with small files blocks due to connection pool 
 starvation). 
 This is not a problem for larger files because when the GC runs any 
 connection that's out of scope will be released in #finalize().
 We are seeing the following log messages as a symptom of this problem:
 {noformat}
 13/12/26 13:40:01 WARN httpclient.HttpMethodReleaseInputStream: Attempting to 
 release HttpMethod in finalize() as its response data stream has gone out of 
 scope. This attempt will not always succeed and cannot be relied upon! Please 
 ensure response data streams are always fully consumed or closed to avoid 
 HTTP connection starvation.
 13/12/26 13:40:01 WARN httpclient.HttpMethodReleaseInputStream: Successfully 
 released HttpMethod in finalize(). You were lucky this time... Please ensure 
 response data streams are always fully consumed or closed.
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HADOOP-10203) Connection leak in Jets3tNativeFileSystemStore#retrieveMetadata

2014-01-04 Thread Steve Loughran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-10203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran updated HADOOP-10203:


Status: Patch Available  (was: Open)

 Connection leak in Jets3tNativeFileSystemStore#retrieveMetadata 
 

 Key: HADOOP-10203
 URL: https://issues.apache.org/jira/browse/HADOOP-10203
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs/s3
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
 Attachments: HADOOP-10203-trunk.patch, HADOOP-10203.patch


 Jets3tNativeFileSystemStore#retrieveMetadata  is leaking connections. 
 This affects any client that tries to read many small files very quickly 
 (e.g. distcp from s3 to hdfs with small files blocks due to connection pool 
 starvation). 
 This is not a problem for larger files because when the GC runs any 
 connection that's out of scope will be released in #finalize().
 We are seeing the following log messages as a symptom of this problem:
 {noformat}
 13/12/26 13:40:01 WARN httpclient.HttpMethodReleaseInputStream: Attempting to 
 release HttpMethod in finalize() as its response data stream has gone out of 
 scope. This attempt will not always succeed and cannot be relied upon! Please 
 ensure response data streams are always fully consumed or closed to avoid 
 HTTP connection starvation.
 13/12/26 13:40:01 WARN httpclient.HttpMethodReleaseInputStream: Successfully 
 released HttpMethod in finalize(). You were lucky this time... Please ensure 
 response data streams are always fully consumed or closed.
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HADOOP-10204) ThrottledInputStream should #close() the underlying stream

2014-01-04 Thread Steve Loughran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran updated HADOOP-10204:


Status: Patch Available  (was: Open)

 ThrottledInputStream should #close() the underlying stream 
 ---

 Key: HADOOP-10204
 URL: https://issues.apache.org/jira/browse/HADOOP-10204
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
 Attachments: HADOOP-10204-trunk.patch, HADOOP-10204.patch


 While working on HADOOP-10203 I've noticed that ThrottledInputStream (DistCP 
 V2) does not override #close(). This can also leak connection.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (HADOOP-10204) ThrottledInputStream should #close() the underlying stream

2014-01-04 Thread Steve Loughran (JIRA)

 [ 
https://issues.apache.org/jira/browse/HADOOP-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Loughran updated HADOOP-10204:


Priority: Minor  (was: Major)

 ThrottledInputStream should #close() the underlying stream 
 ---

 Key: HADOOP-10204
 URL: https://issues.apache.org/jira/browse/HADOOP-10204
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
Priority: Minor
 Attachments: HADOOP-10204-trunk.patch, HADOOP-10204.patch


 While working on HADOOP-10203 I've noticed that ThrottledInputStream (DistCP 
 V2) does not override #close(). This can also leak connection.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HADOOP-10204) ThrottledInputStream should #close() the underlying stream

2014-01-04 Thread Steve Loughran (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862290#comment-13862290
 ] 

Steve Loughran commented on HADOOP-10204:
-

looking at the code, probably good idea to tag the constructor arg as 
{{@Nonnull}} and have the ctor raise the NPE if it is null, avoids having a 
check on the close(). That'll let you add a test too, to keep jenkins happy

 ThrottledInputStream should #close() the underlying stream 
 ---

 Key: HADOOP-10204
 URL: https://issues.apache.org/jira/browse/HADOOP-10204
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
 Attachments: HADOOP-10204-trunk.patch, HADOOP-10204.patch


 While working on HADOOP-10203 I've noticed that ThrottledInputStream (DistCP 
 V2) does not override #close(). This can also leak connection.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HADOOP-10204) ThrottledInputStream should #close() the underlying stream

2014-01-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862293#comment-13862293
 ] 

Hadoop QA commented on HADOOP-10204:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12621438/HADOOP-10204-trunk.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:red}-1 javac{color:red}.  The patch appears to cause the build to 
fail.

Console output: 
https://builds.apache.org/job/PreCommit-HADOOP-Build/3398//console

This message is automatically generated.

 ThrottledInputStream should #close() the underlying stream 
 ---

 Key: HADOOP-10204
 URL: https://issues.apache.org/jira/browse/HADOOP-10204
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
Priority: Minor
 Attachments: HADOOP-10204-trunk.patch, HADOOP-10204.patch


 While working on HADOOP-10203 I've noticed that ThrottledInputStream (DistCP 
 V2) does not override #close(). This can also leak connection.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HADOOP-10203) Connection leak in Jets3tNativeFileSystemStore#retrieveMetadata

2014-01-04 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-10203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862295#comment-13862295
 ] 

Hadoop QA commented on HADOOP-10203:


{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12621436/HADOOP-10203-trunk.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 eclipse:eclipse{color}.  The patch built with 
eclipse:eclipse.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 core tests{color}.  The patch passed unit tests in 
hadoop-common-project/hadoop-common.

{color:green}+1 contrib tests{color}.  The patch passed contrib unit tests.

Test results: 
https://builds.apache.org/job/PreCommit-HADOOP-Build/3399//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-HADOOP-Build/3399//console

This message is automatically generated.

 Connection leak in Jets3tNativeFileSystemStore#retrieveMetadata 
 

 Key: HADOOP-10203
 URL: https://issues.apache.org/jira/browse/HADOOP-10203
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs/s3
 Environment: CDH 2.0.0-cdh4.5.0 
 (30821ec616ee7a21ee8447949b7c6208a8f1e7d8) 
Reporter: Andrei Savu
Assignee: Andrei Savu
 Attachments: HADOOP-10203-trunk.patch, HADOOP-10203.patch


 Jets3tNativeFileSystemStore#retrieveMetadata  is leaking connections. 
 This affects any client that tries to read many small files very quickly 
 (e.g. distcp from s3 to hdfs with small files blocks due to connection pool 
 starvation). 
 This is not a problem for larger files because when the GC runs any 
 connection that's out of scope will be released in #finalize().
 We are seeing the following log messages as a symptom of this problem:
 {noformat}
 13/12/26 13:40:01 WARN httpclient.HttpMethodReleaseInputStream: Attempting to 
 release HttpMethod in finalize() as its response data stream has gone out of 
 scope. This attempt will not always succeed and cannot be relied upon! Please 
 ensure response data streams are always fully consumed or closed to avoid 
 HTTP connection starvation.
 13/12/26 13:40:01 WARN httpclient.HttpMethodReleaseInputStream: Successfully 
 released HttpMethod in finalize(). You were lucky this time... Please ensure 
 response data streams are always fully consumed or closed.
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HADOOP-9992) Modify the NN loadGenerator to optionally run as a MapReduce job

2014-01-04 Thread Sanjay Radia (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862502#comment-13862502
 ] 

Sanjay Radia commented on HADOOP-9992:
--

Looks good. Update the javadoc to reflect the MR option. Also you have left 
some debugging printfs in the code.

 Modify the NN loadGenerator to optionally run as a MapReduce job
 

 Key: HADOOP-9992
 URL: https://issues.apache.org/jira/browse/HADOOP-9992
 Project: Hadoop Common
  Issue Type: Bug
Reporter: Akshay Radia
Assignee: Akshay Radia
 Attachments: hadoop-9992.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Commented] (HADOOP-9992) Modify the NN loadGenerator to optionally run as a MapReduce job

2014-01-04 Thread Sanjay Radia (JIRA)

[ 
https://issues.apache.org/jira/browse/HADOOP-9992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13862506#comment-13862506
 ] 

Sanjay Radia commented on HADOOP-9992:
--

Update MapredTestDriver to include the loadGenerator for convenience.

 Modify the NN loadGenerator to optionally run as a MapReduce job
 

 Key: HADOOP-9992
 URL: https://issues.apache.org/jira/browse/HADOOP-9992
 Project: Hadoop Common
  Issue Type: Bug
Reporter: Akshay Radia
Assignee: Akshay Radia
 Attachments: hadoop-9992.patch






--
This message was sent by Atlassian JIRA
(v6.1.5#6160)