[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException when 50 regions present

2012-05-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13279413#comment-13279413
 ] 

Hudson commented on HBASE-6018:
---

Integrated in HBase-0.94-security #27 (See 
[https://builds.apache.org/job/HBase-0.94-security/27/])
HBASE-6018 hbck fails with a RejectedExecutionException when 50 regions 
present (Revision 1339877)

 Result = SUCCESS
jmhsieh : 
Files : 
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/util/TestHBaseFsck.java


 hbck fails with a RejectedExecutionException when 50 regions present
 -

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Fix For: 0.92.2, 0.96.0, 0.94.1

 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-17 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277599#comment-13277599
 ] 

Jonathan Hsieh commented on HBASE-6018:
---

I will commit if this passes the hadoopqabot.

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-17 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277643#comment-13277643
 ] 

Hadoop QA commented on HBASE-6018:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12527787/hbase-6018-v2.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

+1 tests included.  The patch appears to include 3 new or modified tests.

+1 hadoop23.  The patch compiles against the hadoop 0.23.x profile.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

-1 findbugs.  The patch appears to introduce 32 new Findbugs (version 
1.3.9) warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

 -1 core tests.  The patch failed these unit tests:
   
org.apache.hadoop.hbase.regionserver.TestSplitTransactionOnCluster
  org.apache.hadoop.hbase.io.hfile.TestForceCacheImportantBlocks

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/1907//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/1907//artifact/trunk/patchprocess/newPatchFindbugsWarnings.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/1907//console

This message is automatically generated.

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-17 Thread Anoop Sam John (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277686#comment-13277686
 ] 

Anoop Sam John commented on HBASE-6018:
---


I feel this change should be fine enough to handle our requirement.
We make the core threads count = max threads count and use an unbounded queue. 
hbasefsck.numthreads - By this count we mean ideally that these many 
parallel operations can run. 
As we make the core threads count as this value rather than 1, the submitted 
tasks will create new threads or allocat to available threads. Only when more 
than these many requests, those will get queued.

Also we make executor.allowCoreThreadTimeOut(true).  So there is absolutely no 
problem in increasing the core threads count from 1..



 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-17 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277920#comment-13277920
 ] 

Lars Hofhansl commented on HBASE-6018:
--

The change is fine. I was just pointing out the executor weirdness.

From your quoted text...
bq. If corePoolSize or more threads are running, the Executor always prefers 
queuing a request rather than adding a new thread

Since LinkedBlockingQueue is unbounded the executor will *never* grow the 
number of threads past the number of core threads specified (re-read the doc if 
you don't believe me :) ). This is certainly not what you want although in the 
case of HBCK it does not matter, as you're not enqueuing 10's of tasks.
At least the matching change should be to set core threads and max threads to 
the same value (in order to avoid later confusion).


 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-17 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277922#comment-13277922
 ] 

Lars Hofhansl commented on HBASE-6018:
--

NM me. The patch already does this. +1

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException when 50 regions present

2012-05-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13278363#comment-13278363
 ] 

Hudson commented on HBASE-6018:
---

Integrated in HBase-TRUNK #2895 (See 
[https://builds.apache.org/job/HBase-TRUNK/2895/])
HBASE-6018 hbck fails with a RejectedExecutionException when 50 regions 
present (Revision 1339876)

 Result = SUCCESS
jmhsieh : 
Files : 
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/util/TestHBaseFsck.java


 hbck fails with a RejectedExecutionException when 50 regions present
 -

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Fix For: 0.92.2, 0.96.0, 0.94.1

 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException when 50 regions present

2012-05-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13278377#comment-13278377
 ] 

Hudson commented on HBASE-6018:
---

Integrated in HBase-0.94 #197 (See 
[https://builds.apache.org/job/HBase-0.94/197/])
HBASE-6018 hbck fails with a RejectedExecutionException when 50 regions 
present (Revision 1339877)

 Result = FAILURE
jmhsieh : 
Files : 
* /hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/util/TestHBaseFsck.java


 hbck fails with a RejectedExecutionException when 50 regions present
 -

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Fix For: 0.92.2, 0.96.0, 0.94.1

 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException when 50 regions present

2012-05-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13278420#comment-13278420
 ] 

Hudson commented on HBASE-6018:
---

Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #8 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/8/])
HBASE-6018 hbck fails with a RejectedExecutionException when 50 regions 
present (Revision 1339876)

 Result = FAILURE
jmhsieh : 
Files : 
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/util/TestHBaseFsck.java


 hbck fails with a RejectedExecutionException when 50 regions present
 -

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Fix For: 0.92.2, 0.96.0, 0.94.1

 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException when 50 regions present

2012-05-17 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13278523#comment-13278523
 ] 

Hudson commented on HBASE-6018:
---

Integrated in HBase-0.92-security #107 (See 
[https://builds.apache.org/job/HBase-0.92-security/107/])
HBASE-6018 hbck fails with a RejectedExecutionException when 50 regions 
present (Revision 1339879)

 Result = FAILURE
jmhsieh : 
Files : 
* /hbase/branches/0.92/CHANGES.txt
* /hbase/branches/0.92/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
* 
/hbase/branches/0.92/src/test/java/org/apache/hadoop/hbase/util/TestHBaseFsck.java


 hbck fails with a RejectedExecutionException when 50 regions present
 -

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Fix For: 0.92.2, 0.96.0, 0.94.1

 Attachments: hbase-6018-v2.patch, hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-16 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13276787#comment-13276787
 ] 

Jonathan Hsieh commented on HBASE-6018:
---


This line seems related to attempt to enqueue a work item into a 
SynchronousQueue introduced in HBASE-4859.  I don't understand why a 
SynchronousQueue is used (it has no capacity!) 

Problem goes away after this change:

{code}
diff --git a/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java 
b/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
index 83aa316..8a050fd 100644
--- a/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
+++ b/src/main/java/org/apache/hadoop/hbase/util/HBaseFsck.java
@@ -33,7 +33,8 @@ import java.util.SortedSet;
 import java.util.TreeMap;
 import java.util.TreeSet;
 import java.util.concurrent.ConcurrentSkipListMap;
-import java.util.concurrent.SynchronousQueue;
+//import java.util.concurrent.SynchronousQueue;
+import java.util.concurrent.LinkedBlockingQueue;
 import java.util.concurrent.ThreadPoolExecutor;
 import java.util.concurrent.TimeUnit;
 import java.util.concurrent.atomic.AtomicInteger;
@@ -217,9 +218,9 @@ public class HBaseFsck {
 this.conf = conf;
 
 int numThreads = conf.getInt(hbasefsck.numthreads, MAX_NUM_THREADS);
-executor = new ThreadPoolExecutor(1, numThreads,
+executor = new ThreadPoolExecutor(numThreads, numThreads,
 THREADS_KEEP_ALIVE_SECONDS, TimeUnit.SECONDS,
-new SynchronousQueueRunnable());
+new LinkedBlockingQueueRunnable());
 executor.allowCoreThreadTimeOut(true);
   }
{code}

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh

 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-16 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13276813#comment-13276813
 ] 

Jonathan Hsieh commented on HBASE-6018:
---

When there are  MAX_NUM_THREADS regions, the SynchronousQueue version works, 
when it there are  MAX_NUM_THREADS regions it fails with the 
RejectedExectionException.

A workaround is to add hbase-site.xml hbasefsck.numthreads property set to a 
value larger than the number of regions in your hbase instance.  (you can 
purposely set it low to trigger the problem).


 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-16 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13276844#comment-13276844
 ] 

Hadoop QA commented on HBASE-6018:
--

-1 overall.  Here are the results of testing the latest attachment 
  http://issues.apache.org/jira/secure/attachment/12527633/hbase-6018.patch
  against trunk revision .

+1 @author.  The patch does not contain any @author tags.

-1 tests included.  The patch doesn't appear to include any new or modified 
tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

+1 hadoop23.  The patch compiles against the hadoop 0.23.x profile.

+1 javadoc.  The javadoc tool did not generate any warning messages.

+1 javac.  The applied patch does not increase the total number of javac 
compiler warnings.

-1 findbugs.  The patch appears to introduce 31 new Findbugs (version 
1.3.9) warnings.

+1 release audit.  The applied patch does not increase the total number of 
release audit warnings.

+1 core tests.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/1888//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/1888//artifact/trunk/patchprocess/newPatchFindbugsWarnings.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/1888//console

This message is automatically generated.

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-16 Thread Zhihong Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277146#comment-13277146
 ] 

Zhihong Yu commented on HBASE-6018:
---

Patch looks good.

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-16 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277237#comment-13277237
 ] 

Lars Hofhansl commented on HBASE-6018:
--

ThreadPoolExecutor is pretty stupid (if you ask me). Unless the Queue fills up 
it will never allocate more than threads than indicated by the number of core 
threads.
The LinkedBlockingQueue has no limit, so it'll only ever use 
conf.getInt(hbasefsck.numthreads) number of threads.

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6018) hbck fails with a RejectedExecutionException

2012-05-16 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13277581#comment-13277581
 ] 

Jonathan Hsieh commented on HBASE-6018:
---

I'm convinced that SynchronousQueue was just wrong and the failure was 
expected behavior.  I'm a little confused by your statement Lars; queues are 
just used if there are more tasks than max concurrent enough threads in the 
pool.

See java doc here 
http://docs.oracle.com/javase/1.5.0/docs/api/java/util/concurrent/ThreadPoolExecutor.html

{quote}
Queuing
Any BlockingQueue may be used to transfer and hold submitted tasks. The use of 
this queue interacts with pool sizing:
If fewer than corePoolSize threads are running, the Executor always prefers 
adding a new thread rather than queuing.
If corePoolSize or more threads are running, the Executor always prefers 
queuing a request rather than adding a new thread.
If a request cannot be queued, a new thread is created unless this would exceed 
maximumPoolSize, in which case, the task will be rejected.
{quote}

 hbck fails with a RejectedExecutionException
 

 Key: HBASE-6018
 URL: https://issues.apache.org/jira/browse/HBASE-6018
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.92.1, 0.94.0
Reporter: Jonathan Hsieh
Assignee: Jonathan Hsieh
 Attachments: hbase-6018.patch


 On a long running job 0.94.0rc3 cluster, we get to a point where hbck 
 consistently encounters this error and fails:
 {code}
 Exception in thread main java.util.concurrent.RejectedExecutionException
   at 
 java.util.concurrent.ThreadPoolExecutor$AbortPolicy.rejectedExecution(ThreadPoolExecutor.java:1768)
   at 
 java.util.concurrent.ThreadPoolExecutor.reject(ThreadPoolExecutor.java:767)
   at 
 java.util.concurrent.ThreadPoolExecutor.execute(ThreadPoolExecutor.java:658)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.loadHdfsRegionInfos(HBaseFsck.java:633)
   at 
 org.apache.hadoop.hbase.util.HBaseFsck.onlineConsistencyRepair(HBaseFsck.java:354)
   at org.apache.hadoop.hbase.util.HBaseFsck.onlineHbck(HBaseFsck.java:382)
   at org.apache.hadoop.hbase.util.HBaseFsck.main(HBaseFsck.java:3120)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira