[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-08-04 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13428728#comment-13428728
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-0.94-security-on-Hadoop-23 #6 (See 
[https://builds.apache.org/job/HBase-0.94-security-on-Hadoop-23/6/])
HBASE-6406 Disable TestZooKeeper#testClientSessionExpired (Revision 1364207)
HBASE-6406 Remove TestReplicationPeer (Revision 1363213)

 Result = FAILURE
larsh : 
Files : 
* /hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/TestZooKeeper.java

larsh : 
Files : 
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationPeer.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.1

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-24 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13421666#comment-13421666
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-0.94-security #45 (See 
[https://builds.apache.org/job/HBase-0.94-security/45/])
HBASE-6406 Disable TestZooKeeper#testClientSessionExpired (Revision 1364207)

 Result = FAILURE
larsh : 
Files : 
* /hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/TestZooKeeper.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.1

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-23 Thread Jonathan Hsieh (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13421143#comment-13421143
 ] 

Jonathan Hsieh commented on HBASE-6406:
---

Just as a note, the TestZooKeeper.testClientSessionExpired test fails 
occasionally in 0.90.6.  I have a patch that reduces the tests failure 
frequency, for 0.90 (fails 1 out of 400 runs), but haven't investigated in 
newer versions yet.

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.1

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-22 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13420158#comment-13420158
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #103 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/103/])
HBASE-6406 Disable TestZooKeeper#testClientSessionExpired (Revision 1364204)

 Result = FAILURE
larsh : 
Files : 
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/TestZooKeeper.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.2

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-21 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13419778#comment-13419778
 ] 

stack commented on HBASE-6406:
--

Disable the test for now I'd say Lars.  Make an issue to examine whats up but 
get it out of the way of 0.94.1 I'd say.

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.2

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-21 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13420090#comment-13420090
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-0.94 #351 (See 
[https://builds.apache.org/job/HBase-0.94/351/])
HBASE-6406 Disable TestZooKeeper#testClientSessionExpired (Revision 1364207)

 Result = SUCCESS
larsh : 
Files : 
* /hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/TestZooKeeper.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.2

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-21 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13420094#comment-13420094
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-TRUNK #3158 (See 
[https://builds.apache.org/job/HBase-TRUNK/3158/])
HBASE-6406 Disable TestZooKeeper#testClientSessionExpired (Revision 1364204)

 Result = FAILURE
larsh : 
Files : 
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/TestZooKeeper.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.2

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-20 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13419651#comment-13419651
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-0.94-security #44 (See 
[https://builds.apache.org/job/HBase-0.94-security/44/])
HBASE-6406 Remove TestReplicationPeer (Revision 1363213)

 Result = FAILURE
larsh : 
Files : 
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationPeer.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.1

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-19 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418225#comment-13418225
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #99 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/99/])
HBASE-6406 Remove TestReplicationPeer (Revision 1363217)

 Result = FAILURE
larsh : 
Files : 
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationPeer.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-19 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418363#comment-13418363
 ] 

ramkrishna.s.vasudevan commented on HBASE-6406:
---

@Lars
Can you just attach the patch over here? Thanks..

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-19 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418936#comment-13418936
 ] 

Lars Hofhansl commented on HBASE-6406:
--

TestZooKeeper.testClientSessionExpired failed again in latest 0.94 build.
Although this is not obvious from the logs the pattern in the code is that same 
as in TestReplicationPeer.

My initial suspicion was RecoverableZooKeeper and that it somehow retries the 
operation and thereby reconnects the expired session. According to the code it 
does not do that, though.

Somehow HBaseTestingUtil.expireSession is subject to racing.
In the case of TestReplicationPeer that happened when expireSession is called 
before the connection was actually established.

Is there a way to check whether the connection was established first and wait 
if it wasn't?
Otherwise, I'd say we disable this test for now.


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.96.0, 0.94.1

 Attachments: 6406.txt, testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13417310#comment-13417310
 ] 

Lars Hofhansl commented on HBASE-6406:
--

Hmm... Looks like TestReplication is hanging in setup waiting for the root 
region to be assigned.
TestZooKeeper also appears to be waiting for a RegionServer to start.

These would seem to be more general issues with starting the MiniCluster

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Zhihong Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13417665#comment-13417665
 ] 

Zhihong Ted Yu commented on HBASE-6406:
---

For trunk, TestZooKeeper hung with the following output:
{code}
2012-07-18 13:24:34,764 INFO  
[Master:0;sdev25.arch.ebay.com,59816,1342643039714] master.HMaster(455): 
HMaster main thread exiting

2012-07-18 13:24:34,764 INFO  
[RegionServer:2;sdev25.arch.ebay.com,60707,1342643074759] 
zookeeper.RecoverableZooKeeper(102): The identifier of this process is 
15496@sdev25

2012-07-18 13:24:34,772 DEBUG 
[RegionServer:2;sdev25.arch.ebay.com,60707,1342643074759-EventThread] 
zookeeper.ZooKeeperWatcher(262): regionserver:60707 Received ZooKeeper Event, 
type=None, state=SyncConnected, path=null

2012-07-18 13:24:34,773 DEBUG 
[RegionServer:2;sdev25.arch.ebay.com,60707,1342643074759] 
zookeeper.ZKUtil(238): regionserver:60707 /hbase/master does not exist. Watcher 
is set.

2012-07-18 13:24:34,774 DEBUG 
[RegionServer:2;sdev25.arch.ebay.com,60707,1342643074759-EventThread] 
zookeeper.ZooKeeperWatcher(339): regionserver:60707-0x1389bc2dddb000c connected

2012-07-18 13:24:35,062 INFO  
[sdev25.arch.ebay.com,59816,1342643039714.splitLogManagerTimeoutMonitor] 
hbase.Chore(82): 
sdev25.arch.ebay.com,59816,1342643039714.splitLogManagerTimeoutMonitor exiting

2012-07-18 13:24:35,080 DEBUG 
[RegionServer:0;sdev25.arch.ebay.com,48349,1342643039994] 
regionserver.HRegionServer(1817): No master found; retry

2012-07-18 13:24:36,081 DEBUG 
[RegionServer:0;sdev25.arch.ebay.com,48349,1342643039994] 
regionserver.HRegionServer(1817): No master found; retry
{code}


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13417811#comment-13417811
 ] 

Lars Hofhansl commented on HBASE-6406:
--

Talked to Chris Trezzo (who wrote TestReplicationPeer). We both think that we 
should just yank the test. It basically just validates that a new ZKWatcher can 
connect to the ZK ensemble after another ZKWatcher was disconnected.

If there're no objection I'll make that so.

Looking at TestZooKeeper next.

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13417836#comment-13417836
 ] 

Andrew Purtell commented on HBASE-6406:
---

bq. Talked to Chris Trezzo (who wrote TestReplicationPeer). We both think that 
we should just yank the test. 

+1

But that's different from TestReplication. I opened HBASE-6424

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418013#comment-13418013
 ] 

Lars Hofhansl commented on HBASE-6406:
--

I removed TestReplicationPeer.
It seems TestZooKeeper is fixed by reverting HBASE-6389.

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418044#comment-13418044
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-0.94 #339 (See 
[https://builds.apache.org/job/HBase-0.94/339/])
HBASE-6406 Remove TestReplicationPeer (Revision 1363213)

 Result = FAILURE
larsh : 
Files : 
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationPeer.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Zhihong Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418047#comment-13418047
 ] 

Zhihong Ted Yu commented on HBASE-6406:
---

TestReplicationPeer.java should be removed from trunk as well, right ?

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418051#comment-13418051
 ] 

Lars Hofhansl commented on HBASE-6406:
--

Oops. Yes.

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418053#comment-13418053
 ] 

Lars Hofhansl commented on HBASE-6406:
--

Done.

 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-18 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13418072#comment-13418072
 ] 

Hudson commented on HBASE-6406:
---

Integrated in HBase-TRUNK #3148 (See 
[https://builds.apache.org/job/HBase-TRUNK/3148/])
HBASE-6406 Remove TestReplicationPeer (Revision 1363217)

 Result = FAILURE
larsh : 
Files : 
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationPeer.java


 TestReplicationPeer.testResetZooKeeperSession and 
 TestZooKeeper.testClientSessionExpired fail frequently
 

 Key: HBASE-6406
 URL: https://issues.apache.org/jira/browse/HBASE-6406
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.1
Reporter: Lars Hofhansl
Assignee: Lars Hofhansl
 Fix For: 0.94.2

 Attachments: testReplication.jstack, testZooKeeper.jstack


 Looking back through the 0.94 test runs these two tests accounted for 11 of 
 34 failed tests.
 They should be fixed or (temporarily) disabled.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira




[jira] [Commented] (HBASE-6406) TestReplicationPeer.testResetZooKeeperSession and TestZooKeeper.testClientSessionExpired fail frequently

2012-07-17 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13416673#comment-13416673
 ] 

Lars Hofhansl commented on HBASE-6406:
--

The first one fails due to a race condition:
{quote}
2012-07-16 03:36:04,631 INFO  [pool-1-thread-1] 
zookeeper.MiniZooKeeperCluster(193): Started MiniZK Cluster and connect 1 ZK 
server on client port: 61529
2012-07-16 03:36:04,760 DEBUG [pool-1-thread-1] zookeeper.ZKUtil(100): 
connection to cluster: clusterId opening connection to ZooKeeper with ensemble 
(localhost:61529)
2012-07-16 03:36:04,831 INFO  [pool-1-thread-1] 
zookeeper.RecoverableZooKeeper(97): The identifier of this process is 
22...@vesta.apache.org
2012-07-16 03:36:04,927 INFO  [pool-1-thread-1] hbase.ResourceChecker(145): 
before replication.TestReplicationPeer#testResetZooKeeperSession: 11 threads, 
86 file descriptors 0 connections, 
2012-07-16 03:36:07,918 DEBUG [pool-1-thread-1-EventThread] 
zookeeper.ZooKeeperWatcher(262): connection to cluster: clusterId Received 
ZooKeeper Event, type=None, state=SyncConnected, path=null
2012-07-16 03:36:07,926 INFO  [Thread-2] replication.TestReplicationPeer(54): 
Expiring ReplicationPeer ZooKeeper session.
2012-07-16 03:36:07,950 DEBUG [pool-1-thread-1-EventThread] 
zookeeper.ZooKeeperWatcher(339): connection to cluster: 
clusterId-0x1388ddb6141 connected
2012-07-16 03:36:08,091 INFO  [Thread-2] hbase.HBaseTestingUtility(1344): ZK 
Closed Session 0x1388ddb6141; sleeping=7000
2012-07-16 03:36:15,092 INFO  [Thread-2] replication.TestReplicationPeer(58): 
Attempting to use expired ReplicationPeer ZooKeeper session.
2012-07-16 03:36:15,095 INFO  [pool-1-thread-1] hbase.ResourceChecker(145): 
after replication.TestReplicationPeer#testResetZooKeeperSession: 11 threads 
(was 11), 89 file descriptors (was 89). 0 connections, 
{quote}

A successful run looks like this:
{quote}
2012-07-17 15:20:35,285 INFO  [main] zookeeper.MiniZooKeeperCluster(193): 
Started MiniZK Cluster and connect 1 ZK server on client port: 49834
2012-07-17 15:20:35,298 DEBUG [main] zookeeper.ZKUtil(100): connection to 
cluster: clusterId opening connection to ZooKeeper with ensemble 
(localhost:49834)
2012-07-17 15:20:35,312 INFO  [main] zookeeper.RecoverableZooKeeper(97): The 
identifier of this process is 26186@
2012-07-17 15:20:35,336 DEBUG [main-EventThread] 
zookeeper.ZooKeeperWatcher(262): connection to cluster: clusterId Received 
ZooKeeper Event, type=None, state=SyncConnected, path=null
2012-07-17 15:20:35,338 DEBUG [main-EventThread] 
zookeeper.ZooKeeperWatcher(339): connection to cluster: 
clusterId-0x1389707502a connected
2012-07-17 15:20:35,348 INFO  [main] hbase.ResourceChecker(145): before 
replication.TestReplicationPeer#testResetZooKeeperSession: 10 threads, 87 file 
descriptors 0 connections, 
2012-07-17 15:20:35,356 INFO  [Thread-2] replication.TestReplicationPeer(56): 
Expiring ReplicationPeer ZooKeeper session.
2012-07-17 15:20:35,360 INFO  [Thread-2] hbase.HBaseTestingUtility(1344): ZK 
Closed Session 0x1389707502a; sleeping=7000
2012-07-17 15:20:35,459 DEBUG [main-EventThread] 
zookeeper.ZooKeeperWatcher(262): connection to cluster: 
clusterId-0x1389707502a Received ZooKeeper Event, type=None, 
state=Disconnected, path=null
2012-07-17 15:20:35,459 DEBUG [main-EventThread] 
zookeeper.ZooKeeperWatcher(360): connection to cluster: 
clusterId-0x1389707502a Received Disconnected from ZooKeeper, ignoring
2012-07-17 15:20:37,267 DEBUG [main-EventThread] 
zookeeper.ZooKeeperWatcher(262): connection to cluster: 
clusterId-0x1389707502a Received ZooKeeper Event, type=None, state=Expired, 
path=null
2012-07-17 15:20:37,269 WARN  [main-EventThread] 
replication.ReplicationPeer(157): The ReplicationPeer coresponding to peer 
clusterKey was aborted for the following reason(s):connection to cluster: 
clusterId-0x1389707502a connection to cluster: clusterId-0x1389707502a 
received expired from ZooKeeper, aborting
org.apache.zookeeper.KeeperException$SessionExpiredException: KeeperErrorCode = 
Session expired
at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.connectionEvent(ZooKeeperWatcher.java:374)
at 
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.process(ZooKeeperWatcher.java:271)
at 
org.apache.zookeeper.ClientCnxn$EventThread.processEvent(ClientCnxn.java:521)
at org.apache.zookeeper.ClientCnxn$EventThread.run(ClientCnxn.java:497)
2012-07-17 15:20:42,360 INFO  [Thread-2] replication.TestReplicationPeer(60): 
Attempting to use expired ReplicationPeer ZooKeeper session.
2012-07-17 15:20:42,362 DEBUG [Thread-2] zookeeper.ZKUtil(100): connection to 
cluster: clusterId opening connection to ZooKeeper with ensemble 
(localhost:49834)
2012-07-17 15:20:42,363 INFO  [Thread-2] zookeeper.RecoverableZooKeeper(97): 
The identifier of this process is 26186@
2012-07-17 15:20:42,364 INFO  [Thread-2]