[ 
https://issues.apache.org/jira/browse/HBASE-9338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13751901#comment-13751901
 ] 

Elliott Clark commented on HBASE-9338:
--------------------------------------

These failures are consistent when ChaosMonkey is turned on and set to slow 
deterministic.  If it's turned off, the tests succeed.

Here's an example:
{code}
        Shuffle Errors
                BAD_ID=0
                CONNECTION=0
                IO_ERROR=0
                WRONG_LENGTH=0
                WRONG_MAP=0
                WRONG_REDUCE=0
        org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList$Verify$Counts
                REFERENCED=1199999908
                UNDEFINED=46
                UNREFERENCED=46
        File Input Format Counters 
                Bytes Read=0
        File Output Format Counters 
                Bytes Written=6866
2013-08-27 04:56:16,792 ERROR [main] test.IntegrationTestBigLinkedList$Verify: Expected referenced count does not match with actual referenced count. expected referenced=1200000000 ,actual=1199999908
2013-08-27 04:56:16,792 ERROR [main] test.IntegrationTestBigLinkedList$Verify: Unreferenced nodes were not expected. Unreferenced count=46
2013-08-27 04:56:16,792 ERROR [main] test.IntegrationTestBigLinkedList$Verify: Found an undefined node. Undefined count=46
2013-08-27 04:57:00,129 INFO  [main] hbase.ClusterManager: Executing remote command: ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2 , hostname:a1806.halxg.cloudera.com
2013-08-27 04:57:00,130 INFO  [main] util.Shell: Executing full command [/usr/bin/ssh -o ConnectTimeout=1 -o StrictHostKeyChecking=no a1806.halxg.cloudera.com "ps aux | grep proc_regionserver | grep -v grep | tr -s ' ' | cut -d ' ' -f2"]
2013-08-27 04:57:00,945 INFO  [main] hbase.ClusterManager: Executed remote command, exit code:0 , output:
2013-08-27 04:57:00,945 INFO  [main] hbase.HBaseCluster: Starting RS on: a1806.halxg.cloudera.com
2013-08-27 04:57:00,945 INFO  [main] hbase.ClusterManager: Executing remote command: /opt/hbase/current/bin/../bin/hbase-daemon.sh  start regionserver , hostname:a1806.halxg.cloudera.com
2013-08-27 04:57:00,946 INFO  [main] util.Shell: Executing full command [/usr/bin/ssh -o ConnectTimeout=1 -o StrictHostKeyChecking=no a1806.halxg.cloudera.com "/opt/hbase/current/bin/../bin/hbase-daemon.sh  start regionserver"]
2013-08-27 04:57:02,269 INFO  [main] hbase.ClusterManager: Executed remote command, exit code:0 , output:starting regionserver, logging to /opt/hbase/current/bin/../logs/hbase-hbase-regionserver-a1806.halxg.cloudera.com.out

2013-08-27 04:57:02,270 INFO  [main] hbase.HBaseCluster: Added new HBaseAdmin
2013-08-27 04:57:02,271 ERROR [main] util.AbstractHBaseTool: Error running command-line tool
java.lang.RuntimeException: Verify.verify failed
        at org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList$Loop.runVerify(IntegrationTestBigLinkedList.java:727)
        at org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList$Loop.run(IntegrationTestBigLinkedList.java:760)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList.runTestFromCommandLine(IntegrationTestBigLinkedList.java:1059)
        at org.apache.hadoop.hbase.IntegrationTestBase.doWork(IntegrationTestBase.java:77)
        at org.apache.hadoop.hbase.util.AbstractHBaseTool.run(AbstractHBaseTool.java:112)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
        at org.apache.hadoop.hbase.test.IntegrationTestBigLinkedList.main(IntegrationTestBigLinkedList.java:1094)
Build step 'Execute shell' marked build as failure
{code}
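
For anyone reading the counters above, here is a rough sketch (not part of the test itself) of how the numbers line up. It takes the three Verify counters and the expected referenced count from the log and compares the shortfall against UNDEFINED plus UNREFERENCED. The helper names `parse_counters` and `summarize` are made up for illustration. The reading that each lost node shows up once as undefined and leaves one other node unreferenced is my assumption about how the linked list is verified; the log does not state it.

```python
import re

# Counter lines copied from the Verify output in the log above.
LOG = """\
        REFERENCED=1199999908
        UNDEFINED=46
        UNREFERENCED=46
"""

# "expected referenced" value from the first ERROR line in the log.
EXPECTED_REFERENCED = 1_200_000_000


def parse_counters(text):
    """Hypothetical helper: turn NAME=123 lines into a dict of ints."""
    pairs = re.findall(r"^[ \t]*(\w+)=(\d+)[ \t]*$", text, re.MULTILINE)
    return {name: int(value) for name, value in pairs}


def summarize(counters, expected):
    """Hypothetical helper: compare the shortfall with the error counters."""
    shortfall = expected - counters["REFERENCED"]
    errors = counters["UNDEFINED"] + counters["UNREFERENCED"]
    return {
        "shortfall": shortfall,
        "undefined_plus_unreferenced": errors,
        # Assumption: a lost node is counted once as UNDEFINED and causes
        # one other node to be counted as UNREFERENCED.
        "consistent": shortfall == errors,
    }


summary = summarize(parse_counters(LOG), EXPECTED_REFERENCED)
print(summary)
```

With the values from this run, the shortfall and the sum of the two error counters both come to 92, which would fit 46 lost nodes under the assumption above.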

Word of caution:  If you don't have HBASE-9253 then you are testing 1/100th of 
what is intended.
                
> Test Big Linked List fails on Hadoop 2.1.0
> ------------------------------------------
>
>                 Key: HBASE-9338
>                 URL: https://issues.apache.org/jira/browse/HBASE-9338
>             Project: HBase
>          Issue Type: Bug
>          Components: test
>    Affects Versions: 0.96.0
>            Reporter: Elliott Clark
>            Assignee: Elliott Clark
>            Priority: Critical
>             Fix For: 0.96.0
>
>         Attachments: HBASE-9338-0.patch
>
>


