[ 
https://issues.apache.org/jira/browse/ACCUMULO-3141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14139067#comment-14139067
 ] 

Bill Havanki commented on ACCUMULO-3141:
----------------------------------------

Thanks Josh!

FWIW, I didn't have any particular attachment to the idea of "twice the 
standard deviation", beyond it sounding pretty reasonable and handling the 
unbalanced issues at the time. So, this particular problem could either be a 
real thing or just not fitting into the reasonable expectation from before.

> Many RW failures due to balance check
> -------------------------------------
>
>                 Key: ACCUMULO-3141
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-3141
>             Project: Accumulo
>          Issue Type: Bug
>         Environment: 1.5.2 RC1, Hadoop 2.3.0, Zookeeper 3.4.5, CentOS 6, 20 
> node EC2
>            Reporter: Keith Turner
>             Fix For: 1.5.3, 1.6.1, 1.7.0
>
>
> While running RW test against 1.5.2 RC1, 10 of 17 walkers failed with a 
> message like the following.
> {noformat}
> 16 19:35:48,820 [randomwalk.Framework] ERROR: Error during random walk
> java.lang.Exception: Error running node Concurrent.xml
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:285)
>         at 
> org.apache.accumulo.test.randomwalk.Framework.run(Framework.java:63)
>         at 
> org.apache.accumulo.test.randomwalk.Framework.main(Framework.java:122)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.accumulo.start.Main$1.run(Main.java:107)
>         at java.lang.Thread.run(Thread.java:744)
> Caused by: java.lang.Exception: Error running node ct.CheckBalance
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:285)
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:254)
>         ... 8 more
> Caused by: java.lang.Exception: servers are unbalanced! location 
> 2487f8db354002f count 345 too far from average 151.86666666666667
>         at 
> org.apache.accumulo.test.randomwalk.concurrent.CheckBalance.visit(CheckBalance.java:86)
>         at org.apache.accumulo.test.randomwalk.Module.visit(Module.java:254)
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to