[jira] [Created] (HBASE-27426) Region server abort with failing to list region servers.

2022-10-12 Thread Victor Li (Jira)
Victor Li created HBASE-27426:
-

             Summary: Region server abort with failing to list region servers.
                 Key: HBASE-27426
                 URL: https://issues.apache.org/jira/browse/HBASE-27426
             Project: HBase
          Issue Type: Bug
          Components: Zookeeper
            Reporter: Victor Li


During unit tests in which mini clusters are created and destroyed, the region 
server can abort during shutdown with "failed to list region servers".

Logs:

2022-10-11 18:44:43,393 INFO  [main] utility.HBaseClusterSetup(160): Starting mini cluster: singleCluster
[master/victorli-wsl:0:becomeActiveMaster] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - not a secure deployment, proceeding
2022-10-11 18:45:07,234 INFO  [Listener at localhost/45185] utility.HBaseClusterSetup(177): Started mini cluster: singleCluster
[Listener at localhost/45185] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close started.
[Listener at localhost/45185] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close finished.
[Listener at localhost/45185] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close started.
[Listener at localhost/45185] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close finished.
2022-10-11 18:45:45,980 INFO  [Listener at localhost/45185] utility.HBaseClusterSetup(132): Stopping mini cluster: singleCluster
[RS:0;victorli-wsl:35113] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close started.
[zk-event-processor-pool-0] ERROR org.apache.hadoop.hbase.regionserver.HRegionServer - * ABORTING region server victorli-wsl,35113,1665539090704: failed to list region servers *
[RS:0;victorli-wsl:35113] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close finished.
[M:0;victorli-wsl:42915] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close started.
[M:0;victorli-wsl:42915] INFO org.apache.hadoop.hbase.zookeeper.ZKWatcher - ZKWatcher close finished.
2022-10-11 18:45:57,208 INFO  [Listener at localhost/45185] utility.HBaseClusterSetup(144): Stopped mini cluster: singleCluster

Stack:

[zk-event-processor-pool-0] WARN org.apache.hadoop.hbase.zookeeper.ZKUtil - regionserver:35113-0x101b6fed2a1, quorum=127.0.0.1:53028, baseZNode=/9 Unable to list children of znode /9/rs
java.lang.InterruptedException
    at java.lang.Object.wait(Native Method)
    at java.lang.Object.wait(Object.java:502)
    at org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1601)
    at org.apache.zookeeper.ClientCnxn.submitRequest(ClientCnxn.java:1573)
    at org.apache.zookeeper.ZooKeeper.getChildren(ZooKeeper.java:2420)
    at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.getChildren(RecoverableZooKeeper.java:341)
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.listChildrenAndWatchForNewChildren(ZKUtil.java:190)
    at org.apache.hadoop.hbase.zookeeper.RegionServerAddressTracker.loadRegionServerList(RegionServerAddressTracker.java:54)
    at org.apache.hadoop.hbase.zookeeper.RegionServerAddressTracker.nodeChildrenChanged(RegionServerAddressTracker.java:71)
    at org.apache.hadoop.hbase.zookeeper.ZKWatcher.lambda$processEvent$0(ZKWatcher.java:586)
    at org.apache.hadoop.hbase.trace.TraceUtil.trace(TraceUtil.java:187)
    at org.apache.hadoop.hbase.trace.TraceUtil.trace(TraceUtil.java:177)
    at org.apache.hadoop.hbase.zookeeper.ZKWatcher.processEvent(ZKWatcher.java:554)
    at org.apache.hadoop.hbase.zookeeper.ZKWatcher.lambda$process$1(ZKWatcher.java:607)
    at org.apache.hadoop.hbase.trace.TraceUtil.lambda$tracedRunnable$2(TraceUtil.java:155)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:750)
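
The trace shows a pool thread blocked in a wait when it was interrupted. As a rough, generic illustration of how that can happen during shutdown (this is not HBase code, and the report does not confirm the cause), the sketch below blocks a handler on a pool thread and then stops the pool with shutdownNow(). All names are hypothetical, and the "stopping" flag is only one possible way a handler could tell an expected interrupt from a real failure.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical, standalone illustration; none of these names come from HBase.
class InterruptOnShutdownSketch {

  // Returns what the handler decided after being interrupted:
  // "aborted" or "ignored-during-stop" ("completed" if it was never interrupted).
  static String run(boolean checkStoppingFlag) {
    ExecutorService eventPool = Executors.newSingleThreadExecutor();
    CountDownLatch handlerStarted = new CountDownLatch(1);
    // Never counted down: stands in for a blocking call that waits for a reply.
    CountDownLatch neverReleased = new CountDownLatch(1);
    AtomicBoolean stopping = new AtomicBoolean(false);
    AtomicReference<String> outcome = new AtomicReference<>("completed");

    eventPool.execute(() -> {
      handlerStarted.countDown();
      try {
        neverReleased.await(); // blocks until the thread is interrupted
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt(); // preserve the interrupt status
        if (checkStoppingFlag && stopping.get()) {
          outcome.set("ignored-during-stop"); // interrupt is expected while stopping
        } else {
          outcome.set("aborted"); // interrupt treated as a fatal error
        }
      }
    });

    try {
      handlerStarted.await();  // make sure the handler is running
      stopping.set(true);      // mark shutdown before interrupting the pool
      eventPool.shutdownNow(); // interrupts the blocked handler thread
      eventPool.awaitTermination(5, TimeUnit.SECONDS);
    } catch (InterruptedException e) {
      Thread.currentThread().interrupt();
      throw new IllegalStateException("caller interrupted", e);
    }
    return outcome.get();
  }

  public static void main(String[] args) {
    System.out.println(run(false));
    System.out.println(run(true));
  }
}
```

In this sketch the first call treats the interrupt as fatal, which mirrors the abort in the log, while the second call checks the flag and skips the abort.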



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (HBASE-27425) Run flaky test job more often

2022-10-12 Thread Sean Busbey (Jira)
Sean Busbey created HBASE-27425:
---

             Summary: Run flaky test job more often
                 Key: HBASE-27425
                 URL: https://issues.apache.org/jira/browse/HBASE-27425
             Project: HBase
          Issue Type: Task
          Components: test
            Reporter: Sean Busbey
         Attachments: ci-hbase-hbase-flaky-test-master-time-png.png, ci-hbase-hbase-load-chart-long.png

We currently run the flaky tests once per 4 hours and keep a backlog of 50 
runs. That means it takes a perfect test a bit over 8 days to get back out of 
being penalized.
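
The "bit over 8 days" figure follows from 50 runs at one run every 4 hours. A minimal sketch of that arithmetic is below. The class and method names are hypothetical, and the 1-hour interval is only an illustrative alternative, not a proposal from this issue.

```java
// Hypothetical helper; only the values 50 (backlog) and 4 (hours) come from
// the message above.
class FlakyBacklogWindow {

  // Days until a now-passing test has aged out of a backlog of
  // `backlogRuns` runs when one run happens every `hoursBetweenRuns` hours.
  static double daysToClear(int backlogRuns, double hoursBetweenRuns) {
    return backlogRuns * hoursBetweenRuns / 24.0;
  }

  public static void main(String[] args) {
    // Current schedule: 50 runs * 4 h = 200 h, a bit over 8 days.
    System.out.println("every 4h: " + daysToClear(50, 4) + " days");
    // Illustrative alternative interval (an assumption, not from the issue).
    System.out.println("every 1h: " + daysToClear(50, 1) + " days");
  }
}
```

With the backlog size fixed, the recovery window shrinks in proportion to the interval between runs.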

We regularly leave a bunch of executors idle (see attached PNG), and I 
think we should run the tests more often.

We use one executor per branch for a flaky test run. The duration depends on the 
specific tests that are flagged, but the run has a timeout of 2 hours.

As an example, the master branch tests currently take < 30 minutes (see the other 
attached PNG).







[jira] [Resolved] (HBASE-27400) The 'Remove unhelpful javadoc stubs' spotless rule will leave a 'n' instead of '\n'

2022-10-12 Thread Duo Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang resolved HBASE-27400.
---
  Assignee: Duo Zhang
Resolution: Fixed

All sub tasks are done.

Resolve.

> The 'Remove unhelpful javadoc stubs' spotless rule will leave a 'n' instead 
> of '\n'
> ---
>
> Key: HBASE-27400
> URL: https://issues.apache.org/jira/browse/HBASE-27400
> Project: HBase
>  Issue Type: Bug
>  Components: documentation
>Reporter: Duo Zhang
>Assignee: Duo Zhang
>Priority: Major
>
> That's why there are lots of 'n' in our code base...
> Let me find a way to fix this and clean up all the 'n's.
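
One plausible mechanism for a stray 'n' is sketched below. It is an assumption: the thread does not say how the rule's replacement is evaluated. In Java regex replacement strings, a backslash escapes the following character, so a replacement made of backslash plus 'n' is emitted as a plain 'n' and not as a line feed. The class and method names are hypothetical; only String.replaceAll and Matcher.quoteReplacement are real JDK calls.

```java
import java.util.regex.Matcher;

// Standalone illustration; nothing here is taken from the spotless or
// HBase sources.
class ReplacementEscapeSketch {

  // The replacement is the two characters backslash + 'n'. In a regex
  // replacement string the backslash escapes the next character, so the
  // output contains a plain 'n'.
  static String backslashN(String input) {
    return input.replaceAll(";", "\\n");
  }

  // The replacement is a real line-feed character, inserted as-is.
  static String realNewline(String input) {
    return input.replaceAll(";", "\n");
  }

  // quoteReplacement keeps the backslash literal, so the output contains
  // the two characters backslash + 'n'.
  static String quoted(String input) {
    return input.replaceAll(";", Matcher.quoteReplacement("\\n"));
  }

  public static void main(String[] args) {
    System.out.println(backslashN("a;b").equals("anb"));
    System.out.println(realNewline("a;b").equals("a\nb"));
    System.out.println(quoted("a;b").equals("a\\nb"));
  }
}
```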





[jira] [Resolved] (HBASE-27421) Bump spotless plugin to 2.27.2 and reimplement the 'Remove unhelpful javadoc stubs' rule

2022-10-12 Thread Duo Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang resolved HBASE-27421.
---
Fix Version/s: 2.6.0
   2.5.1
   3.0.0-alpha-4
   2.4.15
 Hadoop Flags: Reviewed
   Resolution: Fixed

Pushed to branch-2.4+.

Thanks [~haxiaolin] for reviewing!

> Bump spotless plugin to 2.27.2 and reimplement the 'Remove unhelpful javadoc 
> stubs' rule
> 
>
> Key: HBASE-27421
> URL: https://issues.apache.org/jira/browse/HBASE-27421
> Project: HBase
>  Issue Type: Sub-task
>  Components: documentation, pom
>Reporter: Duo Zhang
>Assignee: Duo Zhang
>Priority: Major
> Fix For: 2.6.0, 2.5.1, 3.0.0-alpha-4, 2.4.15
>
>
> Spotless Maven plugin 2.27.2 has been released and includes the change for 
> https://github.com/diffplug/spotless/issues/1359, which allows the 
> replacement to be null, so we can now add back the 'Remove unhelpful javadoc 
> stubs' rule.





[jira] [Resolved] (HBASE-27405) Fix the replication hfile/log cleaner report that the replication table does not exist

2022-10-12 Thread Duo Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/HBASE-27405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Duo Zhang resolved HBASE-27405.
---
Fix Version/s: HBASE-27109
 Hadoop Flags: Reviewed
   Resolution: Fixed

Merged to branch HBASE-27109.

Thanks [~heliangjun] for contributing!

> Fix the replication hfile/log cleaner report that the replication table does 
> not exist
> --
>
> Key: HBASE-27405
> URL: https://issues.apache.org/jira/browse/HBASE-27405
> Project: HBase
>  Issue Type: Sub-task
>  Components: master, Replication
>Reporter: LiangJun He
>Assignee: LiangJun He
>Priority: Major
> Fix For: HBASE-27109
>
>



