[
https://issues.apache.org/jira/browse/HBASE-23737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17023379#comment-17023379
]
Michael Stack commented on HBASE-23737:
---------------------------------------
Uploaded patch I pushed. It just adds waiting on all RITs to clear. Seems to
make the test better but still strange condition where Region is missing its
FavoredNodes in global list. I added debug around all adds/removes and made
lists immutable and where we let out the FN plan, I made copies... but needs
more time to nail definitively. Ugh.
> [Flakey Tests] TestFavoredNodeTableImport fails 30% of the time
> ---------------------------------------------------------------
>
> Key: HBASE-23737
> URL: https://issues.apache.org/jira/browse/HBASE-23737
> Project: HBase
> Issue Type: Bug
> Reporter: Michael Stack
> Priority: Major
> Attachments:
> 0001-HBASE-23737-Flakey-Tests-TestFavoredNodeTableImport-.patch
>
>
> Spent time on TestFavoredNodeTableImport. It fails w/ an NPE when we go to
> get favorednodes for one of the regions. It is sporadic. Fails for me locally
> too about 30% of the time.
> I tried to study where we are going wrong. The balancer is disabled when we
> start the cluster up again on FN balancer... but this don't seem to be the
> problem.
> Looks like laggard Regions taking their time to open means they don't show in
> the global list of favored nodes when the checking runs. Adding a wait till
> no RIT seems to stabilize the test.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)