[jira] [Updated] (GEODE-10409) Rebalance Model Missing Collocated Regions At Server Startup

2022-09-13 Thread Weijie Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Weijie Xu updated GEODE-10409:
--
Fix Version/s: 1.16.0

> Rebalance Model Missing Collocated Regions At Server Startup
> 
>
> Key: GEODE-10409
> URL: https://issues.apache.org/jira/browse/GEODE-10409
> Project: Geode
>  Issue Type: Bug
>Reporter: Weijie Xu
>Assignee: Weijie Xu
>Priority: Major
>  Labels: needsTriage, pull-request-available
> Fix For: 1.16.0
>
> Attachments: server2.log, test.tar.gz
>
>
> The following steps reproduce the issue:
> Run start.gfsh from the attached example. It configures a Geode system with
> a partitioned region, a gateway sender, and a region collocated with the
> partitioned region, so there are three regions in total: the leader region,
> the collocated region, and the queue region.
> Then run the example code, which loads ~400M of data and five times that
> amount of events into the system.
> Then stop one of the servers and revoke the disk file of that server.
> Then start the server, which triggers a bucket recovery.
> Lines 596, 598, and 5958 of the attached log show that the queue region is
> not included in the rebalance model, in either the data size column or the
> max size column.
> Then run a manual rebalance once the server is up. This time the log shows
> that the queue region is added to the model (lines 6010, 6012, 6014, and
> 6028).
>  
> The inconsistent behavior has two negative consequences:
> 1) The rebalance result differs between server startup and a manual
> trigger. The startup rebalance reports that everything is OK and that the
> rebalance finished, but the manually triggered rebalance reports
> insufficient space, because its model includes the queue region, which holds
> five times as much data as the leader region.
> 2) A mismatch between the rebalance model and the data actually being
> rebalanced: the queue region's data is rebalanced at server startup even
> though the region is not included in the model.
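
The two consequences described above can be illustrated with a toy calculation. This is only a sketch and not Geode's rebalance algorithm: the Region class, the fits() helper, the free-space figure, and the size of the collocated region are all hypothetical. Only the ~400 leader size and the 5x queue ratio come from the report.

```python
from dataclasses import dataclass


@dataclass
class Region:
    name: str
    data_size: int  # size units are illustrative only


def fits(regions, free_space):
    """Toy check: does the modelled data fit in the member's free space?"""
    return sum(r.data_size for r in regions) <= free_space


leader = Region("leader", 400)            # ~400M, from the report
collocated = Region("collocated", 100)    # hypothetical size
queue = Region("queue", 5 * leader.data_size)  # 5x the leader, from the report

FREE_SPACE = 1000  # hypothetical free space on the restarted member

# Startup model as reported: the queue region is missing.
startup_model = [leader, collocated]
# Manual rebalance model as reported: the queue region is included.
manual_model = [leader, collocated, queue]

print("startup model fits:", fits(startup_model, FREE_SPACE))
print("manual model fits: ", fits(manual_model, FREE_SPACE))

# Data actually moved at startup includes the queue region,
# so it exceeds what the startup model accounted for.
modelled = sum(r.data_size for r in startup_model)
actual = sum(r.data_size for r in manual_model)
print("modelled at startup:", modelled, "actually rebalanced:", actual)
```

With these assumed numbers the startup model concludes the data fits while the manual model concludes it does not, and the data actually moved is larger than what the startup model counted.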



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (GEODE-10409) Rebalance Model Missing Collocated Regions At Server Startup

2022-08-25 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated GEODE-10409:
---
Labels: needsTriage pull-request-available  (was: needsTriage)

> Rebalance Model Missing Collocated Regions At Server Startup
> 
>
> Key: GEODE-10409
> URL: https://issues.apache.org/jira/browse/GEODE-10409
> Project: Geode
>  Issue Type: Bug
>Reporter: Weijie Xu
>Priority: Major
>  Labels: needsTriage, pull-request-available
> Attachments: server2.log, test.tar.gz





[jira] [Updated] (GEODE-10409) Rebalance Model Missing Collocated Regions At Server Startup

2022-08-23 Thread Weijie Xu (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Weijie Xu updated GEODE-10409:
--
Attachment: server2.log
test.tar.gz

> Rebalance Model Missing Collocated Regions At Server Startup
> 
>
> Key: GEODE-10409
> URL: https://issues.apache.org/jira/browse/GEODE-10409
> Project: Geode
>  Issue Type: Bug
>Reporter: Weijie Xu
>Priority: Major
>  Labels: needsTriage
> Attachments: server2.log, test.tar.gz





[jira] [Updated] (GEODE-10409) Rebalance Model Missing Collocated Regions At Server Startup

2022-08-23 Thread Alexander Murmann (Jira)


 [ 
https://issues.apache.org/jira/browse/GEODE-10409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alexander Murmann updated GEODE-10409:
--
Labels: needsTriage  (was: )

> Rebalance Model Missing Collocated Regions At Server Startup
> 
>
> Key: GEODE-10409
> URL: https://issues.apache.org/jira/browse/GEODE-10409
> Project: Geode
>  Issue Type: Bug
>Reporter: Weijie Xu
>Priority: Major
>  Labels: needsTriage


