[ 
https://issues.apache.org/jira/browse/IGNITE-18170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Pligin updated IGNITE-18170:
-------------------------------------
    Description: 
Currently, {{TableManager#updateAssignmentsInternal}} is fully synchronous. The 
scenario is as follows:
 # {{updateAssignmentsInternal}} starts a RAFT group for a partition
 # {{FSMCallerImpl}} finds out that its applied index is below the group 
committed index, so it starts to apply the missing log entries in its 
{{init()}} method (this is still done synchronously)
 # While doing so, it invokes {{{}PartitionListener{}}}, which tries to execute 
an insert
 # To make an insert, a PK is needed, so it the insertion code tries to obtain 
a PK from its future like this: {{pkFuture.join()}}
 # That future is completed from {{{}IndexManager#createIndexLocally(){}}}, 
which is invoked by {{ConfigurationNotifier}} later than 
{{updateassignmentsInternal}} in the same thread
 # As a result, the PK future cannot be completed before the sync 
{{updateAssignmentsInternal}} finishes its job and returns, and it cannot 
finish its job before the PK future is completed

We should make {{updateAssignmentsInternal}} async. 

  was:
Currently, {{TableManager#updateAssignmentsInternal}} is fully synchronous. The 
scenario is as follows:
 # {{updateAssignmentsInternal}} starts a RAFT group for a partition
 # {{FSMCallerImpl}} finds out that its applied index is below the group 
committed index, so it starts to apply the missing log entries in its 
{{init()}} method (this is still done synchronously)
 # While doing so, it invokes {{{}PartitionListener{}}}, which tries to execute 
an insert
 # To make an insert, a PK is needed, so it the insertion code tries to obtain 
a PK from its future like this: {{pkFuture.join()}}
 # That future is completed from {{{}IndexManager#createIndexLocally(){}}}, 
which is invoked by {{ConfigurationNotifier}} later than 
{{updateassignmentsInternal}} in the same thread
 # As a result, the PK future cannot be completed before the sync 
{{updateAssignmentsInternal}} finishes its job and returns, and it cannot 
finish its job before the PK future is completed

We should make {{updateAssignmentsInternal}} async.


> Deadlock in TableManager#updateAssignmentInternal()
> ---------------------------------------------------
>
>                 Key: IGNITE-18170
>                 URL: https://issues.apache.org/jira/browse/IGNITE-18170
>             Project: Ignite
>          Issue Type: Bug
>            Reporter: Roman Puchkovskiy
>            Assignee: Roman Puchkovskiy
>            Priority: Major
>              Labels: ignite-3
>             Fix For: 3.0.0-beta2
>
>         Attachments: threads_report.txt
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Currently, {{TableManager#updateAssignmentsInternal}} is fully synchronous. 
> The scenario is as follows:
>  # {{updateAssignmentsInternal}} starts a RAFT group for a partition
>  # {{FSMCallerImpl}} finds out that its applied index is below the group 
> committed index, so it starts to apply the missing log entries in its 
> {{init()}} method (this is still done synchronously)
>  # While doing so, it invokes {{{}PartitionListener{}}}, which tries to 
> execute an insert
>  # To make an insert, a PK is needed, so it the insertion code tries to 
> obtain a PK from its future like this: {{pkFuture.join()}}
>  # That future is completed from {{{}IndexManager#createIndexLocally(){}}}, 
> which is invoked by {{ConfigurationNotifier}} later than 
> {{updateassignmentsInternal}} in the same thread
>  # As a result, the PK future cannot be completed before the sync 
> {{updateAssignmentsInternal}} finishes its job and returns, and it cannot 
> finish its job before the PK future is completed
> We should make {{updateAssignmentsInternal}} async. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to