[ https://issues.apache.org/jira/browse/PHOENIX-3970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16058879#comment-16058879 ]
James Taylor commented on PHOENIX-3970:
---------------------------------------

I think we need a way to repro the deadlock issue before making a change like this. Vincent's bug fix for the sorting plus using a high priority queue will fix deadlock issues IMHO. The n^2 sorting was done while the row is locked, which explains the deadlocks. I don't see how this patch helps.

Samarth - how about doing the partial rebuild triggered from UngroupedAggregateRegionObserver asynchronously, as we do with stats collection? The client thread in MetaDataRegionObserver could just poll until the async work is finished. We'd need to do a bit more tracking for each scan being finished on the index header row.

Not sure all of this is actually necessary with Vincent's sort fix. The intent is that the partial index rebuild is fast. An alternative would be to disable the index and clear the index_disable_timestamp if the scan times out (since this is an indication something is wrong).

> Ensure that automatic partial index rebuilds are served from the index handler pool
> -----------------------------------------------------------------------------------
>
>                 Key: PHOENIX-3970
>                 URL: https://issues.apache.org/jira/browse/PHOENIX-3970
>             Project: Phoenix
>          Issue Type: Bug
>            Reporter: Lars Hofhansl
>            Assignee: Lars Hofhansl
>         Attachments: 3970.txt, 3970-v2.txt
>
>
> This (and other issues) have rendered multiple larger clusters inoperable.

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
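
For illustration only, the "run the rebuild asynchronously and have the client thread poll" idea from the comment above could look roughly like the sketch below. It is not Phoenix code: the class name AsyncRebuildSketch, the method pollUntilDone, the Outcome enum, and the timeout values are all hypothetical, and the rebuild itself is a placeholder task. The TIMED_OUT branch marks the point where the suggested alternative (disable the index and clear index_disable_timestamp) would apply.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

public class AsyncRebuildSketch {
    // FINISHED means the task is no longer running (it may have succeeded or failed).
    enum Outcome { FINISHED, TIMED_OUT }

    // Polls the future until it is done or the timeout elapses.
    // On timeout the task is cancelled so it does not keep running unattended.
    static Outcome pollUntilDone(Future<?> rebuild, long timeoutMs, long pollIntervalMs)
            throws InterruptedException {
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMs);
        while (!rebuild.isDone()) {
            if (System.nanoTime() - deadline >= 0) {
                rebuild.cancel(true);
                // Hypothetical hook: here the caller would disable the index
                // and clear index_disable_timestamp, as the comment suggests.
                return Outcome.TIMED_OUT;
            }
            Thread.sleep(pollIntervalMs);
        }
        return Outcome.FINISHED;
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for a dedicated pool; a single thread is enough for the sketch.
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            // Placeholder for the partial index rebuild scan.
            Future<?> rebuild = pool.submit(() -> { /* rebuild work would go here */ });
            Outcome outcome = pollUntilDone(rebuild, 1000, 10);
            System.out.println("rebuild outcome: " + outcome);
        } finally {
            pool.shutdownNow();
        }
    }
}
```

Running the rebuild on a separate executor keeps the handler thread that triggered it free of the long scan; the polling loop is the only part that blocks, and it is bounded by the timeout.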