[ 
https://issues.apache.org/jira/browse/IMPALA-12060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17718367#comment-17718367
 ] 

Wenzhe Zhou commented on IMPALA-12060:
--------------------------------------

Submitted a patch: https://gerrit.cloudera.org/#/c/19826/
Currently DNS resolution is done on the subscriber's side, not on the 
statestore. Subscribers (coordinator/executor/catalogd) do DNS resolution if 
"FLAGS_statestore_subscriber_use_resolved_address" is set to true, then send 
the resolved heartbeat address to the statestore. The statestore doesn't have 
the hostnames of the subscribers and cannot do DNS resolution now.
In the patch, DNS resolution is still done on the subscriber's side, but each 
subscriber needs to redo DNS resolution when it tries to re-register with the 
statestore in recovery mode. If DNS resolution fails, it will not re-register 
itself with the statestore. This prevents the catalogd in passive mode (the 
original leader) from rejoining the cluster. When a catalogd leader failover 
happens, the new leader catalogd can do DNS resolution and send its 
registration to the statestore. The statestore then unregisters the original 
leader catalogd before registering the new leader catalogd, to ensure there is 
only one catalogd instance in the cluster.
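
The re-registration flow described above can be sketched roughly as follows. 
This is an illustrative Python model, not the Impala code (which is C++): 
Statestore, register_catalogd, try_reregister and resolve_or_none are 
hypothetical names, and the resolver is passed in so that failures can be 
simulated.

```python
import socket
from typing import Callable, Optional, Tuple

# All names below are hypothetical; none of them are Impala APIs.
Resolver = Callable[[str], str]


def resolve_or_none(hostname: str,
                    resolver: Resolver = socket.gethostbyname) -> Optional[str]:
    """Return the resolved address for hostname, or None if resolution fails."""
    try:
        return resolver(hostname)
    except OSError:  # socket.gaierror is a subclass of OSError
        return None


class Statestore:
    """Toy statestore that tracks at most one registered catalogd."""

    def __init__(self) -> None:
        # (subscriber_id, resolved_address) of the current catalogd, if any.
        self.catalogd: Optional[Tuple[str, str]] = None

    def register_catalogd(self, subscriber_id: str, address: str) -> None:
        # Unregister the previous catalogd first, then register the new one,
        # so there is never more than one catalogd in the cluster.
        self.catalogd = None
        self.catalogd = (subscriber_id, address)


def try_reregister(statestore: Statestore, subscriber_id: str, hostname: str,
                   resolver: Resolver = socket.gethostbyname) -> bool:
    """Subscriber side: redo DNS resolution before re-registering.

    Returns False without contacting the statestore if resolution fails.
    """
    address = resolve_or_none(hostname, resolver)
    if address is None:
        return False
    statestore.register_catalogd(subscriber_id, address)
    return True
```

In this model, a passive catalogd whose hostname no longer resolves never 
reaches register_catalogd, so it cannot displace the new leader.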

Another approach is to move DNS resolution from the subscriber to the 
statestore, so that the statestore has the hostnames of the subscribers and can 
do DNS resolution itself. The statestore would try to resolve a subscriber's 
hostname when it tries to register that subscriber. If DNS resolution fails, 
the statestore would reject the registration request.
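
A minimal sketch of this alternative, again as an illustrative Python model 
rather than Impala code. ResolvingStatestore, RegistrationRejected and 
register are hypothetical names, and the assumption is that the registration 
request carries the subscriber's hostname instead of a pre-resolved address.

```python
import socket
from typing import Callable, Dict


class RegistrationRejected(Exception):
    """Raised when the statestore refuses a registration request."""


class ResolvingStatestore:
    """Toy statestore that resolves subscriber hostnames itself."""

    def __init__(self,
                 resolver: Callable[[str], str] = socket.gethostbyname) -> None:
        self._resolver = resolver
        # subscriber_id -> address resolved by the statestore
        self.subscribers: Dict[str, str] = {}

    def register(self, subscriber_id: str, hostname: str) -> str:
        """Resolve hostname and register the subscriber.

        Raises RegistrationRejected if the hostname cannot be resolved.
        """
        try:
            address = self._resolver(hostname)
        except OSError as exc:  # socket.gaierror is a subclass of OSError
            raise RegistrationRejected(
                f"cannot resolve {hostname!r} for {subscriber_id!r}") from exc
        self.subscribers[subscriber_id] = address
        return address
```

Here the statestore, not the subscriber, decides whether a registration is 
acceptable, at the cost of sending hostnames in the registration request.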

> statestore should only allow a single catalog instance to be part of cluster
> ----------------------------------------------------------------------------
>
>                 Key: IMPALA-12060
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12060
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Abhishek Rawat
>            Assignee: Wenzhe Zhou
>            Priority: Critical
>
> Today Impala doesn't support multiple catalogd instances to be part of the 
> cluster. When using external Active Passive HA mechanisms like leader 
> election, it may be possible for multiple catalog replicas to be part of the 
> cluster, in some cases. This is mainly because catalogds have no idea that 
> there are other catalogd instances. One way to address this issue could be 
> for statestore to always ensure that there could only be a single instance of 
> catalogd in the cluster. Coordinator also has a config `catalog_service_host` 
> - both statestore and coordinator should have a consistent view and consider 
> the proper catalogd instance to be part of the cluster.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
