[ 
https://issues.apache.org/jira/browse/HAWQ-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Radar Lei reassigned HAWQ-1549:
-------------------------------

    Assignee: Shubham Sharma  (was: Radar Lei)

>  Re-syncing standby fails even when stop mode is fast
> -----------------------------------------------------
>
>                 Key: HAWQ-1549
>                 URL: https://issues.apache.org/jira/browse/HAWQ-1549
>             Project: Apache HAWQ
>          Issue Type: Bug
>          Components: Command Line Tools, Standby master
>            Reporter: Shubham Sharma
>            Assignee: Shubham Sharma
>            Priority: Major
>             Fix For: 2.3.0.0-incubating
>
>
> I observed the following behaviour while re-syncing the standby from the hawq command line.
> Steps to reproduce:
> 1 - Open a client connection to hawq using psql
> 2 - From a different terminal, run: hawq init standby -n -v -M fast
> 3 - The standby resync fails with the following error:
> {code}
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-There are other connections to this instance, shutdown mode smart aborted
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-Either remove connections, or use 'hawq stop master -M fast' or 'hawq stop master -M immediate'
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[WARNING]:-See hawq stop --help for all options
> 20171113:03:49:21:158354 hawq_stop:hdp3:gpadmin-[ERROR]:-Active connections. Aborting shutdown...
> 20171113:03:49:21:158143 hawq_init:hdp3:gpadmin-[ERROR]:-Stop hawq cluster failed, exit
> {code}
> Expected behaviour: when -M (stop mode) is passed, the stop step should 
> terminate existing client connections. 
> The source of this issue appears to be the _resync_standby method in 
> tools/bin/hawq_ctl. The command it builds to stop the master does not include 
> the stop mode passed on the command line, so the stop runs in smart mode and 
> aborts on active connections:
> {code}
>  def _resync_standby(self):
>         logger.info("Re-sync standby")
>         cmd = "%s; hawq stop master -a;" % source_hawq_env
>         check_return_code(local_ssh(cmd, logger), logger, "Stop hawq cluster failed, exit")
>         ......
>         ......
> {code}
> I can start working on this and submit a PR once the changes are done.
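
The change described above could look roughly like the sketch below. It is not the actual hawq_ctl code: build_stop_master_cmd and VALID_STOP_MODES are hypothetical names used only for illustration, and how hawq_ctl stores the parsed -M value is an assumption. Only the -a and -M options and the smart, fast and immediate modes are taken from the report and the log output.

```python
# Hypothetical helper, not taken from tools/bin/hawq_ctl.
# The mode names come from the hawq_stop warnings quoted in the report.
VALID_STOP_MODES = ("smart", "fast", "immediate")


def build_stop_master_cmd(source_hawq_env, stop_mode="smart"):
    """Build the shell command that stops the master before a resync.

    source_hawq_env: shell snippet that sources the HAWQ environment.
    stop_mode: value passed by the user via -M (assumed to be available
               to the caller; smart is what the log shows when no mode
               reaches hawq stop).
    """
    if stop_mode not in VALID_STOP_MODES:
        raise ValueError("unsupported stop mode: %s" % stop_mode)
    # Forward the stop mode instead of relying on the default.
    return "%s; hawq stop master -a -M %s;" % (source_hawq_env, stop_mode)


if __name__ == "__main__":
    # The environment path below is a placeholder, not a real install path.
    print(build_stop_master_cmd("source /path/to/greenplum_path.sh", "fast"))
```

With something like this in place, _resync_standby would pass the user-supplied stop mode through to hawq stop master rather than building the command with -a alone.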



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
