subject:"Re\: Disaster after maintenance"

Re: Disaster after maintenance

2019-03-20 Thread Sergey Levitskiy

+1 on the advice to start from scratch.



Provisioning is failing because it can’t spin up either SSVM or proxy due to 
not enough capacity. The reason might be:

  *   Not enough capacity either CPU or RAM. increasing overprovisioning 
factors or reducing disable thresholds might help.
  *   Hosts in error state
  *   Cluster disabled
  *   Problem accessing primary and/or secondary storage mount from management 
server host







2019-03-20 15:07:39,218 DEBUG [c.c.d.DeploymentPlanningManagerImpl] 
(Work-Job-Executor-37:ctx-3cad2de4 job-5120/job-7077 ctx-6b705264) 
(logid:49483c7a) Trying to allocate a host and storage pools from dc:3, 
pod:null,cluster:null, requested cpu: 500, requested ram: 536870912

2019-03-20 15:07:39,218 DEBUG [c.c.d.DeploymentPlanningManagerImpl] 
(Work-Job-Executor-37:ctx-3cad2de4 job-5120/job-7077 ctx-6b705264) 
(logid:49483c7a) Is ROOT volume READY (pool already allocated)?: No

2019-03-20 15:07:39,219 DEBUG [c.c.d.DeploymentPlanningManagerImpl] 
(Work-Job-Executor-37:ctx-3cad2de4 job-5120/job-7077 ctx-6b705264) 
(logid:49483c7a) Deploy avoids pods: null, clusters: null, hosts: null

2019-03-20 15:07:39,219 DEBUG [c.c.d.FirstFitPlanner] 
(Work-Job-Executor-37:ctx-3cad2de4 job-5120/job-7077 ctx-6b705264) 
(logid:49483c7a) Searching all possible resources under this Zone: 3

2019-03-20 15:07:39,219 DEBUG [c.c.d.FirstFitPlanner] 
(Work-Job-Executor-38:ctx-f824bfeb job-5119/job-7076 ctx-9498) 
(logid:bc39cd2a) No clusters found having a host with enough capacity, 
returning.

2019-03-20 15:07:39,219 DEBUG [c.c.d.FirstFitPlanner] 
(Work-Job-Executor-37:ctx-3cad2de4 job-5120/job-7077 ctx-6b705264) 
(logid:49483c7a) Listing clusters in order of aggregate capacity, that have 
(atleast one host with) enough CPU and RAM capacity under this Zone: 3

2019-03-20 15:07:39,221 DEBUG [c.c.d.FirstFitPlanner] 
(Work-Job-Executor-37:ctx-3cad2de4 job-5120/job-7077 ctx-6b705264) 
(logid:49483c7a) No clusters found having a host with enough capacity, 
returning.





On 3/20/19, 10:38 AM, "Andrija Panic"  wrote:



Hi Jevgeni,



I would perhaps consider you continue with plan B from your separate email

thread (root volumes --> create snapshots, convert snaps to template,

download template somewhere safe - for DATA volumes, also create snapshots,

then convert to volume and download it (or simply directly download

existing DATA volume if VM is stopped).

Once you are safe, and all templates, and VM volumes are safe, you are good

to reinstall.

Seriously, I'm not sure how to proceed via ML - if this was my own setup,

probably would be able to fix it...



In next installment, start with clean 4.11.2 (4.10 was never released as an

official release and was SERIOUSLY broken), or even 4.12 which has just

been released (will be in 1-2 days).

In this new installment, please dedicate a VM (or physical server) that

will host mgmt+DB+NFS (or even better separate NFS on different server

etc.) - but certainly do NOT collocate management components with KVM role.

When you build zone successfully, you can then import all templates and

upload all volumes (which you saved previously to some external place, web

server)

This will allow you to restore your VMs - possibly with just different IPs

versus original ones.



Considering failed DB upgrades and issues you see now, I assume your env,

might be severely broken at this point, and warrants starting from

scratch...



Hope that makes sense - so again, download all root and data volumes to

safe place (consider some petrol + matches fun) and then reinstall with a

fresh and shiny infra.



Alternatively, I would try to wipe all new zones (this takes some time and

certain steps) and then continue troubleshooting with failed-to-start VRs.



Cheers



On Wed, 20 Mar 2019 at 17:59, Jevgeni Zolotarjov 

wrote:



> It started with 4.10 and then gradually upgraded with all stops, when new

> releases were available.

>

>

> >>> Why do you have 3 zones in this installation - what is the setup ?

> >>> SSVM and CPVM (for whatever zone) are failing to be created...

> Its a result of attempts to create new zone and somehow move VMs to this

> new zone. These all are unsuccessful attempts.

> Before problem started there was 1 zone and There should be just 1 zone in

> reality.

>

>

> >>> yes, the VR can't be started, it get's timeout - in AGENT logs, I see

> that

> >>> it attemps to create a volume on primary storage...

> I guess this is the root cause. I checked, and primary storage is

> accessible via NFS share on both hosts. How to troubleshoot it?

>

>

> On Wed, Mar 20, 2019 at 6:29 PM Andrija Panic 

> wrote:

>

> > Hi,

> >

> > 2019-03-20 06:41:50,446 INFO  [c.c.u.DatabaseUpgradeChecker] (main:null)

> >

Re: Disaster after maintenance

2019-03-20 Thread Andrija Panic

Hi Jevgeni,

I would perhaps consider you continue with plan B from your separate email
thread (root volumes --> create snapshots, convert snaps to template,
download template somewhere safe - for DATA volumes, also create snapshots,
then convert to volume and download it (or simply directly download
existing DATA volume if VM is stopped).
Once you are safe, and all templates, and VM volumes are safe, you are good
to reinstall.
Seriously, I'm not sure how to proceed via ML - if this was my own setup,
probably would be able to fix it...

In next installment, start with clean 4.11.2 (4.10 was never released as an
official release and was SERIOUSLY broken), or even 4.12 which has just
been released (will be in 1-2 days).
In this new installment, please dedicate a VM (or physical server) that
will host mgmt+DB+NFS (or even better separate NFS on different server
etc.) - but certainly do NOT collocate management components with KVM role.
When you build zone successfully, you can then import all templates and
upload all volumes (which you saved previously to some external place, web
server)
This will allow you to restore your VMs - possibly with just different IPs
versus original ones.

Considering failed DB upgrades and issues you see now, I assume your env,
might be severely broken at this point, and warrants starting from
scratch...

Hope that makes sense - so again, download all root and data volumes to
safe place (consider some petrol + matches fun) and then reinstall with a
fresh and shiny infra.

Alternatively, I would try to wipe all new zones (this takes some time and
certain steps) and then continue troubleshooting with failed-to-start VRs.

Cheers

On Wed, 20 Mar 2019 at 17:59, Jevgeni Zolotarjov 
wrote:

> It started with 4.10 and then gradually upgraded with all stops, when new
> releases were available.
>
>
> >>> Why do you have 3 zones in this installation - what is the setup ?
> >>> SSVM and CPVM (for whatever zone) are failing to be created...
> Its a result of attempts to create new zone and somehow move VMs to this
> new zone. These all are unsuccessful attempts.
> Before problem started there was 1 zone and There should be just 1 zone in
> reality.
>
>
> >>> yes, the VR can't be started, it get's timeout - in AGENT logs, I see
> that
> >>> it attemps to create a volume on primary storage...
> I guess this is the root cause. I checked, and primary storage is
> accessible via NFS share on both hosts. How to troubleshoot it?
>
>
> On Wed, Mar 20, 2019 at 6:29 PM Andrija Panic 
> wrote:
>
> > Hi,
> >
> > 2019-03-20 06:41:50,446 INFO  [c.c.u.DatabaseUpgradeChecker] (main:null)
> > (logid:) DB version = 4.10.0.0 Code Version = 4.11.2.0
> > 2019-03-20 06:41:50,447 DEBUG [c.c.u.DatabaseUpgradeChecker] (main:null)
> > (logid:) Running upgrade Upgrade41000to41100 to upgrade from
> > 4.10.0.0-4.11.0.0 to 4.11.0.0
> > fails due to
> > java.sql.SQLException: Error on rename of './cloud/ldap_trust_map' to
> > './cloud/#sql2-2f01-13d' (errno: 152)
> >
> > Then later...
> >
> > com.cloud.exception.InsufficientServerCapacityException: Unable to
> create a
> > deployment for VM[SecondaryStorageVm|s-734-VM]Scope=interface
> > com.cloud.dc.DataCenter; id=3
> > com.cloud.exception.InsufficientServerCapacityException: Unable to
> create a
> > deployment for VM[ConsoleProxy|v-733-VM]Scope=interface
> > com.cloud.dc.DataCenter; id=3
> >
> > 2019-03-20 15:02:39,113 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > (secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 1 is ready to launch
> > secondary storage VM
> > 2019-03-20 15:02:39,117 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > (secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 2 is not ready to
> launch
> > secondary storage VM yet
> > 2019-03-20 15:02:39,122 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > (secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 3 is ready to launch
> > secondary storage VM
> >
> > so did you start with clean 4.11.2 install, or was it upgraded one - I
> see
> > in logs an upgrade from DB version 4.10 to 4.11 was tried and failed...
> > Why do you have 3 zones in this installation - what is the setup ?
> > SSVM and CPVM (for whatever zone) are failing to be created...
> >
> > yes, the VR can't be started, it get's timeout - in AGENT logs, I see
> that
> > it attemps to create a volume on primary storage...
> >
> >
> > Also, for SSVM I got this one...
> > 2019-03-20 14:38:09,227 DEBUG [c.c.d.FirstFitPlanner]
> > (Work-Job-Executor-96:ctx-04c5c9f2 job-5120/job-6960 ctx-fde3d4d7)
> > (logid:49483c7a) No clusters found having a host with enough capacity,
> > returning.
> >
> > Andrija
> >
> > On Wed, 20 Mar 2019 at 16:39, Jevgeni Zolotarjov  >
> > wrote:
> >
> > > Basic Zone - Yes
> > >
> > > router has been actually started/created on KVM side - not created, not
> > > started. Thats the main problem, I guess
> > >
> > > agent.log
> > > https://drive.google.com/open?id=1rATxHKqgNKo2kD23BtlrZy_9gFXC-Bq-
> > >
> > > management log
> > >

Re: Disaster after maintenance

2019-03-20 Thread Jevgeni Zolotarjov

It started with 4.10 and then gradually upgraded with all stops, when new
releases were available.


>>> Why do you have 3 zones in this installation - what is the setup ?
>>> SSVM and CPVM (for whatever zone) are failing to be created...
Its a result of attempts to create new zone and somehow move VMs to this
new zone. These all are unsuccessful attempts.
Before problem started there was 1 zone and There should be just 1 zone in
reality.


>>> yes, the VR can't be started, it get's timeout - in AGENT logs, I see
that
>>> it attemps to create a volume on primary storage...
I guess this is the root cause. I checked, and primary storage is
accessible via NFS share on both hosts. How to troubleshoot it?


On Wed, Mar 20, 2019 at 6:29 PM Andrija Panic 
wrote:

> Hi,
>
> 2019-03-20 06:41:50,446 INFO  [c.c.u.DatabaseUpgradeChecker] (main:null)
> (logid:) DB version = 4.10.0.0 Code Version = 4.11.2.0
> 2019-03-20 06:41:50,447 DEBUG [c.c.u.DatabaseUpgradeChecker] (main:null)
> (logid:) Running upgrade Upgrade41000to41100 to upgrade from
> 4.10.0.0-4.11.0.0 to 4.11.0.0
> fails due to
> java.sql.SQLException: Error on rename of './cloud/ldap_trust_map' to
> './cloud/#sql2-2f01-13d' (errno: 152)
>
> Then later...
>
> com.cloud.exception.InsufficientServerCapacityException: Unable to create a
> deployment for VM[SecondaryStorageVm|s-734-VM]Scope=interface
> com.cloud.dc.DataCenter; id=3
> com.cloud.exception.InsufficientServerCapacityException: Unable to create a
> deployment for VM[ConsoleProxy|v-733-VM]Scope=interface
> com.cloud.dc.DataCenter; id=3
>
> 2019-03-20 15:02:39,113 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> (secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 1 is ready to launch
> secondary storage VM
> 2019-03-20 15:02:39,117 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> (secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 2 is not ready to launch
> secondary storage VM yet
> 2019-03-20 15:02:39,122 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> (secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 3 is ready to launch
> secondary storage VM
>
> so did you start with clean 4.11.2 install, or was it upgraded one - I see
> in logs an upgrade from DB version 4.10 to 4.11 was tried and failed...
> Why do you have 3 zones in this installation - what is the setup ?
> SSVM and CPVM (for whatever zone) are failing to be created...
>
> yes, the VR can't be started, it get's timeout - in AGENT logs, I see that
> it attemps to create a volume on primary storage...
>
>
> Also, for SSVM I got this one...
> 2019-03-20 14:38:09,227 DEBUG [c.c.d.FirstFitPlanner]
> (Work-Job-Executor-96:ctx-04c5c9f2 job-5120/job-6960 ctx-fde3d4d7)
> (logid:49483c7a) No clusters found having a host with enough capacity,
> returning.
>
> Andrija
>
> On Wed, 20 Mar 2019 at 16:39, Jevgeni Zolotarjov 
> wrote:
>
> > Basic Zone - Yes
> >
> > router has been actually started/created on KVM side - not created, not
> > started. Thats the main problem, I guess
> >
> > agent.log
> > https://drive.google.com/open?id=1rATxHKqgNKo2kD23BtlrZy_9gFXC-Bq-
> >
> > management log
> > https://drive.google.com/open?id=1H2jI0roeiWxtzReB8qV6QxDkNpaki99A
> >
> > >> Can you confirm your zone/pod/cluster/hosts are all in Enabled state,
> > i.e.
> > YES, all green
> >
> > >> Can you connect your both KVM hosts can access/mount both Primary and
> > Secondary Storage
> > YES. Double checked
> >
> > >>>Can you also explain your infrastructure - you said you have two hosts
> > only, where does CloudStack management run?
> > 2 hosts:
> > host1: 192.168.1.14
> > host2: 192.168.1.5
> >
> > Servers are standing next to each other - connected to the same switch
> > Management server runs on the same physical server with host1
> >
> > I noticed, that Virtual router gets created after I try to start any of
> the
> > existing guest VM
> > Here are logs
> > management:
> > https://drive.google.com/open?id=1H2jI0roeiWxtzReB8qV6QxDkNpaki99A
> >
> > agent on host1:
> > https://drive.google.com/open?id=1u8YHYIuyU2MA2UKY7G5z7q8p5XxU1zsy
> >
> > agent on host2:
> > https://drive.google.com/open?id=1YzkCL-FmTgPva-QHHp5vTM5Nb3qAXxz4
> >
> > But this virtual router stays in Starting state forever and hence VMs do
> > not start either.
> >
> > On Wed, Mar 20, 2019 at 2:49 PM Andrija Panic 
> > wrote:
> >
> > > Just to confirm, you are using Basic Zone in CloudStack, right ?
> > >
> > > Can you confirm that router has been actually started/created on KVM
> > side,
> > > again, as requested please post logs (mgmt and agent - and note the
> time
> > > around which you tried to start VR last time it partially succeeded) -
> we
> > > can't guess what went wrong without logs.
> > >
> > > I would push more effort solving this one, instead of reinstalling -
> you
> > > might hit the issue again and then it's no good.
> > >
> > > Can you confirm your zone/pod/cluster/hosts are all in Enabled state,
> > i.e.
> > > not disabled and hosts connected AND both SSVM and CPVM are
> > > connectedUP/green
> > >

Re: Disaster after maintenance

2019-03-20 Thread Andrija Panic

Hi,

2019-03-20 06:41:50,446 INFO  [c.c.u.DatabaseUpgradeChecker] (main:null)
(logid:) DB version = 4.10.0.0 Code Version = 4.11.2.0
2019-03-20 06:41:50,447 DEBUG [c.c.u.DatabaseUpgradeChecker] (main:null)
(logid:) Running upgrade Upgrade41000to41100 to upgrade from
4.10.0.0-4.11.0.0 to 4.11.0.0
fails due to
java.sql.SQLException: Error on rename of './cloud/ldap_trust_map' to
'./cloud/#sql2-2f01-13d' (errno: 152)

Then later...

com.cloud.exception.InsufficientServerCapacityException: Unable to create a
deployment for VM[SecondaryStorageVm|s-734-VM]Scope=interface
com.cloud.dc.DataCenter; id=3
com.cloud.exception.InsufficientServerCapacityException: Unable to create a
deployment for VM[ConsoleProxy|v-733-VM]Scope=interface
com.cloud.dc.DataCenter; id=3

2019-03-20 15:02:39,113 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
(secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 1 is ready to launch
secondary storage VM
2019-03-20 15:02:39,117 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
(secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 2 is not ready to launch
secondary storage VM yet
2019-03-20 15:02:39,122 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
(secstorage-1:ctx-059f87f3) (logid:cf6cf89a) Zone 3 is ready to launch
secondary storage VM

so did you start with clean 4.11.2 install, or was it upgraded one - I see
in logs an upgrade from DB version 4.10 to 4.11 was tried and failed...
Why do you have 3 zones in this installation - what is the setup ?
SSVM and CPVM (for whatever zone) are failing to be created...

yes, the VR can't be started, it get's timeout - in AGENT logs, I see that
it attemps to create a volume on primary storage...


Also, for SSVM I got this one...
2019-03-20 14:38:09,227 DEBUG [c.c.d.FirstFitPlanner]
(Work-Job-Executor-96:ctx-04c5c9f2 job-5120/job-6960 ctx-fde3d4d7)
(logid:49483c7a) No clusters found having a host with enough capacity,
returning.

Andrija

On Wed, 20 Mar 2019 at 16:39, Jevgeni Zolotarjov 
wrote:

> Basic Zone - Yes
>
> router has been actually started/created on KVM side - not created, not
> started. Thats the main problem, I guess
>
> agent.log
> https://drive.google.com/open?id=1rATxHKqgNKo2kD23BtlrZy_9gFXC-Bq-
>
> management log
> https://drive.google.com/open?id=1H2jI0roeiWxtzReB8qV6QxDkNpaki99A
>
> >> Can you confirm your zone/pod/cluster/hosts are all in Enabled state,
> i.e.
> YES, all green
>
> >> Can you connect your both KVM hosts can access/mount both Primary and
> Secondary Storage
> YES. Double checked
>
> >>>Can you also explain your infrastructure - you said you have two hosts
> only, where does CloudStack management run?
> 2 hosts:
> host1: 192.168.1.14
> host2: 192.168.1.5
>
> Servers are standing next to each other - connected to the same switch
> Management server runs on the same physical server with host1
>
> I noticed, that Virtual router gets created after I try to start any of the
> existing guest VM
> Here are logs
> management:
> https://drive.google.com/open?id=1H2jI0roeiWxtzReB8qV6QxDkNpaki99A
>
> agent on host1:
> https://drive.google.com/open?id=1u8YHYIuyU2MA2UKY7G5z7q8p5XxU1zsy
>
> agent on host2:
> https://drive.google.com/open?id=1YzkCL-FmTgPva-QHHp5vTM5Nb3qAXxz4
>
> But this virtual router stays in Starting state forever and hence VMs do
> not start either.
>
> On Wed, Mar 20, 2019 at 2:49 PM Andrija Panic 
> wrote:
>
> > Just to confirm, you are using Basic Zone in CloudStack, right ?
> >
> > Can you confirm that router has been actually started/created on KVM
> side,
> > again, as requested please post logs (mgmt and agent - and note the time
> > around which you tried to start VR last time it partially succeeded) - we
> > can't guess what went wrong without logs.
> >
> > I would push more effort solving this one, instead of reinstalling - you
> > might hit the issue again and then it's no good.
> >
> > Can you confirm your zone/pod/cluster/hosts are all in Enabled state,
> i.e.
> > not disabled and hosts connected AND both SSVM and CPVM are
> > connectedUP/green
> > Is your dashboard in GUI all green - no issues there ?
> > Can you connect your both KVM hosts can access/mount both Primary and
> > Secondary Storage
> >
> > On Wed, 20 Mar 2019 at 13:15, Jevgeni Zolotarjov  >
> > wrote:
> >
> > > After dozen of attempts, the Virtual Router could finally be recreated.
> > But
> > > its in eternal Starting status, and console prompts it required upgrade
> > and
> > > Version is UNKNOWN
> > >
> > > It does not resolve the problem, I cannot move further form this point.
> > > Any hints?
> > >
> > > Or I am condemned to do reinstall cloudstack from scratch?
> > >
> > > On Wed, Mar 20, 2019 at 11:08 AM Jevgeni Zolotarjov <
> > > j.zolotar...@gmail.com>
> > > wrote:
> > >
> > > > Under this defaultGuestNetwork, I go to Virtual Appliances. There is
> no
> > > > VMS - "no data to show"
> > > >
> > > > I dont have any network, other than this single default one.
> > > >
> > > > I've tried adding new network - Add guest network. But I am not able
>

Re: Disaster after maintenance

2019-03-20 Thread Jevgeni Zolotarjov

Basic Zone - Yes

router has been actually started/created on KVM side - not created, not
started. Thats the main problem, I guess

agent.log
https://drive.google.com/open?id=1rATxHKqgNKo2kD23BtlrZy_9gFXC-Bq-

management log
https://drive.google.com/open?id=1H2jI0roeiWxtzReB8qV6QxDkNpaki99A

>> Can you confirm your zone/pod/cluster/hosts are all in Enabled state,
i.e.
YES, all green

>> Can you connect your both KVM hosts can access/mount both Primary and
Secondary Storage
YES. Double checked

>>>Can you also explain your infrastructure - you said you have two hosts
only, where does CloudStack management run?
2 hosts:
host1: 192.168.1.14
host2: 192.168.1.5

Servers are standing next to each other - connected to the same switch
Management server runs on the same physical server with host1

I noticed, that Virtual router gets created after I try to start any of the
existing guest VM
Here are logs
management:
https://drive.google.com/open?id=1H2jI0roeiWxtzReB8qV6QxDkNpaki99A

agent on host1:
https://drive.google.com/open?id=1u8YHYIuyU2MA2UKY7G5z7q8p5XxU1zsy

agent on host2:
https://drive.google.com/open?id=1YzkCL-FmTgPva-QHHp5vTM5Nb3qAXxz4

But this virtual router stays in Starting state forever and hence VMs do
not start either.

On Wed, Mar 20, 2019 at 2:49 PM Andrija Panic 
wrote:

> Just to confirm, you are using Basic Zone in CloudStack, right ?
>
> Can you confirm that router has been actually started/created on KVM side,
> again, as requested please post logs (mgmt and agent - and note the time
> around which you tried to start VR last time it partially succeeded) - we
> can't guess what went wrong without logs.
>
> I would push more effort solving this one, instead of reinstalling - you
> might hit the issue again and then it's no good.
>
> Can you confirm your zone/pod/cluster/hosts are all in Enabled state, i.e.
> not disabled and hosts connected AND both SSVM and CPVM are
> connectedUP/green
> Is your dashboard in GUI all green - no issues there ?
> Can you connect your both KVM hosts can access/mount both Primary and
> Secondary Storage
>
> On Wed, 20 Mar 2019 at 13:15, Jevgeni Zolotarjov 
> wrote:
>
> > After dozen of attempts, the Virtual Router could finally be recreated.
> But
> > its in eternal Starting status, and console prompts it required upgrade
> and
> > Version is UNKNOWN
> >
> > It does not resolve the problem, I cannot move further form this point.
> > Any hints?
> >
> > Or I am condemned to do reinstall cloudstack from scratch?
> >
> > On Wed, Mar 20, 2019 at 11:08 AM Jevgeni Zolotarjov <
> > j.zolotar...@gmail.com>
> > wrote:
> >
> > > Under this defaultGuestNetwork, I go to Virtual Appliances. There is no
> > > VMS - "no data to show"
> > >
> > > I dont have any network, other than this single default one.
> > >
> > > I've tried adding new network - Add guest network. But I am not able to
> > do
> > > so, cause in the wizard popup, it offers empty dropdown with Zones
> > > selection. And this wizard doesnt not allow to go further without
> > selecting
> > > Zone
> > >
> > > On Wed, Mar 20, 2019 at 10:28 AM Andrija Panic <
> andrija.pa...@gmail.com>
> > > wrote:
> > >
> > >> you need to delete/remove all VMs inside this network (tick the
> > "Expunge"
> > >> button during VM deletion - if you want to really delete the VMs) in
> > order
> > >> to be able to delete the network - OR simply attach this VM to another
> > >> network, make this new network a DEFAULT one (NIC that is...), and
> then
> > >> detach from old network - and then effectively your VM was "removed"
> > from
> > >> old network - after this you should be able to delete the old
> network. I
> > >> assume some DB incosistencies perhaps, being the reason you can not
> > >> restart
> > >> the network.
> > >>
> > >> Did you try restarting some other Network - or deploying a new
> network,
> > >> spin a VM in it, then again try to restart this new network - does it
> > >> work ?
> > >>
> > >> Andrija
> > >>
> > >> On Wed, 20 Mar 2019 at 08:58, Jevgeni Zolotarjov <
> > j.zolotar...@gmail.com>
> > >> wrote:
> > >>
> > >> > >>>Stop mgmt,
> > >> > >>>Stop all agents
> > >> > >>>Restart libvirtd (and check libvirt logs afterwards)
> > >> > >>>Start agents
> > >> > >>>Start mgmt.
> > >> >
> > >> > I did that numerous time. Nothing really suspicious
> > >> > I can see that systems VMs are running - both in cloudstack console
> > and
> > >> > with virsh list -all
> > >> >
> > >> > It is apparently problem with network.
> > >> > Is there a way to force recreation of defaultGuestNetwork? or force
> > >> > recreation of Virtual Router.
> > >> > I am unable to delete network, which is supposed to rebuild network
> > with
> > >> > its router. Thats the issue
> > >> >
> > >> > The issue with libvirtd was, that eventually at some point it was
> > >> updated
> > >> > during 4 months of running, and not rebooted. It still worked. We
> had
> > to
> > >> > add listen_tcp = 1 for libvirtd to start working again.
> > >> >
> > >> > On Wed,

Re: Disaster after maintenance

2019-03-20 Thread Dag Sonstebo

Jevgeni,

Can you also explain your infrastructure - you said you have two hosts only, 
where does CloudStack management run?

Reason I'm asking is when checking your logs from yesterday the IP address 
192.168.1.14 seems to be used for management, NFS and a KVM host? Is this the 
case, do you co-host everything on the same server?

Regards,
Dag Sonstebo
Cloud Architect
ShapeBlue
 

On 20/03/2019, 12:49, "Andrija Panic"  wrote:

Just to confirm, you are using Basic Zone in CloudStack, right ?

Can you confirm that router has been actually started/created on KVM side,
again, as requested please post logs (mgmt and agent - and note the time
around which you tried to start VR last time it partially succeeded) - we
can't guess what went wrong without logs.

I would push more effort solving this one, instead of reinstalling - you
might hit the issue again and then it's no good.

Can you confirm your zone/pod/cluster/hosts are all in Enabled state, i.e.
not disabled and hosts connected AND both SSVM and CPVM are
connectedUP/green
Is your dashboard in GUI all green - no issues there ?
Can you connect your both KVM hosts can access/mount both Primary and
Secondary Storage

On Wed, 20 Mar 2019 at 13:15, Jevgeni Zolotarjov 
wrote:

> After dozen of attempts, the Virtual Router could finally be recreated. 
But
> its in eternal Starting status, and console prompts it required upgrade 
and
> Version is UNKNOWN
>
> It does not resolve the problem, I cannot move further form this point.
> Any hints?
>
> Or I am condemned to do reinstall cloudstack from scratch?
>
> On Wed, Mar 20, 2019 at 11:08 AM Jevgeni Zolotarjov <
> j.zolotar...@gmail.com>
> wrote:
>
> > Under this defaultGuestNetwork, I go to Virtual Appliances. There is no
> > VMS - "no data to show"
> >
> > I dont have any network, other than this single default one.
> >
> > I've tried adding new network - Add guest network. But I am not able to
> do
> > so, cause in the wizard popup, it offers empty dropdown with Zones
> > selection. And this wizard doesnt not allow to go further without
> selecting
> > Zone
> >
> > On Wed, Mar 20, 2019 at 10:28 AM Andrija Panic 
> > wrote:
> >
> >> you need to delete/remove all VMs inside this network (tick the
> "Expunge"
> >> button during VM deletion - if you want to really delete the VMs) in
> order
> >> to be able to delete the network - OR simply attach this VM to another
> >> network, make this new network a DEFAULT one (NIC that is...), and then
> >> detach from old network - and then effectively your VM was "removed"
> from
> >> old network - after this you should be able to delete the old network. 
I
> >> assume some DB incosistencies perhaps, being the reason you can not
> >> restart
> >> the network.
> >>
> >> Did you try restarting some other Network - or deploying a new network,
> >> spin a VM in it, then again try to restart this new network - does it
> >> work ?
> >>
> >> Andrija
> >>
> >> On Wed, 20 Mar 2019 at 08:58, Jevgeni Zolotarjov <
> j.zolotar...@gmail.com>
> >> wrote:
> >>
> >> > >>>Stop mgmt,
> >> > >>>Stop all agents
> >> > >>>Restart libvirtd (and check libvirt logs afterwards)
> >> > >>>Start agents
> >> > >>>Start mgmt.
> >> >
> >> > I did that numerous time. Nothing really suspicious
> >> > I can see that systems VMs are running - both in cloudstack console
> and
> >> > with virsh list -all
> >> >
> >> > It is apparently problem with network.
> >> > Is there a way to force recreation of defaultGuestNetwork? or force
> >> > recreation of Virtual Router.
> >> > I am unable to delete network, which is supposed to rebuild network
> with
> >> > its router. Thats the issue
> >> >
> >> > The issue with libvirtd was, that eventually at some point it was
> >> updated
> >> > during 4 months of running, and not rebooted. It still worked. We had
> to
> >> > add listen_tcp = 1 for libvirtd to start working again.
> >> >
> >> > On Wed, Mar 20, 2019 at 9:49 AM Andrija Panic <
> andrija.pa...@gmail.com>
> >> > wrote:
> >> >
> >> > > As Sergey suggested... but i would also verify no libvirt issues or
> >> > storage
> >> > > pool issues - so perhaps:
> >> > >
> >> > > Stop mgmt,
> >> > > Stop all agents
> >> > > Restart libvirtd (and check libvirt logs afterwards)
> >> > > Start agents
> >> > > Start mgmt.
> >> > >
> >> > > What was originally issue with libvirtd ?
> >> > > That sounds fishy to me...
> >> > >
> >> > > Andrija
> >> > >
> >> > > On Wed, Mar 20, 2019, 02:15 Sergey Levitskiy 
> >> > wrote:
> >> > >
> >> > > > select * from networks where

Re: Disaster after maintenance

2019-03-20 Thread Andrija Panic

Just to confirm, you are using Basic Zone in CloudStack, right ?

Can you confirm that router has been actually started/created on KVM side,
again, as requested please post logs (mgmt and agent - and note the time
around which you tried to start VR last time it partially succeeded) - we
can't guess what went wrong without logs.

I would push more effort solving this one, instead of reinstalling - you
might hit the issue again and then it's no good.

Can you confirm your zone/pod/cluster/hosts are all in Enabled state, i.e.
not disabled and hosts connected AND both SSVM and CPVM are
connectedUP/green
Is your dashboard in GUI all green - no issues there ?
Can you connect your both KVM hosts can access/mount both Primary and
Secondary Storage

On Wed, 20 Mar 2019 at 13:15, Jevgeni Zolotarjov 
wrote:

> After dozen of attempts, the Virtual Router could finally be recreated. But
> its in eternal Starting status, and console prompts it required upgrade and
> Version is UNKNOWN
>
> It does not resolve the problem, I cannot move further form this point.
> Any hints?
>
> Or I am condemned to do reinstall cloudstack from scratch?
>
> On Wed, Mar 20, 2019 at 11:08 AM Jevgeni Zolotarjov <
> j.zolotar...@gmail.com>
> wrote:
>
> > Under this defaultGuestNetwork, I go to Virtual Appliances. There is no
> > VMS - "no data to show"
> >
> > I dont have any network, other than this single default one.
> >
> > I've tried adding new network - Add guest network. But I am not able to
> do
> > so, cause in the wizard popup, it offers empty dropdown with Zones
> > selection. And this wizard doesnt not allow to go further without
> selecting
> > Zone
> >
> > On Wed, Mar 20, 2019 at 10:28 AM Andrija Panic 
> > wrote:
> >
> >> you need to delete/remove all VMs inside this network (tick the
> "Expunge"
> >> button during VM deletion - if you want to really delete the VMs) in
> order
> >> to be able to delete the network - OR simply attach this VM to another
> >> network, make this new network a DEFAULT one (NIC that is...), and then
> >> detach from old network - and then effectively your VM was "removed"
> from
> >> old network - after this you should be able to delete the old network. I
> >> assume some DB incosistencies perhaps, being the reason you can not
> >> restart
> >> the network.
> >>
> >> Did you try restarting some other Network - or deploying a new network,
> >> spin a VM in it, then again try to restart this new network - does it
> >> work ?
> >>
> >> Andrija
> >>
> >> On Wed, 20 Mar 2019 at 08:58, Jevgeni Zolotarjov <
> j.zolotar...@gmail.com>
> >> wrote:
> >>
> >> > >>>Stop mgmt,
> >> > >>>Stop all agents
> >> > >>>Restart libvirtd (and check libvirt logs afterwards)
> >> > >>>Start agents
> >> > >>>Start mgmt.
> >> >
> >> > I did that numerous time. Nothing really suspicious
> >> > I can see that systems VMs are running - both in cloudstack console
> and
> >> > with virsh list -all
> >> >
> >> > It is apparently problem with network.
> >> > Is there a way to force recreation of defaultGuestNetwork? or force
> >> > recreation of Virtual Router.
> >> > I am unable to delete network, which is supposed to rebuild network
> with
> >> > its router. Thats the issue
> >> >
> >> > The issue with libvirtd was, that eventually at some point it was
> >> updated
> >> > during 4 months of running, and not rebooted. It still worked. We had
> to
> >> > add listen_tcp = 1 for libvirtd to start working again.
> >> >
> >> > On Wed, Mar 20, 2019 at 9:49 AM Andrija Panic <
> andrija.pa...@gmail.com>
> >> > wrote:
> >> >
> >> > > As Sergey suggested... but i would also verify no libvirt issues or
> >> > storage
> >> > > pool issues - so perhaps:
> >> > >
> >> > > Stop mgmt,
> >> > > Stop all agents
> >> > > Restart libvirtd (and check libvirt logs afterwards)
> >> > > Start agents
> >> > > Start mgmt.
> >> > >
> >> > > What was originally issue with libvirtd ?
> >> > > That sounds fishy to me...
> >> > >
> >> > > Andrija
> >> > >
> >> > > On Wed, Mar 20, 2019, 02:15 Sergey Levitskiy 
> >> > wrote:
> >> > >
> >> > > > select * from networks where removed is null;
> >> > > > select * from vm_instance where id=87;
> >> > > > select id,name from vm_instance where name like 'r%' and removed
> is
> >> > null;
> >> > > >
> >> > > > Basically since the network offering is not redundant this error
> is
> >> > only
> >> > > > thrown when there is no router associated with your network.
> Usually
> >> > > > management server restart tries to implement network again. Please
> >> > > restart
> >> > > > management server, save and share management server log.
> >> > > >
> >> > > >
> >> > > >
> >> > > >
> >> > > > On 3/19/19, 3:31 PM, "Jevgeni Zolotarjov" <
> j.zolotar...@gmail.com>
> >> > > wrote:
> >> > > >
> >> > > > Check network_offering table for  value in column
> >> > > > redundant_router_service  for the network offering you use.
> >> > > > in table network_offering_table all records have
> >> > > > redundant_router_service =

Re: Disaster after maintenance

2019-03-20 Thread Jevgeni Zolotarjov

After dozen of attempts, the Virtual Router could finally be recreated. But
its in eternal Starting status, and console prompts it required upgrade and
Version is UNKNOWN

It does not resolve the problem, I cannot move further form this point.
Any hints?

Or I am condemned to do reinstall cloudstack from scratch?

On Wed, Mar 20, 2019 at 11:08 AM Jevgeni Zolotarjov 
wrote:

> Under this defaultGuestNetwork, I go to Virtual Appliances. There is no
> VMS - "no data to show"
>
> I dont have any network, other than this single default one.
>
> I've tried adding new network - Add guest network. But I am not able to do
> so, cause in the wizard popup, it offers empty dropdown with Zones
> selection. And this wizard doesnt not allow to go further without selecting
> Zone
>
> On Wed, Mar 20, 2019 at 10:28 AM Andrija Panic 
> wrote:
>
>> you need to delete/remove all VMs inside this network (tick the "Expunge"
>> button during VM deletion - if you want to really delete the VMs) in order
>> to be able to delete the network - OR simply attach this VM to another
>> network, make this new network a DEFAULT one (NIC that is...), and then
>> detach from old network - and then effectively your VM was "removed" from
>> old network - after this you should be able to delete the old network. I
>> assume some DB incosistencies perhaps, being the reason you can not
>> restart
>> the network.
>>
>> Did you try restarting some other Network - or deploying a new network,
>> spin a VM in it, then again try to restart this new network - does it
>> work ?
>>
>> Andrija
>>
>> On Wed, 20 Mar 2019 at 08:58, Jevgeni Zolotarjov 
>> wrote:
>>
>> > >>>Stop mgmt,
>> > >>>Stop all agents
>> > >>>Restart libvirtd (and check libvirt logs afterwards)
>> > >>>Start agents
>> > >>>Start mgmt.
>> >
>> > I did that numerous time. Nothing really suspicious
>> > I can see that systems VMs are running - both in cloudstack console and
>> > with virsh list -all
>> >
>> > It is apparently problem with network.
>> > Is there a way to force recreation of defaultGuestNetwork? or force
>> > recreation of Virtual Router.
>> > I am unable to delete network, which is supposed to rebuild network with
>> > its router. Thats the issue
>> >
>> > The issue with libvirtd was, that eventually at some point it was
>> updated
>> > during 4 months of running, and not rebooted. It still worked. We had to
>> > add listen_tcp = 1 for libvirtd to start working again.
>> >
>> > On Wed, Mar 20, 2019 at 9:49 AM Andrija Panic 
>> > wrote:
>> >
>> > > As Sergey suggested... but i would also verify no libvirt issues or
>> > storage
>> > > pool issues - so perhaps:
>> > >
>> > > Stop mgmt,
>> > > Stop all agents
>> > > Restart libvirtd (and check libvirt logs afterwards)
>> > > Start agents
>> > > Start mgmt.
>> > >
>> > > What was originally issue with libvirtd ?
>> > > That sounds fishy to me...
>> > >
>> > > Andrija
>> > >
>> > > On Wed, Mar 20, 2019, 02:15 Sergey Levitskiy 
>> > wrote:
>> > >
>> > > > select * from networks where removed is null;
>> > > > select * from vm_instance where id=87;
>> > > > select id,name from vm_instance where name like 'r%' and removed is
>> > null;
>> > > >
>> > > > Basically since the network offering is not redundant this error is
>> > only
>> > > > thrown when there is no router associated with your network. Usually
>> > > > management server restart tries to implement network again. Please
>> > > restart
>> > > > management server, save and share management server log.
>> > > >
>> > > >
>> > > >
>> > > >
>> > > > On 3/19/19, 3:31 PM, "Jevgeni Zolotarjov" 
>> > > wrote:
>> > > >
>> > > > Check network_offering table for  value in column
>> > > > redundant_router_service  for the network offering you use.
>> > > > in table network_offering_table all records have
>> > > > redundant_router_service =
>> > > > 0
>> > > >
>> > > > Can you also run the following:
>> > > > >>>select name, state, removed  from host where name like 'r%'
>> > > > returns zero rows - nothing
>> > > >
>> > > > >>>select * from domain_router;
>> > > > # id, element_id, public_mac_address, public_ip_address,
>> > > > public_netmask,
>> > > > guest_netmask, guest_ip_address, is_redundant_router, priority,
>> > > > redundant_state, stop_pending, role, template_version,
>> > > scripts_version,
>> > > > vpc_id, update_state
>> > > > '4', '1', NULL, NULL, NULL, NULL, NULL, '0', NULL, 'UNKNOWN',
>> '0',
>> > > > 'VIRTUAL_ROUTER', 'Cloudstack Release 4.11.0 Sun Jan 14 15:37:28
>> > UTC
>> > > > 2018',
>> > > > '57db7bd8118977a5f2cd3ef1c7503633\n', NULL, NULL
>> > > > '49', '1', NULL, NULL, NULL, NULL, NULL, '0', NULL, 'UNKNOWN',
>> '0',
>> > > > 'VIRTUAL_ROUTER', 'Cloudstack Release 4.11.0 Sun Jan 14 15:37:28
>> > UTC
>> > > > 2018',
>> > > > 'c03a474302d89fa82d345e10fe4cb751\n', NULL, NULL
>> > > > '73', '1', NULL, NULL, NULL, NULL, NULL, '0', NULL, 'UNKNOWN',
>> '0',
>> > > > 'VIRTUAL_ROUTER',

Re: Disaster after maintenance

2019-03-20 Thread Jevgeni Zolotarjov

   > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > > > > > > > > >> > >>at
> > > > > > > > > > >
> > > > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > > > > > > > > Method)
> > > > > > > > > > > > > > > >> > >>at
> > > > > > > > > > >
> > > > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > > > > > Source)
> > > > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > >
> > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > &g

Re: Disaster after maintenance

2019-03-20 Thread Andrija Panic

; > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > > > > > > > >> > >>at
> > > > > > > > > >
> > > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > > > > > > > Method)
> > > > > > > > > > > > > > >> > >>at
> > > > > > > > > >
> > > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > > > > Source)
> > > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > >
> > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > > > > >> > Source)
> > > > > > > > > > > > > > >> > >>at
> > > > > java.lang.reflect.Method.invoke(Unknown
> > > > > > > > Source)
> > > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > > > >> > >>
> > > > > > > > > > > > > > >> >
> > > > > > > > > > > > > > >>
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
>

Re: Disaster after maintenance

2019-03-20 Thread Jevgeni Zolotarjov

rchestrator.restartNetwork(NetworkOrchestrator.java:2853)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > > >> > >>
> > > > > > > > > > > > > >> >
> > > > > > > > > > > > > >>
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > >
> > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > > > > > > Method)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > >
> > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > > > Source)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > > > >
> > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > > > >> > Source)
> > > > > > > > > > > > > >> > >>at
> > > > java.lang.reflect.Method.invoke(Unknown
> > > > > > > Source)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > > >> > >>
> > > > > > > > > > > > > >> >
> > > > > > > > > > > > > >>
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > > >> > >>
> > > > > > > > > > > > > >> >
> > > > > > > > > > > > > >>
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > > >> > >>
> > >

Re: Disaster after maintenance

2019-03-20 Thread Andrija Panic

invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > >> > >>
> > > > > > > > > > > > >> >
> > > > > > > > > > > > >>
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > > > > >> > >>at
> > > > > > > > > > > > >> > >>
> > > > > > > > > > > > >> >
> > > > > > > > > > > > >>
> >     >     > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > &g

Re: Disaster after maintenance

2019-03-19 Thread Sergey Levitskiy

> > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > >

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

t; > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > > > > >> > >>at
> > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > > > > Method)
> > > > > > > > > > > >> > >>at
> > > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > Source)
> > > > > > > > > > > >> > >>at
> > > > > > > > >
> sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > > >> > Source)
> > > > > > > > > > > >> > >>at
> > java.lang.reflect.Method.invoke(Unknown
> > > > > Source)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > > > >> > >>at
> > > > > > > > > > > >> > >>
> > > > > > > > > > > >> >
> > > > > > > > > > > >>
> >     > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > > > > >> > >>

Re: Disaster after maintenance

2019-03-19 Thread Sergey Levitskiy

> > >
> >
> 
com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > > > >> > >>at
> > > > > > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > > > Method)
> > > > > > > > > > >> > >>at
> > > > > > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > Source)
> > > > > > > > > > >> > >>at
> > > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > > > > >> > Source)
> > > > > > > > > > >> > >>at
> java.lang.reflect.Method.invoke(Unknown
> > > > Source)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
    > > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > > >> > &

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

ion.java:197)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > > >> > >>    at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > > > > > > > > > >> > >>at
> > > > > > > > > > >> > >>
> > > > > > > > > > >> >
> > > > > > > > > > >>
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > > > >

Re: Disaster after maintenance

2019-03-19 Thread Sergey Levitskiy

; >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > > >> > >>at
> > > > > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > > Method)
> > > > > > > > > >> > >>at
> > > > > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > > Source)
> > > > > > > > > >> > >>at
> > > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > > > >> > Source)
> > > > > > > > > >> > >>at java.lang.reflect.Method.invoke(Unknown
> > > Source)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
    > > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> 
org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> &

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

t; org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >
> > > > > > > > > >>
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > > > > > > > > >> > >>at
> > > > > > > > > >> > >>
> > > > > > > > > >> >

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

ator.implementNetworkElements(NetworkOrchestrator.java:1203)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.rollingRestartRouters(NetworkOrchestrator.java:2948)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.restartNetwork(NetworkOrchestrator.java:2853)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > > > >> > >>at
> > > > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > > > Method)
> > > > > > > > >> > >>at
> > > > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > > > Source)
> > > > > > > > >> > >>at
> > > > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > > > >> > Source)
> > > > > > > > >> > >>at java.lang.reflect.Method.invoke(Unknown
> > Source)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > > >> > >>at
> > > > > > > > >> > >>
> > > > > > > > >> >
> > > > > > > > >>
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > >

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

; > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> > > > > > > >> > >>at
> com.sun.proxy.$Proxy229.restartNetwork(Unknown
> > > > > Source)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> > > > > > > >> > >>at
> > > > > > > >> com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> > > > > > > >> > >>at
> > > > > > > >> > >>
> > > > > > > >> >
> > > > > > > >>
> > > > > > >
> > > > > >
> > > > >
> > &

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

gt; > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.element.VirtualRouterElement.implement(VirtualRouterElement.java:243)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.implementNetworkElements(NetworkOrchestrator.java:1203)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.rollingRestartRouters(NetworkOrchestrator.java:2948)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.restartNetwork(NetworkOrchestrator.java:2853)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > > >> > >>at
> > sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > > Method)
> > > > > > >> > >>at
> > sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > > Source)
> > > > > > >> > >>at
> > > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > > >> > Source)
> > > > > > >> > >>at java.lang.reflect.Method.invoke(Unknown Source)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > > > > >> >
> > > > > > >>
> > > > > >
> > > > >
> > > >
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > > > >> > >>at
> > > > > > >> > >>
> > > &g

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

ion.NetworkOrchestrator.restartNetwork(NetworkOrchestrator.java:2853)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> > > > > >> > >>at
> sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> > > > > Method)
> > > > > >> > >>at
> sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> > > > > Source)
> > > > > >> > >>at
> > > sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> > > > > >> > Source)
> > > > > >> > >>at java.lang.reflect.Method.invoke(Unknown Source)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> > > > > >> > >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown
> > > Source)
> > > > > >> > >>at
> > > > > >> > >>
> > > > > >> >
> > > > > >>
> > > > >
> > > >
> > >
> >
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
&g

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

;>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> > > > >> > >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown
> > Source)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> > > > >> > >>at
> > > > >> com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
> > >
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> > > > >> > >>at
> > > > >> > >>
> > > > >> >
> > > > >>
> > > >
&

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

; >> >
> > > >>
> > >
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> > > >> > >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown
> Source)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> > > >> > >>at
> > > >> com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
> > > >> > >>at
> > > >> > >>
> > > >> >
> > > >>
> > >
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
> > > >> > >>at
> > > java.util.concurrent.Executors$RunnableAdapter.call(Unknown
> > > >> > >> Source)
> > > >> > >>at java.util.concurrent.FutureTask.run(Unknown Source)
> > > >> > >>at
> > java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
> > > >> > >> Source)
> > > >> > >>at
> > > java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
> > > >> > >> Source)
> > > >> > >>at java.lang.Thread.run(Unknown Source)
> > > >> > >> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > > >> > >> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099)
> > > Complete
> > > >> > async
> > > >> > >> job-5093, jobStatus: FAILED, resultCode: 530, result:
> > > >> > >>
> > > >>

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

oud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
> > >> > >>at
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
> > >> > >>at
> > java.util.concurrent.Executors$RunnableAdapter.call(Unknown
> > >> > >> Source)
> > >> > >>at java.util.concurrent.FutureTask.run(Unknown Source)
> > >> > >>at
> java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
> > >> > >> Source)
> > >> > >>at
> > java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
> > >> > >> Source)
> > >> > >>at java.lang.Thread.run(Unknown Source)
> > >> > >> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > >> > >> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099)
> > Complete
> > >> > async
> > >> > >> job-5093, jobStatus: FAILED, resultCode: 530, result:
> > >> > >>
> > >> >
> > >>
> >
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
> > >> > >> [DataCenter:1] is un
> > >> > >>
> > >> > >> On Tue, Mar 19, 2019 at 4:19 PM Andrija Panic <
> > >> > andrija.pa...@shapeblue.com>
> > >> > >> wrote:
> > >> > >>
> > >> > >>> 
> > >> > >>> Your network can't be deleted due to "Can't delete the network,
> > not
> > >> all
> > >> > >>> user vms are expunged. Vm
> > >> > >>> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
> > >> > >>>
> > >> > >>> You should be able to just start the user VM - but if you have
> > >> actually
> > >> > >>> delete the VR itself, then just do Network restart with
> "cleanup"
> > >> and
> > >> > it
> > >> > >>> will recreate a new VR, after which you should be able to start
> > the
> > >> VM.
> > >> > >>>
> > >> > >>> Andrija
> > >> > >>>
> > >> > >>> andrija.pa...@shapeblue.com
> > >> > >>> www.shapeblue.com
> > >> > >>> Amadeus House, Floral Street, London  WC2E 9DPUK
> > >> > >>> @shapeblue
> > >> > >>>
> > >> > >>>
> > >> > >>>
> > >> > >>>
> > >> > >>> -Original Message-
> > >> > >>> From: Jevgeni Zolotarjov 
> > >> > >>> Sent: 19 March 2019 15:10
> > >> > >>> To: users@cloudstac

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

Element.java:243)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.implementNetworkElements(NetworkOrchestrator.java:1203)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.rollingRestartRouters(NetworkOrchestrator.java:2948)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.restartNetwork(NetworkOrchestrator.java:2853)
> >> > >>at
> >> > >>
> >> >
> >>
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> >> > >>at sun.reflect.NativeMethodAccessorImpl.invoke0(Native
> Method)
> >> > >>at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown
> Source)
> >> > >>at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
> >> > Source)
> >> > >>at java.lang.reflect.Method.invoke(Unknown Source)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> >> > >>at
> >> > >>
> >> >
> >>
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> >> > >>at
> >> > >>
> >> >
> >>
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> >> > >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown Source)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> >> > >>at
> >> com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> >> > >>at
> >> > >>
> >> >
> >>
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> >> > >>at
> >> > >>
> >> >
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> >> > >>at
> >> > >>
> >&g

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

 >>at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> > >>at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
>> > >>at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown
>> > Source)
>> > >>at java.lang.reflect.Method.invoke(Unknown Source)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
>> > >>at
>> > >>
>> >
>> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
>> > >>at
>> > >>
>> >
>> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
>> > >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown Source)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
>> > >>at
>> com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
>> > >>at
>> > >>
>> >
>> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
>> > >>at
>> > >>
>> >
>> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
>> > >>at java.util.concurrent.Executors$RunnableAdapter.call(Unknown
>> > >> Source)
>> > >>at java.util.concurrent.FutureTask.run(Unknown Source)
>> > >>at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
>> > >> Source)
>> > >>at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
>> > >> Source)
>> > >>at java.lang.Thread.run(Unknown Source)
>> > >> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
>> > >> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Complete
>> > async
>> > >> job-5093, jobStatus: FAILED, resultCode: 530, result:
>> > >>
>> >
>> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
>> > >> [DataCenter:1] is un

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

work.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> > >>at
> > >>
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> > >>at
> > >>
> >
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> > >>at
> > >>
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > >>at
> > >>
> >
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> > >>at
> > >>
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> > >>at
> > >>
> >
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> > >>at
> > >>
> >
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> > >>at
> > >>
> >
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> > >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown Source)
> > >>at
> > >>
> >
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> > >>at com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> > >>at
> > >>
> >
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> > >>at
> > >>
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> > >>at
> > >>
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> > >>at
> > >>
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> > >>at
> > >>
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> > >>at
> > >>
> >
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> > >>at
> > >>
> >
> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
> > >>at
> > >>
> >
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
> > >>at java.util.concurrent.Executors$RunnableAdapter.call(Unknown
> > >> Source)
> > >>at java.util.concurrent.FutureTask.run(Unknown Source)
> > >>at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
> > >> Source)
> > >>at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
> > >> Source)
> > >>at java.lang.Thread.run(Unknown Source)
> > >> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > >> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Complete
> > async
> > >> job-5093, jobStatus: FAILED, resultCode: 530, result:
> > >>
> >
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
> > >> [DataCenter:1] is un
> > >>
> > >> On Tue, Mar 19, 2019 at 4:19 PM Andrija Panic <
> > andrija.pa...@shapeblue.com>
> > >> wrote:
> > >>
> > >>> 
> > >>> Your network can't be deleted due to "Can't delete the network, not
> all
> > >>> user vms are expunged. Vm
> > >>> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
> > >>>
> > >>> You should be able to just start the user VM - but if you have
> actually
> > >>> delete the VR itself, then just do Network restart with "cleanup" and
> > it
> > >>> will recreate a new VR, after which you should be able to start the
> VM.
> > >>>
> > >>> Andrija
> > >>>
> > >>> andrija.pa...@shapeblue.com
> > >>> www.shapeblue.com
> > >>> Amadeus House, Fl

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

nterceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> >>at
> >>
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> >>at
> >>
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown Source)
> >>at
> >>
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> >>at com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> >>at
> >>
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> >>at
> >>
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> >>at
> >>
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> >>at
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> >>at
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> >>at
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> >>at
> >>
> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
> >>at
> >>
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
> >>at java.util.concurrent.Executors$RunnableAdapter.call(Unknown
> >> Source)
> >>at java.util.concurrent.FutureTask.run(Unknown Source)
> >>at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
> >> Source)
> >>at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
> >> Source)
> >>at java.lang.Thread.run(Unknown Source)
> >> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> >> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Complete
> async
> >> job-5093, jobStatus: FAILED, resultCode: 530, result:
> >>
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
> >> [DataCenter:1] is un
> >>
> >> On Tue, Mar 19, 2019 at 4:19 PM Andrija Panic <
> andrija.pa...@shapeblue.com>
> >> wrote:
> >>
> >>> 
> >>> Your network can't be deleted due to "Can't delete the network, not all
> >>> user vms are expunged. Vm
> >>> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
> >>>
> >>> You should be able to just start the user VM - but if you have actually
> >>> delete the VR itself, then just do Network restart with "cleanup" and
> it
> >>> will recreate a new VR, after which you should be able to start the VM.
> >>>
> >>> Andrija
> >>>
> >>> andrija.pa...@shapeblue.com
> >>> www.shapeblue.com
> >>> Amadeus House, Floral Street, London  WC2E 9DPUK
> >>> @shapeblue
> >>>
> >>>
> >>>
> >>>
> >>> -Original Message-
> >>> From: Jevgeni Zolotarjov 
> >>> Sent: 19 March 2019 15:10
> >>> To: users@cloudstack.apache.org
> >>> Subject: Re: Disaster after maintenance
> >>>
> >>> I mean I cannot delete network: In the management server log I see
> >>>
> >>> ==
> >>> 019-03-19 14:06:36,316 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> >>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Executing
> >>> AsyncJobVO {id:5090, userId: 2, accountId: 2, instanceType: Network,
> >>> instanceId: 204, cmd:
> >>> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd,
> cmdInfo:
> >>>
> >>>
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2641","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
> >>>
> >>>
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2

Re: Disaster after maintenance

2019-03-19 Thread Andrija Panic

ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> >>at
> >>
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> >>at com.sun.proxy.$Proxy229.restartNetwork(Unknown Source)
> >>at
> >>
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> >>at com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> >>at
> >>
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> >>at
> >>
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> >>at
> >>
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> >>at
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> >>at
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> >>at
> >>
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> >>at
> >>
> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
> >>at
> >>
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
> >>at java.util.concurrent.Executors$RunnableAdapter.call(Unknown
> >> Source)
> >>at java.util.concurrent.FutureTask.run(Unknown Source)
> >>at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
> >> Source)
> >>at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
> >> Source)
> >>at java.lang.Thread.run(Unknown Source)
> >> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> >> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Complete
> async
> >> job-5093, jobStatus: FAILED, resultCode: 530, result:
> >>
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
> >> [DataCenter:1] is un
> >>
> >> On Tue, Mar 19, 2019 at 4:19 PM Andrija Panic <
> andrija.pa...@shapeblue.com>
> >> wrote:
> >>
> >>> 
> >>> Your network can't be deleted due to "Can't delete the network, not all
> >>> user vms are expunged. Vm
> >>> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
> >>>
> >>> You should be able to just start the user VM - but if you have actually
> >>> delete the VR itself, then just do Network restart with "cleanup" and
> it
> >>> will recreate a new VR, after which you should be able to start the VM.
> >>>
> >>> Andrija
> >>>
> >>> andrija.pa...@shapeblue.com
> >>> www.shapeblue.com
> >>> Amadeus House, Floral Street, London  WC2E 9DPUK
> >>> @shapeblue
> >>>
> >>>
> >>>
> >>>
> >>> -Original Message-
> >>> From: Jevgeni Zolotarjov 
> >>> Sent: 19 March 2019 15:10
> >>> To: users@cloudstack.apache.org
> >>> Subject: Re: Disaster after maintenance
> >>>
> >>> I mean I cannot delete network: In the management server log I see
> >>>
> >>> ==
> >>> 019-03-19 14:06:36,316 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> >>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Executing
> >>> AsyncJobVO {id:5090, userId: 2, accountId: 2, instanceType: Network,
> >>> instanceId: 204, cmd:
> >>> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd,
> cmdInfo:
> >>>
> >>>
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2641","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
> >>>
> >>>
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1553004396247"},
> >>> cmdVersion: 0, stat

Re: Disaster after maintenance

2019-03-19 Thread Boris Stoyanov

Impl$5.runInContext(AsyncJobManagerImpl.java:581)
>>at
>> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
>>at
>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
>>at
>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
>>at
>> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
>>at
>> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:46)
>>at
>> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.run(AsyncJobManagerImpl.java:529)
>>at java.util.concurrent.Executors$RunnableAdapter.call(Unknown
>> Source)
>>at java.util.concurrent.FutureTask.run(Unknown Source)
>>at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown
>> Source)
>>at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
>> Source)
>>at java.lang.Thread.run(Unknown Source)
>> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
>> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Complete async
>> job-5093, jobStatus: FAILED, resultCode: 530, result:
>> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
>> [DataCenter:1] is un
>> 
>> On Tue, Mar 19, 2019 at 4:19 PM Andrija Panic 
>> wrote:
>> 
>>> 
>>> Your network can't be deleted due to "Can't delete the network, not all
>>> user vms are expunged. Vm
>>> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
>>> 
>>> You should be able to just start the user VM - but if you have actually
>>> delete the VR itself, then just do Network restart with "cleanup" and it
>>> will recreate a new VR, after which you should be able to start the VM.
>>> 
>>> Andrija
>>> 
>>> andrija.pa...@shapeblue.com
>>> www.shapeblue.com
>>> Amadeus House, Floral Street, London  WC2E 9DPUK
>>> @shapeblue
>>> 
>>> 
>>> 
>>> 
>>> -Original Message-
>>> From: Jevgeni Zolotarjov 
>>> Sent: 19 March 2019 15:10
>>> To: users@cloudstack.apache.org
>>> Subject: Re: Disaster after maintenance
>>> 
>>> I mean I cannot delete network: In the management server log I see
>>> 
>>> ==
>>> 019-03-19 14:06:36,316 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
>>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Executing
>>> AsyncJobVO {id:5090, userId: 2, accountId: 2, instanceType: Network,
>>> instanceId: 204, cmd:
>>> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
>>> 
>>> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2641","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
>>> 
>>> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1553004396247"},
>>> cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
>>> result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
>>> null, lastPolled: null, created: null}
>>> 2019-03-19 14:06:36,351 WARN  [o.a.c.e.o.NetworkOrchestrator]
>>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090 ctx-134954fa) (logid:c734edfc)
>>> Can't delete the network, not all user vms are expunged. Vm
>>> VM[User|i-2-11-VM] is in Stopped state
>>> 2019-03-19 14:06:36,356 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
>>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Complete async
>>> job-5090, jobStatus: FAILED, resultCode: 530, result:
>>> 
>>> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Failed
>>> to delete network"}
>>> ==
>>> 
>>> 
>>> I deleted a router, expecting it to be recreated on deleting network. But
>>> I am unable to delete network because of above e

RE: Disaster after maintenance

2019-03-19 Thread Paul Angus

Somewhere in your error, I think that I saw a reference to rolling reboot. Try 
disabling that in the global settings.
( network.rolling.restart )

paul.an...@shapeblue.com 
www.shapeblue.com
Amadeus House, Floral Street, London  WC2E 9DPUK
@shapeblue
  
 


-Original Message-
From: Jevgeni Zolotarjov  
Sent: 19 March 2019 17:29
To: users@cloudstack.apache.org
Subject: Re: Disaster after maintenance

Guys, please help with it.
What can be done here?
There is too much valuable data.

On Tue, Mar 19, 2019 at 4:21 PM Jevgeni Zolotarjov 
wrote:

> Tried that just now and got error:
> Resource [DataCenter:1] is unreachable: Can't find all necessary 
> running routers!
>
> In the log I see:
> =
>
> 2019-03-19 14:20:39,644 DEBUG [o.a.c.e.o.NetworkOrchestrator] 
> (API-Job-Executor-4:ctx-7b6b69eb job-5093 ctx-9be30648) 
> (logid:265a6099) Restarting network 204...
> 2019-03-19 14:20:39,645 DEBUG [o.a.c.e.o.NetworkOrchestrator] 
> (API-Job-Executor-4:ctx-7b6b69eb job-5093 ctx-9be30648) 
> (logid:265a6099) Performing rolling restart of routers of network 
> Ntwk[204|Guest|6]
> 2019-03-19 14:20:39,649 DEBUG [o.a.c.e.o.NetworkOrchestrator] 
> (API-Job-Executor-4:ctx-7b6b69eb job-5093 ctx-9be30648) 
> (logid:265a6099) Asking VirtualRouter to implemenet Ntwk[204|Guest|6]
> 2019-03-19 14:20:39,658 ERROR [c.c.a.ApiAsyncJobDispatcher] 
> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Unexpected 
> exception while executing 
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd
> com.cloud.exception.ResourceUnavailableException: Resource 
> [DataCenter:1] is unreachable: Can't find all necessary running routers!
> at
> com.cloud.network.element.VirtualRouterElement.implement(VirtualRouterElement.java:243)
> at
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.implementNetworkElements(NetworkOrchestrator.java:1203)
> at
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.rollingRestartRouters(NetworkOrchestrator.java:2948)
> at
> org.apache.cloudstack.engine.orchestration.NetworkOrchestrator.restartNetwork(NetworkOrchestrator.java:2853)
> at
> com.cloud.network.NetworkServiceImpl.restartNetwork(NetworkServiceImpl.java:1883)
> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
> at java.lang.reflect.Method.invoke(Unknown Source)
> at
> org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:338)
> at
> org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:197)
> at
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
> at
> org.apache.cloudstack.network.contrail.management.EventUtils$EventInterceptor.invoke(EventUtils.java:107)
> at
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> at
> com.cloud.event.ActionEventInterceptor.invoke(ActionEventInterceptor.java:51)
> at
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:174)
> at
> org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:92)
> at
> org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:185)
> at
> org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:212)
> at com.sun.proxy.$Proxy229.restartNetwork(Unknown Source)
> at
> org.apache.cloudstack.api.command.user.network.RestartNetworkCmd.execute(RestartNetworkCmd.java:99)
> at com.cloud.api.ApiDispatcher.dispatch(ApiDispatcher.java:150)
> at
> com.cloud.api.ApiAsyncJobDispatcher.runJob(ApiAsyncJobDispatcher.java:108)
> at
> org.apache.cloudstack.framework.jobs.impl.AsyncJobManagerImpl$5.runInContext(AsyncJobManagerImpl.java:581)
> at
> org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:49)
> at
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:56)
> at
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:103)
> at
> org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:53)
> at
> org.apache.cloudstack.managed.context.ManagedContextRunnable.run(M

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

ThreadPoolExecutor$Worker.run(Unknown
> Source)
> at java.lang.Thread.run(Unknown Source)
> 2019-03-19 14:20:39,659 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-4:ctx-7b6b69eb job-5093) (logid:265a6099) Complete async
> job-5093, jobStatus: FAILED, resultCode: 530, result:
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Resource
> [DataCenter:1] is un
>
> On Tue, Mar 19, 2019 at 4:19 PM Andrija Panic 
> wrote:
>
>> 
>> Your network can't be deleted due to "Can't delete the network, not all
>> user vms are expunged. Vm
>> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
>>
>> You should be able to just start the user VM - but if you have actually
>> delete the VR itself, then just do Network restart with "cleanup" and it
>> will recreate a new VR, after which you should be able to start the VM.
>>
>> Andrija
>>
>> andrija.pa...@shapeblue.com
>> www.shapeblue.com
>> Amadeus House, Floral Street, London  WC2E 9DPUK
>> @shapeblue
>>
>>
>>
>>
>> -Original Message-
>> From: Jevgeni Zolotarjov 
>> Sent: 19 March 2019 15:10
>> To: users@cloudstack.apache.org
>> Subject: Re: Disaster after maintenance
>>
>> I mean I cannot delete network: In the management server log I see
>>
>> ==
>> 019-03-19 14:06:36,316 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Executing
>> AsyncJobVO {id:5090, userId: 2, accountId: 2, instanceType: Network,
>> instanceId: 204, cmd:
>> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
>>
>> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2641","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
>>
>> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1553004396247"},
>> cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
>> result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
>> null, lastPolled: null, created: null}
>> 2019-03-19 14:06:36,351 WARN  [o.a.c.e.o.NetworkOrchestrator]
>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090 ctx-134954fa) (logid:c734edfc)
>> Can't delete the network, not all user vms are expunged. Vm
>> VM[User|i-2-11-VM] is in Stopped state
>> 2019-03-19 14:06:36,356 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
>> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Complete async
>> job-5090, jobStatus: FAILED, resultCode: 530, result:
>>
>> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Failed
>> to delete network"}
>> ==
>>
>>
>> I deleted a router, expecting it to be recreated on deleting network. But
>> I am unable to delete network because of above error
>>
>> On Tue, Mar 19, 2019 at 3:58 PM Jevgeni Zolotarjov <
>> j.zolotar...@gmail.com>
>> wrote:
>>
>> > I've managed to make libvirtd running
>> > Now cloudstack console shows both hosts - running
>> >
>> > But now as I have removed network, VMs are unable to start.
>> >
>> > How can I recreate the network now?
>> >
>> > On Tue, Mar 19, 2019 at 3:14 PM Ivan Kudryavtsev
>> > 
>> > wrote:
>> >
>> >> Jevgeniy, it may be a documentation bug. Take s look:
>> >> https://github.com/apache/cloudstack-documentation/pull/27/files
>> >>
>> >> вт, 19 мар. 2019 г., 9:09 Jevgeni Zolotarjov :
>> >>
>> >> > That's it - libvirtd failed to start on second host.
>> >> > Tried restarting, but it does not start.
>> >> >
>> >> >
>> >> > >> Do you have some NUMA constraints or anything which requires
>> >> particular
>> >> > RAM configuration?
>> >> > No
>> >> >
>> >> >  libvirtd.service - Virtualization daemon
>> >> >Loaded: loaded (/usr/lib/systemd/system/libvirtd.servic

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

due to "Can't delete the network, not all
> user vms are expunged. Vm
> VM[User|i-2-11-VM] is in Stopped state" - which is fine.
>
> You should be able to just start the user VM - but if you have actually
> delete the VR itself, then just do Network restart with "cleanup" and it
> will recreate a new VR, after which you should be able to start the VM.
>
> Andrija
>
> andrija.pa...@shapeblue.com
> www.shapeblue.com
> Amadeus House, Floral Street, London  WC2E 9DPUK
> @shapeblue
>
>
>
>
> -Original Message-
> From: Jevgeni Zolotarjov 
> Sent: 19 March 2019 15:10
> To: users@cloudstack.apache.org
> Subject: Re: Disaster after maintenance
>
> I mean I cannot delete network: In the management server log I see
>
> ==
> 019-03-19 14:06:36,316 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Executing
> AsyncJobVO {id:5090, userId: 2, accountId: 2, instanceType: Network,
> instanceId: 204, cmd:
> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
>
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2641","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
>
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1553004396247"},
> cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
> null, lastPolled: null, created: null}
> 2019-03-19 14:06:36,351 WARN  [o.a.c.e.o.NetworkOrchestrator]
> (API-Job-Executor-1:ctx-1c0fd4dc job-5090 ctx-134954fa) (logid:c734edfc)
> Can't delete the network, not all user vms are expunged. Vm
> VM[User|i-2-11-VM] is in Stopped state
> 2019-03-19 14:06:36,356 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Complete async
> job-5090, jobStatus: FAILED, resultCode: 530, result:
>
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Failed
> to delete network"}
> ==
>
>
> I deleted a router, expecting it to be recreated on deleting network. But
> I am unable to delete network because of above error
>
> On Tue, Mar 19, 2019 at 3:58 PM Jevgeni Zolotarjov  >
> wrote:
>
> > I've managed to make libvirtd running
> > Now cloudstack console shows both hosts - running
> >
> > But now as I have removed network, VMs are unable to start.
> >
> > How can I recreate the network now?
> >
> > On Tue, Mar 19, 2019 at 3:14 PM Ivan Kudryavtsev
> > 
> > wrote:
> >
> >> Jevgeniy, it may be a documentation bug. Take s look:
> >> https://github.com/apache/cloudstack-documentation/pull/27/files
> >>
> >> вт, 19 мар. 2019 г., 9:09 Jevgeni Zolotarjov :
> >>
> >> > That's it - libvirtd failed to start on second host.
> >> > Tried restarting, but it does not start.
> >> >
> >> >
> >> > >> Do you have some NUMA constraints or anything which requires
> >> particular
> >> > RAM configuration?
> >> > No
> >> >
> >> >  libvirtd.service - Virtualization daemon
> >> >Loaded: loaded (/usr/lib/systemd/system/libvirtd.service;
> >> > enabled; vendor preset: enabled)
> >> >Active: failed (Result: start-limit) since Tue 2019-03-19
> >> > 13:03:07
> >> GMT;
> >> > 12s ago
> >> >  Docs: man:libvirtd(8)
> >> >https://libvirt.org
> >> >   Process: 892 ExecStart=/usr/sbin/libvirtd $LIBVIRTD_ARGS
> >> > (code=exited,
> >> > status=1/FAILURE)
> >> >  Main PID: 892 (code=exited, status=1/FAILURE)
> >> > Tasks: 19 (limit: 32768)
> >> >CGroup: /system.slice/libvirtd.service
> >> >├─11338 /usr/sbin/libvirtd -d -l
> >> >├─11909 /usr/sbin/dnsmasq
> >> > --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
> >> > --dhcp-script=/usr/libexec/libvirt_leaseshelper
> >> >└─11910 /usr/sbin/dnsmasq
> >> > --conf-file

RE: Disaster after maintenance

2019-03-19 Thread Andrija Panic

Your network can't be deleted due to "Can't delete the network, not all user 
vms are expunged. Vm
VM[User|i-2-11-VM] is in Stopped state" - which is fine.

You should be able to just start the user VM - but if you have actually delete 
the VR itself, then just do Network restart with "cleanup" and it will recreate 
a new VR, after which you should be able to start the VM.

Andrija

andrija.pa...@shapeblue.com 
www.shapeblue.com
Amadeus House, Floral Street, London  WC2E 9DPUK
@shapeblue

-Original Message-
From: Jevgeni Zolotarjov  
Sent: 19 March 2019 15:10
To: users@cloudstack.apache.org
Subject: Re: Disaster after maintenance

I mean I cannot delete network: In the management server log I see

==
019-03-19 14:06:36,316 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] 
(API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Executing 
AsyncJobVO {id:5090, userId: 2, accountId: 2, instanceType: Network,
instanceId: 204, cmd:
org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
{"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2641","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1553004396247"},
cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
null, lastPolled: null, created: null}
2019-03-19 14:06:36,351 WARN  [o.a.c.e.o.NetworkOrchestrator] 
(API-Job-Executor-1:ctx-1c0fd4dc job-5090 ctx-134954fa) (logid:c734edfc) Can't 
delete the network, not all user vms are expunged. Vm VM[User|i-2-11-VM] is in 
Stopped state
2019-03-19 14:06:36,356 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] 
(API-Job-Executor-1:ctx-1c0fd4dc job-5090) (logid:c734edfc) Complete async 
job-5090, jobStatus: FAILED, resultCode: 530, result:
org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Failed
to delete network"}
==

I deleted a router, expecting it to be recreated on deleting network. But I am 
unable to delete network because of above error

On Tue, Mar 19, 2019 at 3:58 PM Jevgeni Zolotarjov 
wrote:

> I've managed to make libvirtd running
> Now cloudstack console shows both hosts - running
>
> But now as I have removed network, VMs are unable to start.
>
> How can I recreate the network now?
>
> On Tue, Mar 19, 2019 at 3:14 PM Ivan Kudryavtsev 
> 
> wrote:
>
>> Jevgeniy, it may be a documentation bug. Take s look:
>> https://github.com/apache/cloudstack-documentation/pull/27/files
>>
>> вт, 19 мар. 2019 г., 9:09 Jevgeni Zolotarjov :
>>
>> > That's it - libvirtd failed to start on second host.
>> > Tried restarting, but it does not start.
>> >
>> >
>> > >> Do you have some NUMA constraints or anything which requires
>> particular
>> > RAM configuration?
>> > No
>> >
>> >  libvirtd.service - Virtualization daemon
>> >Loaded: loaded (/usr/lib/systemd/system/libvirtd.service; 
>> > enabled; vendor preset: enabled)
>> >Active: failed (Result: start-limit) since Tue 2019-03-19 
>> > 13:03:07
>> GMT;
>> > 12s ago
>> >  Docs: man:libvirtd(8)
>> >https://libvirt.org
>> >   Process: 892 ExecStart=/usr/sbin/libvirtd $LIBVIRTD_ARGS 
>> > (code=exited,
>> > status=1/FAILURE)
>> >  Main PID: 892 (code=exited, status=1/FAILURE)
>> > Tasks: 19 (limit: 32768)
>> >CGroup: /system.slice/libvirtd.service
>> >├─11338 /usr/sbin/libvirtd -d -l
>> >├─11909 /usr/sbin/dnsmasq 
>> > --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro 
>> > --dhcp-script=/usr/libexec/libvirt_leaseshelper
>> >└─11910 /usr/sbin/dnsmasq 
>> > --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro 
>> > --dhcp-script=/usr/libexec/libvirt_leaseshelper
>> >
>> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to 
>> > start Virtualization daemon.
>> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit 
>> > libvirtd.service entered failed state.
>> > Mar 19 13:03:07 mtl1-apphst04.mt.pb

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]:
>> libvirtd.service
>> > failed.
>> >
>> >
>> > On Tue, Mar 19, 2019 at 3:04 PM Paul Angus 
>> > wrote:
>> >
>> > > Can you check that the cloudstack agent is running on the host and the
>> > > agent logs (usual logs directory)
>> > > Also worth checking that libvirt has started ok.  Do you have some
>> NUMA
>> > > constraints or anything which requires particular RAM configuration?
>> > >
>> > > paul.an...@shapeblue.com
>> > > www.shapeblue.com
>> > > Amadeus House, Floral Street, London  WC2E 9DPUK
>> > > @shapeblue
>> > >
>> > >
>> > >
>> > >
>> > > -Original Message-
>> > > From: Jevgeni Zolotarjov 
>> > > Sent: 19 March 2019 14:49
>> > > To: users@cloudstack.apache.org
>> > > Subject: Re: Disaster after maintenance
>> > >
>> > > Can you try migrating a VM to the server that you changed the RAM
>> amount?
>> > >
>> > > Also:
>> > > What is the hypervisor version?
>> > > KVM
>> > > QEMU Version : 2.0.0
>> > > Release : 1.el7.6
>> > >
>> > >
>> > > Host status in ACS?
>> > > 1st server: Unsecure
>> > > 2nd server: Disconnected
>> > >
>> > > Did you try to force a VM to start/deploy in this server where you
>> > changed
>> > > the RAM?
>> > > Host status became disconnected. I don't know how to make it
>> "connected"
>> > > again
>> > >
>> > >
>> > >
>> > > On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
>> > > rafaelweingart...@gmail.com> wrote:
>> > >
>> > > > Can you try migrating a VM to the server that you changed the RAM
>> > amount?
>> > > >
>> > > > Also:
>> > > > What is the hypervisor version?
>> > > > Host status in ACS?
>> > > > Did you try to force a VM to start/deploy in this server where you
>> > > > changed the RAM?
>> > > >
>> > > >
>> > > > On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov
>> > > > > > > > >
>> > > > wrote:
>> > > >
>> > > > > We have Cloudstack 4.11.2 setup running fine for few months (>4)
>> The
>> > > > > setup is very simple: 2 hosts We decided to do a maintenance to
>> > > > > increase RAM on both servers
>> > > > >
>> > > > > For this we put first server to maintenance. All VMS moved to
>> second
>> > > > > host after a while.
>> > > > >
>> > > > > Then first server was shutdown, RAM increased, server turned ON.
>> > > > > Now nothing starts on first server.
>> > > > >
>> > > > >
>> > > > > Tried to delete network, but this fails as well
>> > > > >
>> > > > > Please help !
>> > > > >
>> > > > > Here is extract from log:
>> > > > > ==
>> > > > > 2019-03-19 12:27:53,064 DEBUG
>> [o.a.c.s.SecondaryStorageManagerImpl]
>> > > > > (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to
>> > > > > launch secondary storage VM
>> > > > > 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
>> > > > > (consoleproxy-1:ctx-cbd034b9) (logid:0a8c8bf4) Zone 1 is ready to
>> > > > > launch console proxy
>> > > > > 2019-03-19 12:27:53,181 DEBUG [c.c.a.ApiServlet]
>> > > > > (qtp510113906-285:ctx-6c5e11c3) (logid:cd8e30be) ===START===
>> > > > 192.168.5.140
>> > > > > -- GET
>> > > > >
>> > > > >
>> > > >
>> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1
>> > > > =json&_=1552998473154
>> > > > > 2019-03-19 12:27:53,186 DEBUG [c.c.a.ApiServer]
>> > > > > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be)
>> CIDRs
>> > > > > from which account
>> > > > > 'Acct[15863393-8e8d-11e7-8f52-f04da2002bbe-admin]' is
>> > > > allowed
>> > > > > to perform API c

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

I've managed to make libvirtd running
Now cloudstack console shows both hosts - running

But now as I have removed network, VMs are unable to start.

How can I recreate the network now?

On Tue, Mar 19, 2019 at 3:14 PM Ivan Kudryavtsev 
wrote:

> Jevgeniy, it may be a documentation bug. Take s look:
> https://github.com/apache/cloudstack-documentation/pull/27/files
>
> вт, 19 мар. 2019 г., 9:09 Jevgeni Zolotarjov :
>
> > That's it - libvirtd failed to start on second host.
> > Tried restarting, but it does not start.
> >
> >
> > >> Do you have some NUMA constraints or anything which requires
> particular
> > RAM configuration?
> > No
> >
> >  libvirtd.service - Virtualization daemon
> >Loaded: loaded (/usr/lib/systemd/system/libvirtd.service; enabled;
> > vendor preset: enabled)
> >Active: failed (Result: start-limit) since Tue 2019-03-19 13:03:07
> GMT;
> > 12s ago
> >  Docs: man:libvirtd(8)
> >https://libvirt.org
> >   Process: 892 ExecStart=/usr/sbin/libvirtd $LIBVIRTD_ARGS (code=exited,
> > status=1/FAILURE)
> >  Main PID: 892 (code=exited, status=1/FAILURE)
> > Tasks: 19 (limit: 32768)
> >CGroup: /system.slice/libvirtd.service
> >├─11338 /usr/sbin/libvirtd -d -l
> >├─11909 /usr/sbin/dnsmasq
> > --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
> > --dhcp-script=/usr/libexec/libvirt_leaseshelper
> >└─11910 /usr/sbin/dnsmasq
> > --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
> > --dhcp-script=/usr/libexec/libvirt_leaseshelper
> >
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start
> > Virtualization daemon.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit
> > libvirtd.service entered failed state.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
> > failed.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
> > holdoff time over, scheduling restart.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Stopped
> > Virtualization daemon.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: start request
> > repeated too quickly for libvirtd.service
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start
> > Virtualization daemon.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit
> > libvirtd.service entered failed state.
> > Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
> > failed.
> >
> >
> > On Tue, Mar 19, 2019 at 3:04 PM Paul Angus 
> > wrote:
> >
> > > Can you check that the cloudstack agent is running on the host and the
> > > agent logs (usual logs directory)
> > > Also worth checking that libvirt has started ok.  Do you have some NUMA
> > > constraints or anything which requires particular RAM configuration?
> > >
> > > paul.an...@shapeblue.com
> > > www.shapeblue.com
> > > Amadeus House, Floral Street, London  WC2E 9DPUK
> > > @shapeblue
> > >
> > >
> > >
> > >
> > > -Original Message-
> > > From: Jevgeni Zolotarjov 
> > > Sent: 19 March 2019 14:49
> > > To: users@cloudstack.apache.org
> > > Subject: Re: Disaster after maintenance
> > >
> > > Can you try migrating a VM to the server that you changed the RAM
> amount?
> > >
> > > Also:
> > > What is the hypervisor version?
> > > KVM
> > > QEMU Version : 2.0.0
> > > Release : 1.el7.6
> > >
> > >
> > > Host status in ACS?
> > > 1st server: Unsecure
> > > 2nd server: Disconnected
> > >
> > > Did you try to force a VM to start/deploy in this server where you
> > changed
> > > the RAM?
> > > Host status became disconnected. I don't know how to make it
> "connected"
> > > again
> > >
> > >
> > >
> > > On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
> > > rafaelweingart...@gmail.com> wrote:
> > >
> > > > Can you try migrating a VM to the server that you changed the RAM
> > amount?
> > > >
> > > > Also:
> > > > What is the hypervisor version?
> > > > Host status in ACS?
> > > > Did you try to force a VM to start/deploy in this server where you
> > > > changed the RAM?
> > > >
> > > >
> > > > On Tue, Mar 19, 2019 at 9:3

RE: Disaster after maintenance

2019-03-19 Thread Paul Angus

Libvirtd has its own logs, so you'll need to look at those I'm afraid.

paul.an...@shapeblue.com 
www.shapeblue.com
Amadeus House, Floral Street, London  WC2E 9DPUK
@shapeblue
  
 


-Original Message-
From: Jevgeni Zolotarjov  
Sent: 19 March 2019 15:09
To: users@cloudstack.apache.org
Subject: Re: Disaster after maintenance

That's it - libvirtd failed to start on second host.
Tried restarting, but it does not start.


>> Do you have some NUMA constraints or anything which requires 
>> particular
RAM configuration?
No

 libvirtd.service - Virtualization daemon
   Loaded: loaded (/usr/lib/systemd/system/libvirtd.service; enabled; vendor 
preset: enabled)
   Active: failed (Result: start-limit) since Tue 2019-03-19 13:03:07 GMT; 12s 
ago
 Docs: man:libvirtd(8)
   https://libvirt.org
  Process: 892 ExecStart=/usr/sbin/libvirtd $LIBVIRTD_ARGS (code=exited,
status=1/FAILURE)
 Main PID: 892 (code=exited, status=1/FAILURE)
Tasks: 19 (limit: 32768)
   CGroup: /system.slice/libvirtd.service
   ├─11338 /usr/sbin/libvirtd -d -l
   ├─11909 /usr/sbin/dnsmasq
--conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro 
--dhcp-script=/usr/libexec/libvirt_leaseshelper
   └─11910 /usr/sbin/dnsmasq
--conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro 
--dhcp-script=/usr/libexec/libvirt_leaseshelper

Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start 
Virtualization daemon.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit libvirtd.service 
entered failed state.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service failed.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service 
holdoff time over, scheduling restart.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Stopped Virtualization 
daemon.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: start request repeated 
too quickly for libvirtd.service Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt 
systemd[1]: Failed to start Virtualization daemon.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit libvirtd.service 
entered failed state.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service failed.


On Tue, Mar 19, 2019 at 3:04 PM Paul Angus  wrote:

> Can you check that the cloudstack agent is running on the host and the 
> agent logs (usual logs directory) Also worth checking that libvirt has 
> started ok.  Do you have some NUMA constraints or anything which 
> requires particular RAM configuration?
>
> paul.an...@shapeblue.com
> www.shapeblue.com
> Amadeus House, Floral Street, London  WC2E 9DPUK @shapeblue
>
>
>
>
> -Original Message-
> From: Jevgeni Zolotarjov 
> Sent: 19 March 2019 14:49
> To: users@cloudstack.apache.org
> Subject: Re: Disaster after maintenance
>
> Can you try migrating a VM to the server that you changed the RAM amount?
>
> Also:
> What is the hypervisor version?
> KVM
> QEMU Version : 2.0.0
> Release : 1.el7.6
>
>
> Host status in ACS?
> 1st server: Unsecure
> 2nd server: Disconnected
>
> Did you try to force a VM to start/deploy in this server where you changed
> the RAM?
> Host status became disconnected. I don't know how to make it "connected"
> again
>
>
>
> On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
> rafaelweingart...@gmail.com> wrote:
>
> > Can you try migrating a VM to the server that you changed the RAM amount?
> >
> > Also:
> > What is the hypervisor version?
> > Host status in ACS?
> > Did you try to force a VM to start/deploy in this server where you
> > changed the RAM?
> >
> >
> > On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov
> >  > >
> > wrote:
> >
> > > We have Cloudstack 4.11.2 setup running fine for few months (>4) The
> > > setup is very simple: 2 hosts We decided to do a maintenance to
> > > increase RAM on both servers
> > >
> > > For this we put first server to maintenance. All VMS moved to second
> > > host after a while.
> > >
> > > Then first server was shutdown, RAM increased, server turned ON.
> > > Now nothing starts on first server.
> > >
> > >
> > > Tried to delete network, but this fails as well
> > >
> > > Please help !
> > >
> > > Here is extract from log:
> > > ==
> > > 2019-03-19 12:27:53,064 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > > (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to
> > > launch secondary storage VM
> > > 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
> > > (consoleproxy-1:ctx-cbd034b9) (logid:0

Re: Disaster after maintenance

2019-03-19 Thread Ivan Kudryavtsev

Jevgeniy, it may be a documentation bug. Take s look:
https://github.com/apache/cloudstack-documentation/pull/27/files

вт, 19 мар. 2019 г., 9:09 Jevgeni Zolotarjov :

> That's it - libvirtd failed to start on second host.
> Tried restarting, but it does not start.
>
>
> >> Do you have some NUMA constraints or anything which requires particular
> RAM configuration?
> No
>
>  libvirtd.service - Virtualization daemon
>Loaded: loaded (/usr/lib/systemd/system/libvirtd.service; enabled;
> vendor preset: enabled)
>Active: failed (Result: start-limit) since Tue 2019-03-19 13:03:07 GMT;
> 12s ago
>  Docs: man:libvirtd(8)
>https://libvirt.org
>   Process: 892 ExecStart=/usr/sbin/libvirtd $LIBVIRTD_ARGS (code=exited,
> status=1/FAILURE)
>  Main PID: 892 (code=exited, status=1/FAILURE)
> Tasks: 19 (limit: 32768)
>CGroup: /system.slice/libvirtd.service
>├─11338 /usr/sbin/libvirtd -d -l
>├─11909 /usr/sbin/dnsmasq
> --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
> --dhcp-script=/usr/libexec/libvirt_leaseshelper
>└─11910 /usr/sbin/dnsmasq
> --conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
> --dhcp-script=/usr/libexec/libvirt_leaseshelper
>
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start
> Virtualization daemon.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit
> libvirtd.service entered failed state.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
> failed.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
> holdoff time over, scheduling restart.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Stopped
> Virtualization daemon.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: start request
> repeated too quickly for libvirtd.service
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start
> Virtualization daemon.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit
> libvirtd.service entered failed state.
> Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
> failed.
>
>
> On Tue, Mar 19, 2019 at 3:04 PM Paul Angus 
> wrote:
>
> > Can you check that the cloudstack agent is running on the host and the
> > agent logs (usual logs directory)
> > Also worth checking that libvirt has started ok.  Do you have some NUMA
> > constraints or anything which requires particular RAM configuration?
> >
> > paul.an...@shapeblue.com
> > www.shapeblue.com
> > Amadeus House, Floral Street, London  WC2E 9DPUK
> > @shapeblue
> >
> >
> >
> >
> > -Original Message-
> > From: Jevgeni Zolotarjov 
> > Sent: 19 March 2019 14:49
> > To: users@cloudstack.apache.org
> > Subject: Re: Disaster after maintenance
> >
> > Can you try migrating a VM to the server that you changed the RAM amount?
> >
> > Also:
> > What is the hypervisor version?
> > KVM
> > QEMU Version : 2.0.0
> > Release : 1.el7.6
> >
> >
> > Host status in ACS?
> > 1st server: Unsecure
> > 2nd server: Disconnected
> >
> > Did you try to force a VM to start/deploy in this server where you
> changed
> > the RAM?
> > Host status became disconnected. I don't know how to make it "connected"
> > again
> >
> >
> >
> > On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
> > rafaelweingart...@gmail.com> wrote:
> >
> > > Can you try migrating a VM to the server that you changed the RAM
> amount?
> > >
> > > Also:
> > > What is the hypervisor version?
> > > Host status in ACS?
> > > Did you try to force a VM to start/deploy in this server where you
> > > changed the RAM?
> > >
> > >
> > > On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov
> > >  > > >
> > > wrote:
> > >
> > > > We have Cloudstack 4.11.2 setup running fine for few months (>4) The
> > > > setup is very simple: 2 hosts We decided to do a maintenance to
> > > > increase RAM on both servers
> > > >
> > > > For this we put first server to maintenance. All VMS moved to second
> > > > host after a while.
> > > >
> > > > Then first server was shutdown, RAM increased, server turned ON.
> > > > Now nothing starts on first server.
> > > >
> > > >
> > > > Tried to delete network, but this fails as well
> > > >
> > > > Please help !
> > > >
>

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

That's it - libvirtd failed to start on second host.
Tried restarting, but it does not start.


>> Do you have some NUMA constraints or anything which requires particular
RAM configuration?
No

 libvirtd.service - Virtualization daemon
   Loaded: loaded (/usr/lib/systemd/system/libvirtd.service; enabled;
vendor preset: enabled)
   Active: failed (Result: start-limit) since Tue 2019-03-19 13:03:07 GMT;
12s ago
 Docs: man:libvirtd(8)
   https://libvirt.org
  Process: 892 ExecStart=/usr/sbin/libvirtd $LIBVIRTD_ARGS (code=exited,
status=1/FAILURE)
 Main PID: 892 (code=exited, status=1/FAILURE)
Tasks: 19 (limit: 32768)
   CGroup: /system.slice/libvirtd.service
   ├─11338 /usr/sbin/libvirtd -d -l
   ├─11909 /usr/sbin/dnsmasq
--conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
--dhcp-script=/usr/libexec/libvirt_leaseshelper
   └─11910 /usr/sbin/dnsmasq
--conf-file=/var/lib/libvirt/dnsmasq/default.conf --leasefile-ro
--dhcp-script=/usr/libexec/libvirt_leaseshelper

Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start
Virtualization daemon.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit
libvirtd.service entered failed state.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
failed.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
holdoff time over, scheduling restart.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Stopped
Virtualization daemon.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: start request
repeated too quickly for libvirtd.service
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Failed to start
Virtualization daemon.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: Unit
libvirtd.service entered failed state.
Mar 19 13:03:07 mtl1-apphst04.mt.pbt.com.mt systemd[1]: libvirtd.service
failed.


On Tue, Mar 19, 2019 at 3:04 PM Paul Angus  wrote:

> Can you check that the cloudstack agent is running on the host and the
> agent logs (usual logs directory)
> Also worth checking that libvirt has started ok.  Do you have some NUMA
> constraints or anything which requires particular RAM configuration?
>
> paul.an...@shapeblue.com
> www.shapeblue.com
> Amadeus House, Floral Street, London  WC2E 9DPUK
> @shapeblue
>
>
>
>
> -Original Message-
> From: Jevgeni Zolotarjov 
> Sent: 19 March 2019 14:49
> To: users@cloudstack.apache.org
> Subject: Re: Disaster after maintenance
>
> Can you try migrating a VM to the server that you changed the RAM amount?
>
> Also:
> What is the hypervisor version?
> KVM
> QEMU Version : 2.0.0
> Release : 1.el7.6
>
>
> Host status in ACS?
> 1st server: Unsecure
> 2nd server: Disconnected
>
> Did you try to force a VM to start/deploy in this server where you changed
> the RAM?
> Host status became disconnected. I don't know how to make it "connected"
> again
>
>
>
> On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
> rafaelweingart...@gmail.com> wrote:
>
> > Can you try migrating a VM to the server that you changed the RAM amount?
> >
> > Also:
> > What is the hypervisor version?
> > Host status in ACS?
> > Did you try to force a VM to start/deploy in this server where you
> > changed the RAM?
> >
> >
> > On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov
> >  > >
> > wrote:
> >
> > > We have Cloudstack 4.11.2 setup running fine for few months (>4) The
> > > setup is very simple: 2 hosts We decided to do a maintenance to
> > > increase RAM on both servers
> > >
> > > For this we put first server to maintenance. All VMS moved to second
> > > host after a while.
> > >
> > > Then first server was shutdown, RAM increased, server turned ON.
> > > Now nothing starts on first server.
> > >
> > >
> > > Tried to delete network, but this fails as well
> > >
> > > Please help !
> > >
> > > Here is extract from log:
> > > ==
> > > 2019-03-19 12:27:53,064 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > > (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to
> > > launch secondary storage VM
> > > 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
> > > (consoleproxy-1:ctx-cbd034b9) (logid:0a8c8bf4) Zone 1 is ready to
> > > launch console proxy
> > > 2019-03-19 12:27:53,181 DEBUG [c.c.a.ApiServlet]
> > > (qtp510113906-285:ctx-6c5e11c3) (logid:cd8e30be) ===START===
> > 192.168.5.140
> > > -- GET
> > >
> > >
> > command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1
&

RE: Disaster after maintenance

2019-03-19 Thread Paul Angus

Can you check that the cloudstack agent is running on the host and the agent 
logs (usual logs directory)
Also worth checking that libvirt has started ok.  Do you have some NUMA 
constraints or anything which requires particular RAM configuration?

paul.an...@shapeblue.com 
www.shapeblue.com
Amadeus House, Floral Street, London  WC2E 9DPUK
@shapeblue
  
 


-Original Message-
From: Jevgeni Zolotarjov  
Sent: 19 March 2019 14:49
To: users@cloudstack.apache.org
Subject: Re: Disaster after maintenance

Can you try migrating a VM to the server that you changed the RAM amount?

Also:
What is the hypervisor version?
KVM
QEMU Version : 2.0.0
Release : 1.el7.6


Host status in ACS?
1st server: Unsecure
2nd server: Disconnected

Did you try to force a VM to start/deploy in this server where you changed the 
RAM?
Host status became disconnected. I don't know how to make it "connected"
again



On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner < 
rafaelweingart...@gmail.com> wrote:

> Can you try migrating a VM to the server that you changed the RAM amount?
>
> Also:
> What is the hypervisor version?
> Host status in ACS?
> Did you try to force a VM to start/deploy in this server where you 
> changed the RAM?
>
>
> On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov 
>  >
> wrote:
>
> > We have Cloudstack 4.11.2 setup running fine for few months (>4) The 
> > setup is very simple: 2 hosts We decided to do a maintenance to 
> > increase RAM on both servers
> >
> > For this we put first server to maintenance. All VMS moved to second 
> > host after a while.
> >
> > Then first server was shutdown, RAM increased, server turned ON.
> > Now nothing starts on first server.
> >
> >
> > Tried to delete network, but this fails as well
> >
> > Please help !
> >
> > Here is extract from log:
> > ==
> > 2019-03-19 12:27:53,064 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to 
> > launch secondary storage VM
> > 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
> > (consoleproxy-1:ctx-cbd034b9) (logid:0a8c8bf4) Zone 1 is ready to 
> > launch console proxy
> > 2019-03-19 12:27:53,181 DEBUG [c.c.a.ApiServlet]
> > (qtp510113906-285:ctx-6c5e11c3) (logid:cd8e30be) ===START===
> 192.168.5.140
> > -- GET
> >
> >
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1
> =json&_=1552998473154
> > 2019-03-19 12:27:53,186 DEBUG [c.c.a.ApiServer]
> > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) CIDRs 
> > from which account 
> > 'Acct[15863393-8e8d-11e7-8f52-f04da2002bbe-admin]' is
> allowed
> > to perform API calls: 0.0.0.0/0,::/0
> > 2019-03-19 12:27:53,208 INFO  [o.a.c.f.j.i.AsyncJobMonitor]
> > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:f6751fa7) Add 
> > job-5081 into job monitoring
> > 2019-03-19 12:27:53,209 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) submit
> async
> > job-5081, details: AsyncJobVO {id:5081, userId: 2, accountId: 2,
> > instanceType: Network, instanceId: 204, cmd:
> > org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
> >
> >
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId
> ":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"
> interface
> >
> >
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}"
> ,"ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmd
> EventType":"NETWORK.DELETE","_":"1552998473154"},
> > cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> > result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
> > null, lastPolled: null, created: null}
> > 2019-03-19 12:27:53,211 DEBUG [c.c.a.ApiServlet]
> > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) 
> > ===END===
> > 192.168.5.140 -- GET
> >
> >
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1
> =json&_=1552998473154
> > 2019-03-19 12:27:53,212 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) 
> > Executing AsyncJobVO {id:5081, userId: 2, accountId: 2, 
> > instanceType: Network,
> > instanceId: 204, cmd:
> > org.apache.cloudstack.api.command.user.

Re: Disaster after maintenance

2019-03-19 Thread Rafael Weingärtner

that is why nothing deploys there. You need to connect this host to ACS.
otherwise, it will just be ignored. Did you check the log files in the
agent (in the host)? And, of course, in ACS?

On Tue, Mar 19, 2019 at 9:49 AM Jevgeni Zolotarjov 
wrote:

> Can you try migrating a VM to the server that you changed the RAM amount?
>
> Also:
> What is the hypervisor version?
> KVM
> QEMU Version : 2.0.0
> Release : 1.el7.6
>
>
> Host status in ACS?
> 1st server: Unsecure
> 2nd server: Disconnected
>
> Did you try to force a VM to start/deploy in this server where you changed
> the RAM?
> Host status became disconnected. I don't know how to make it "connected"
> again
>
>
>
> On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
> rafaelweingart...@gmail.com> wrote:
>
> > Can you try migrating a VM to the server that you changed the RAM amount?
> >
> > Also:
> > What is the hypervisor version?
> > Host status in ACS?
> > Did you try to force a VM to start/deploy in this server where you
> changed
> > the RAM?
> >
> >
> > On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov <
> j.zolotar...@gmail.com
> > >
> > wrote:
> >
> > > We have Cloudstack 4.11.2 setup running fine for few months (>4)
> > > The setup is very simple: 2 hosts
> > > We decided to do a maintenance to increase RAM on both servers
> > >
> > > For this we put first server to maintenance. All VMS moved to second
> host
> > > after a while.
> > >
> > > Then first server was shutdown, RAM increased, server turned ON.
> > > Now nothing starts on first server.
> > >
> > >
> > > Tried to delete network, but this fails as well
> > >
> > > Please help !
> > >
> > > Here is extract from log:
> > > ==
> > > 2019-03-19 12:27:53,064 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > > (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to launch
> > > secondary storage VM
> > > 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
> > > (consoleproxy-1:ctx-cbd034b9) (logid:0a8c8bf4) Zone 1 is ready to
> launch
> > > console proxy
> > > 2019-03-19 12:27:53,181 DEBUG [c.c.a.ApiServlet]
> > > (qtp510113906-285:ctx-6c5e11c3) (logid:cd8e30be) ===START===
> > 192.168.5.140
> > > -- GET
> > >
> > >
> >
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1=json&_=1552998473154
> > > 2019-03-19 12:27:53,186 DEBUG [c.c.a.ApiServer]
> > > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) CIDRs
> from
> > > which account 'Acct[15863393-8e8d-11e7-8f52-f04da2002bbe-admin]' is
> > allowed
> > > to perform API calls: 0.0.0.0/0,::/0
> > > 2019-03-19 12:27:53,208 INFO  [o.a.c.f.j.i.AsyncJobMonitor]
> > > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:f6751fa7) Add
> job-5081
> > > into job monitoring
> > > 2019-03-19 12:27:53,209 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) submit
> > async
> > > job-5081, details: AsyncJobVO {id:5081, userId: 2, accountId: 2,
> > > instanceType: Network, instanceId: 204, cmd:
> > > org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd,
> cmdInfo:
> > >
> > >
> >
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
> > >
> > >
> >
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1552998473154"},
> > > cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> > > result: null, initMsid: 264216221068220, completeMsid: null,
> lastUpdated:
> > > null, lastPolled: null, created: null}
> > > 2019-03-19 12:27:53,211 DEBUG [c.c.a.ApiServlet]
> > > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) ===END===
> > > 192.168.5.140 -- GET
> > >
> > >
> >
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1=json&_=1552998473154
> > > 2019-03-19 12:27:53,212 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Executing
> > > AsyncJobVO {id:5081, userId: 2, accountId: 2, instanceType: Network,
> > > instanceId: 204, cmd:
> > > org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd,
> cmdInfo:
> > >
> > >
> >
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
> > >
> > >
> >
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1552998473154"},
> > > cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> > > result: null, initMsid: 264216221068220, completeMsid: null,
> lastUpdated:
> > > null, lastPolled: null, created: null}
> > > 2019-03-19 12:27:53,257 WARN  [o.a.c.e.o.NetworkOrchestrator]
> > > (API-Job-Executor-1:ctx-d4970c19 job-5081 ctx-d5de7979)
>

Re: Disaster after maintenance

2019-03-19 Thread Jevgeni Zolotarjov

Can you try migrating a VM to the server that you changed the RAM amount?

Also:
What is the hypervisor version?
KVM
QEMU Version : 2.0.0
Release : 1.el7.6


Host status in ACS?
1st server: Unsecure
2nd server: Disconnected

Did you try to force a VM to start/deploy in this server where you changed
the RAM?
Host status became disconnected. I don't know how to make it "connected"
again



On Tue, Mar 19, 2019 at 2:42 PM Rafael Weingärtner <
rafaelweingart...@gmail.com> wrote:

> Can you try migrating a VM to the server that you changed the RAM amount?
>
> Also:
> What is the hypervisor version?
> Host status in ACS?
> Did you try to force a VM to start/deploy in this server where you changed
> the RAM?
>
>
> On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov  >
> wrote:
>
> > We have Cloudstack 4.11.2 setup running fine for few months (>4)
> > The setup is very simple: 2 hosts
> > We decided to do a maintenance to increase RAM on both servers
> >
> > For this we put first server to maintenance. All VMS moved to second host
> > after a while.
> >
> > Then first server was shutdown, RAM increased, server turned ON.
> > Now nothing starts on first server.
> >
> >
> > Tried to delete network, but this fails as well
> >
> > Please help !
> >
> > Here is extract from log:
> > ==
> > 2019-03-19 12:27:53,064 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> > (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to launch
> > secondary storage VM
> > 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
> > (consoleproxy-1:ctx-cbd034b9) (logid:0a8c8bf4) Zone 1 is ready to launch
> > console proxy
> > 2019-03-19 12:27:53,181 DEBUG [c.c.a.ApiServlet]
> > (qtp510113906-285:ctx-6c5e11c3) (logid:cd8e30be) ===START===
> 192.168.5.140
> > -- GET
> >
> >
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1=json&_=1552998473154
> > 2019-03-19 12:27:53,186 DEBUG [c.c.a.ApiServer]
> > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) CIDRs from
> > which account 'Acct[15863393-8e8d-11e7-8f52-f04da2002bbe-admin]' is
> allowed
> > to perform API calls: 0.0.0.0/0,::/0
> > 2019-03-19 12:27:53,208 INFO  [o.a.c.f.j.i.AsyncJobMonitor]
> > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:f6751fa7) Add job-5081
> > into job monitoring
> > 2019-03-19 12:27:53,209 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) submit
> async
> > job-5081, details: AsyncJobVO {id:5081, userId: 2, accountId: 2,
> > instanceType: Network, instanceId: 204, cmd:
> > org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
> >
> >
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
> >
> >
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1552998473154"},
> > cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> > result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
> > null, lastPolled: null, created: null}
> > 2019-03-19 12:27:53,211 DEBUG [c.c.a.ApiServlet]
> > (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) ===END===
> > 192.168.5.140 -- GET
> >
> >
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1=json&_=1552998473154
> > 2019-03-19 12:27:53,212 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Executing
> > AsyncJobVO {id:5081, userId: 2, accountId: 2, instanceType: Network,
> > instanceId: 204, cmd:
> > org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
> >
> >
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
> >
> >
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1552998473154"},
> > cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> > result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
> > null, lastPolled: null, created: null}
> > 2019-03-19 12:27:53,257 WARN  [o.a.c.e.o.NetworkOrchestrator]
> > (API-Job-Executor-1:ctx-d4970c19 job-5081 ctx-d5de7979) (logid:16897ea6)
> > Can't delete the network, not all user vms are expunged. Vm
> > VM[User|i-2-11-VM] is in Stopped state
> > 2019-03-19 12:27:53,263 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> > (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Complete
> async
> > job-5081, jobStatus: FAILED, resultCode: 530, result:
> >
> >
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Failed
> > to delete network"}
> > 2019-03-19 12:27:53,264 DEBUG

Re: Disaster after maintenance

2019-03-19 Thread Rafael Weingärtner

Can you try migrating a VM to the server that you changed the RAM amount?

Also:
What is the hypervisor version?
Host status in ACS?
Did you try to force a VM to start/deploy in this server where you changed
the RAM?


On Tue, Mar 19, 2019 at 9:39 AM Jevgeni Zolotarjov 
wrote:

> We have Cloudstack 4.11.2 setup running fine for few months (>4)
> The setup is very simple: 2 hosts
> We decided to do a maintenance to increase RAM on both servers
>
> For this we put first server to maintenance. All VMS moved to second host
> after a while.
>
> Then first server was shutdown, RAM increased, server turned ON.
> Now nothing starts on first server.
>
>
> Tried to delete network, but this fails as well
>
> Please help !
>
> Here is extract from log:
> ==
> 2019-03-19 12:27:53,064 DEBUG [o.a.c.s.SecondaryStorageManagerImpl]
> (secstorage-1:ctx-16d6c797) (logid:7e3160ce) Zone 1 is ready to launch
> secondary storage VM
> 2019-03-19 12:27:53,125 DEBUG [c.c.c.ConsoleProxyManagerImpl]
> (consoleproxy-1:ctx-cbd034b9) (logid:0a8c8bf4) Zone 1 is ready to launch
> console proxy
> 2019-03-19 12:27:53,181 DEBUG [c.c.a.ApiServlet]
> (qtp510113906-285:ctx-6c5e11c3) (logid:cd8e30be) ===START===  192.168.5.140
> -- GET
>
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1=json&_=1552998473154
> 2019-03-19 12:27:53,186 DEBUG [c.c.a.ApiServer]
> (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) CIDRs from
> which account 'Acct[15863393-8e8d-11e7-8f52-f04da2002bbe-admin]' is allowed
> to perform API calls: 0.0.0.0/0,::/0
> 2019-03-19 12:27:53,208 INFO  [o.a.c.f.j.i.AsyncJobMonitor]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:f6751fa7) Add job-5081
> into job monitoring
> 2019-03-19 12:27:53,209 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) submit async
> job-5081, details: AsyncJobVO {id:5081, userId: 2, accountId: 2,
> instanceType: Network, instanceId: 204, cmd:
> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
>
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
>
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1552998473154"},
> cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
> null, lastPolled: null, created: null}
> 2019-03-19 12:27:53,211 DEBUG [c.c.a.ApiServlet]
> (qtp510113906-285:ctx-6c5e11c3 ctx-0cc34dc6) (logid:cd8e30be) ===END===
> 192.168.5.140 -- GET
>
> command=deleteNetwork=4ba834ed-48f3-468f-b667-9bb2d2c258f1=json&_=1552998473154
> 2019-03-19 12:27:53,212 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Executing
> AsyncJobVO {id:5081, userId: 2, accountId: 2, instanceType: Network,
> instanceId: 204, cmd:
> org.apache.cloudstack.api.command.user.network.DeleteNetworkCmd, cmdInfo:
>
> {"response":"json","ctxUserId":"2","httpmethod":"GET","ctxStartEventId":"2615","id":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","ctxDetails":"{\"interface
>
> com.cloud.network.Network\":\"4ba834ed-48f3-468f-b667-9bb2d2c258f1\"}","ctxAccountId":"2","uuid":"4ba834ed-48f3-468f-b667-9bb2d2c258f1","cmdEventType":"NETWORK.DELETE","_":"1552998473154"},
> cmdVersion: 0, status: IN_PROGRESS, processStatus: 0, resultCode: 0,
> result: null, initMsid: 264216221068220, completeMsid: null, lastUpdated:
> null, lastPolled: null, created: null}
> 2019-03-19 12:27:53,257 WARN  [o.a.c.e.o.NetworkOrchestrator]
> (API-Job-Executor-1:ctx-d4970c19 job-5081 ctx-d5de7979) (logid:16897ea6)
> Can't delete the network, not all user vms are expunged. Vm
> VM[User|i-2-11-VM] is in Stopped state
> 2019-03-19 12:27:53,263 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Complete async
> job-5081, jobStatus: FAILED, resultCode: 530, result:
>
> org.apache.cloudstack.api.response.ExceptionResponse/null/{"uuidList":[],"errorcode":530,"errortext":"Failed
> to delete network"}
> 2019-03-19 12:27:53,264 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Publish async
> job-5081 complete on message bus
> 2019-03-19 12:27:53,264 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Wake up jobs
> related to job-5081
> 2019-03-19 12:27:53,264 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Update db
> status for job-5081
> 2019-03-19 12:27:53,265 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]
> (API-Job-Executor-1:ctx-d4970c19 job-5081) (logid:16897ea6) Wake up jobs
> joined with job-5081 and disjoin all subjobs created from job- 5081
> 2019-03-19 12:27:53,267

44 matches

Mail list logo