Re: Snapshots are not working after upgrading to 4.15.0
Hello Andrei, As yourself noticed, ACS has a hardcoded threshold for secondary storages. In cases that the secondary storage has large capacities, 10% can mean a lot of storage. There is an open PR (https://github.com/apache/cloudstack/pull/4790) that externalize this threshold to allow operators decide how much they need. Also, the logs of secondary storage management were improved with PR https://github.com/apache/cloudstack/pull/4955. With respect to KVM snapshots, volume snapshots are taken in a quite peculiar way. Instead of taking volume snapshots directly, ACS takes a full snapshot of the VM, which may cause freeze on it (VM) due to memory snapshot, and then extracts the disk from the VM snapshot. Due to it, it was open an issue https://github.com/apache/cloudstack/issues/5124 to discuss a new workflow to snapshots on KVM. I am already implementing the solution for the issue 5124, and to improve this whole snapshot process for KVM; However, it is a complex and long standing job. As soon as we have something, I would appreciate to receive some feedback from you. Regards, Guto On 2021/06/16 16:15:51, Andrei Mikhailovsky wrote: > Hello,> > > I've done some more investigation and indeed, the snapshots were not taken because the secondary storage was over 90% used. I have started cleaning some of the older volumes and noticed another problem. After removing snapshots, they do not seem to be removed from the secondary storage. I've removed all snapshots over 24 hours ago and it looks like the disk space hasn't been freed up at all.> > > Looks like there are issues with snapshotting function after all.> > > Andrei> > > > > - Original Message -> > > From: "Harikrishna Patnala" > > > To: "users" > > > Sent: Tuesday, 8 June, 2021 03:33:57> > > Subject: Re: Snapshots are not working after upgrading to 4.15.0> > > > Hi Andrei,> > > > > > Can you check the following things and let us know?> > > > > > > > > 1. Can you try creating a new volume and then create snapshot of that, to check> > > if this an issue with old entries> > > 2. For the snapshots which are failing can you check if you are seeing any> > > error messages like this "Can't find an image storage in zone with less than".> > > This is to check if secondary storage free space check failed.> > > 3. For the snapshots which are failing and if it is delta snapshot can you> > > check if its parent's snapshot entry exists in "snapshot_store_ref" table with> > > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This is> > > to find the secondary storage where the parent snapshot backup is located.> > > > > > Regards,> > > Harikrishna> > > > > > From: Andrei Mikhailovsky > > > Sent: Monday, June 7, 2021 7:00 PM> > > To: users > > > Subject: Snapshots are not working after upgrading to 4.15.0> > > > > > Hello everyone,> > > > > > I am having an issue with volume snapshots since I've upgraded to 4.15.0. None> > > of the volumes are being snapshotted regardless if the snapshot is initiated> > > manually or from the schedule. The strange thing is that if I manually take the> > > snapshot, the GUI shows Success status, but the Storage>Snapshots show an Error> > > status. Here is what I see in the management server logs:> > > > > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl]> > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done> > > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143> > > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor]> > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Remove> > > job-86143 from job monitoring> > > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl]> > > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot> > > com.cloud.utils.exception.CloudRuntimeException: can not find an image stores> > > at> > > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271)> > > at> > > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171)> > > at> > > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238)> > > at> > > org.apache.cloudstack.managed.context.ManagedCon
Re: Snapshots are not working after upgrading to 4.15.0
Hi Andrei, Can you test 4.15.1.0 RC2 which is up for voting/testing and if you're able to reproduce the issue, please file a bugreport if it's not same as https://github.com/apache/cloudstack/issues/4797 #4747 is Ceph specific which unfortunately I don't have an environment to test again but PRs are welcome by any Ceph user/developer. Thanks. Regards. From: Slavka Peleva Sent: Thursday, June 17, 2021 20:50 To: users@cloudstack.apache.org Subject: Re: Snapshots are not working after upgrading to 4.15.0 Hi all, I've compared the delete of snapshots between 4.13 and 4.15.1. The main difference is that when picking the snapshot strategy in 4.13, the deletion is handled by XenserverSnapshotStrategy (renamed DefaultSnapshotStrategy in the newer versions), and for 4.15.1 is handled by StorageSystemSnapshotStrategy. The difference is that the first one deletes the snapshot chain in secondary storage, the second deletes the snapshot only on the primary (Ceph) storage. Gabriel, if you are aware of the problem, can you correct me if I'm wrong? Best regards, Slavka On Thu, Jun 17, 2021 at 4:23 PM Gabriel Bräscher wrote: > Hi Andrei, > > I appreciate all the efforts and the help in narrowing down this issue. It > looks similar and probably it is related to bug #4797 indeed. > This bug is for some time to be fixed and I perfectly understand why you > are not happy. > > I am speaking for myself here and I am not the Release Manager (RM) of > 4.15.1.0 but In my point of view, this does not necessarily impact on > blocking 4.15.1.0. > > Fixing it has been proving a bit trickier and also requires manual tests > with different environment configurations and some time to debug and > develop. > I myself had no time to fix it for 4.15.1.0 thus decided to not hold > 4.15.1.0 as it would mean that many users would not have several bug fixes > due to this one. > > To give some context. I work for a hosting company that has been > contributing to bug fixes and new features for a long time. > We even fixed bugs that do not impact us directly (e.g. issues that affect > storage systems we do not use, or a hypervisor we do not use, etc). > This means that I, as a contributor, sometimes have less time for some > tasks than other ones. > > With that said, I will be re-checking this issue soon(ish) but I cannot > guarantee that I will be able to bring a fix in time for 4.15.1.0. > If any contributor has time to fix it I would be happy to help with review > and testing. > > Best regards, > Gabriel. > > Em qui., 17 de jun. de 2021 às 07:31, Andrei Mikhailovsky > escreveu: > > > Hi Suresh, > > > > This is what I've answered on the db tables: > > > > The table snapshots has NULL under the removed column in all > snapshots > > that I've > > removed. The table snapshot_store_ref has no such column, but the > > state shown > > as Destroyed. > > > > > > I've done some more checking under the ssvm itself, which look ok: > > > > > > root@s-2536-VM:/usr/local/cloud/systemvm# > > /usr/local/cloud/systemvm/ssvm-check.sh > > > > First DNS server is 192.168.169.254 > > PING 192.168.169.254 (192.168.169.254): 56 data bytes > > 64 bytes from 192.168.169.254: icmp_seq=0 ttl=64 time=0.520 ms > > 64 bytes from 192.168.169.254: icmp_seq=1 ttl=64 time=0.294 ms > > --- 192.168.169.254 ping statistics --- > > 2 packets transmitted, 2 packets received, 0% packet loss > > round-trip min/avg/max/stddev = 0.294/0.407/0.520/0.113 ms > > Good: Can ping DNS server > > > > Good: DNS resolves cloudstack.apache.org > > > > nfs is currently mounted > > Mount point is /mnt/SecStorage/ceb27169-9a58-32ef-81b4-33b0b12e9aa2 > > Good: Can write to mount point > > > > Management server is 192.168.169.13. Checking connectivity. > > Good: Can connect to management server 192.168.169.13 port 8250 > > > > Good: Java process is running > > > > Tests Complete. Look for ERROR or WARNING above. > > > > > > The management server does show errors like these, without any further > > details: > > > > 2021-06-17 10:31:06,197 DEBUG [c.c.s.StorageManagerImpl] > > (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to > delete > > snapshot: 55183 from storage > > 2021-06-17 10:31:06,280 DEBUG [o.a.c.s.s.SnapshotObject] > > (StorageMa
Re: Snapshots are not working after upgrading to 4.15.0
led to > update > > state:com.cloud.utils.fsm.NoTransitionException: Unable to transition to > a > > new state from Destroyed via DestroyRequested > > > > > > Regarding the bug 4797. I can't really comment as it has very little > > technical details without the management log errors, etc. But > essentially, > > at the high level, the snapshots are not deleted from the backend in my > > case, just like in the bug 4797. > > > > > > TBH, I am very much surprised that a bug in such an important function of > > ACS has slipped through the testing methods for the 4.15.0 release and > > despite being discovered over 3 months ago, it hasn't been scheduled for > > the fix in 4.15.1 bug fix release. Does that sound right to you? I think > > this issue should be revisited and corrected as it will cause a fill up > of > > the secondary storage and ultimately cause all sorts of issues with > > creation of snapshots. > > > > Andrei > > > > > > - Original Message - > > > From: "Suresh Anaparti" > > > To: "users" > > > Sent: Thursday, 17 June, 2021 11:16:59 > > > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > > > > > Hi Andrei, > > > > > > Have you checked the 'status' and 'removed' timestamp in snapshots > > table, and > > > 'state' in snapshot_store_ref table for these snapshots. > > > > > > Similar issue logged (by Ed, as mentioned in his email) here: > > > https://github.com/apache/cloudstack/issues/4797. Is it the same > issue? > > > > > > Regards, > > > Suresh > > > > > >On 17/06/21, 2:18 PM, "Andrei Mikhailovsky" > > > wrote: > > > > > >Hi Suresh, Please see below the answers to your questions. > > > > > > > > > > > > > > > - Original Message - > > >> From: "Suresh Anaparti" > > >> To: "users" > > >> Sent: Thursday, 17 June, 2021 06:36:27 > > >> Subject: Re: Snapshots are not working after upgrading to 4.15.0 > > > > > >> Hi Andrei, > > >> > > >> Can you check if the storage garbage collector is enabled or not > in > > your env > > >> (specified using the global setting 'storage.cleanup.enabled'). If > > it is > > >> enabled, check the interval & delay setting: > > 'storage.cleanup.interval' and > > >> 'storage.cleanup.delay', and see the logs to confirm cleanup is > > performed or > > >> not. > > > > > >storage.cleanup.enabled is true > > >storage.cleanup.interval is 3600 > > >storage.cleanup.delay is 360086400 > > > > > >> > > >> Also, check the snapshot status / state in snapshots & > > snapshot_store_ref tables > > >> for the snapshots that are not deleted during the cleanup. Is > > 'removed' > > >> timestamp set for them in snapshots table? > > >> > > > > > > > > >The table snapshots has NULL under the removed column in all > > snapshots that I've > > >removed. The table snapshot_store_ref has no such column, but the > > state shown > > >as Destroyed. > > > > > > > > > > > > > > >> Regards, > > >> Suresh > > >> > > >>On 16/06/21, 9:46 PM, "Andrei Mikhailovsky" > > wrote: > > >> > > >>Hello, > > >> > > >>I've done some more investigation and indeed, the snapshots > were > > not taken > > >>because the secondary storage was over 90% used. I have started > > cleaning some > > >>of the older volumes and noticed another problem. After > removing > > snapshots, > > >>they do not seem to be removed from the secondary storage. I've > > removed all > > >>snapshots over 24 hours ago and it looks like the disk space > > hasn't been freed > > >>up at all. > > >> > > >>Looks like there are issues with snapshotting function after > all. > > >> > > >>Andrei > > >> > > >> > > >> > > > > > > > > > > >> > > >> - Original Messag
Re: Snapshots are not working after upgrading to 4.15.0
Hi Andrei, I appreciate all the efforts and the help in narrowing down this issue. It looks similar and probably it is related to bug #4797 indeed. This bug is for some time to be fixed and I perfectly understand why you are not happy. I am speaking for myself here and I am not the Release Manager (RM) of 4.15.1.0 but In my point of view, this does not necessarily impact on blocking 4.15.1.0. Fixing it has been proving a bit trickier and also requires manual tests with different environment configurations and some time to debug and develop. I myself had no time to fix it for 4.15.1.0 thus decided to not hold 4.15.1.0 as it would mean that many users would not have several bug fixes due to this one. To give some context. I work for a hosting company that has been contributing to bug fixes and new features for a long time. We even fixed bugs that do not impact us directly (e.g. issues that affect storage systems we do not use, or a hypervisor we do not use, etc). This means that I, as a contributor, sometimes have less time for some tasks than other ones. With that said, I will be re-checking this issue soon(ish) but I cannot guarantee that I will be able to bring a fix in time for 4.15.1.0. If any contributor has time to fix it I would be happy to help with review and testing. Best regards, Gabriel. Em qui., 17 de jun. de 2021 às 07:31, Andrei Mikhailovsky escreveu: > Hi Suresh, > > This is what I've answered on the db tables: > > The table snapshots has NULL under the removed column in all snapshots > that I've > removed. The table snapshot_store_ref has no such column, but the > state shown > as Destroyed. > > > I've done some more checking under the ssvm itself, which look ok: > > > root@s-2536-VM:/usr/local/cloud/systemvm# > /usr/local/cloud/systemvm/ssvm-check.sh > > First DNS server is 192.168.169.254 > PING 192.168.169.254 (192.168.169.254): 56 data bytes > 64 bytes from 192.168.169.254: icmp_seq=0 ttl=64 time=0.520 ms > 64 bytes from 192.168.169.254: icmp_seq=1 ttl=64 time=0.294 ms > --- 192.168.169.254 ping statistics --- > 2 packets transmitted, 2 packets received, 0% packet loss > round-trip min/avg/max/stddev = 0.294/0.407/0.520/0.113 ms > Good: Can ping DNS server > > Good: DNS resolves cloudstack.apache.org > > nfs is currently mounted > Mount point is /mnt/SecStorage/ceb27169-9a58-32ef-81b4-33b0b12e9aa2 > Good: Can write to mount point > > Management server is 192.168.169.13. Checking connectivity. > Good: Can connect to management server 192.168.169.13 port 8250 > > Good: Java process is running > > Tests Complete. Look for ERROR or WARNING above. > > > The management server does show errors like these, without any further > details: > > 2021-06-17 10:31:06,197 DEBUG [c.c.s.StorageManagerImpl] > (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to delete > snapshot: 55183 from storage > 2021-06-17 10:31:06,280 DEBUG [o.a.c.s.s.SnapshotObject] > (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to update > state:com.cloud.utils.fsm.NoTransitionException: Unable to transition to a > new state from Destroyed via DestroyRequested > 2021-06-17 10:31:06,281 DEBUG [c.c.s.StorageManagerImpl] > (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to delete > snapshot: 84059 from storage > 2021-06-17 10:31:06,363 DEBUG [o.a.c.s.s.SnapshotObject] > (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to update > state:com.cloud.utils.fsm.NoTransitionException: Unable to transition to a > new state from Destroyed via DestroyRequested > > > Regarding the bug 4797. I can't really comment as it has very little > technical details without the management log errors, etc. But essentially, > at the high level, the snapshots are not deleted from the backend in my > case, just like in the bug 4797. > > > TBH, I am very much surprised that a bug in such an important function of > ACS has slipped through the testing methods for the 4.15.0 release and > despite being discovered over 3 months ago, it hasn't been scheduled for > the fix in 4.15.1 bug fix release. Does that sound right to you? I think > this issue should be revisited and corrected as it will cause a fill up of > the secondary storage and ultimately cause all sorts of issues with > creation of snapshots. > > Andrei > > > - Original Message - > > From: "Suresh Anaparti" > > To: "users" > > Sent: Thursday, 17 J
Re: Snapshots are not working after upgrading to 4.15.0
Hi Suresh, This is what I've answered on the db tables: The table snapshots has NULL under the removed column in all snapshots that I've removed. The table snapshot_store_ref has no such column, but the state shown as Destroyed. I've done some more checking under the ssvm itself, which look ok: root@s-2536-VM:/usr/local/cloud/systemvm# /usr/local/cloud/systemvm/ssvm-check.sh First DNS server is 192.168.169.254 PING 192.168.169.254 (192.168.169.254): 56 data bytes 64 bytes from 192.168.169.254: icmp_seq=0 ttl=64 time=0.520 ms 64 bytes from 192.168.169.254: icmp_seq=1 ttl=64 time=0.294 ms --- 192.168.169.254 ping statistics --- 2 packets transmitted, 2 packets received, 0% packet loss round-trip min/avg/max/stddev = 0.294/0.407/0.520/0.113 ms Good: Can ping DNS server Good: DNS resolves cloudstack.apache.org nfs is currently mounted Mount point is /mnt/SecStorage/ceb27169-9a58-32ef-81b4-33b0b12e9aa2 Good: Can write to mount point Management server is 192.168.169.13. Checking connectivity. Good: Can connect to management server 192.168.169.13 port 8250 Good: Java process is running Tests Complete. Look for ERROR or WARNING above. The management server does show errors like these, without any further details: 2021-06-17 10:31:06,197 DEBUG [c.c.s.StorageManagerImpl] (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to delete snapshot: 55183 from storage 2021-06-17 10:31:06,280 DEBUG [o.a.c.s.s.SnapshotObject] (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to update state:com.cloud.utils.fsm.NoTransitionException: Unable to transition to a new state from Destroyed via DestroyRequested 2021-06-17 10:31:06,281 DEBUG [c.c.s.StorageManagerImpl] (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to delete snapshot: 84059 from storage 2021-06-17 10:31:06,363 DEBUG [o.a.c.s.s.SnapshotObject] (StorageManager-Scavenger-1:ctx-b9b038de) (logid:d96d09c4) Failed to update state:com.cloud.utils.fsm.NoTransitionException: Unable to transition to a new state from Destroyed via DestroyRequested Regarding the bug 4797. I can't really comment as it has very little technical details without the management log errors, etc. But essentially, at the high level, the snapshots are not deleted from the backend in my case, just like in the bug 4797. TBH, I am very much surprised that a bug in such an important function of ACS has slipped through the testing methods for the 4.15.0 release and despite being discovered over 3 months ago, it hasn't been scheduled for the fix in 4.15.1 bug fix release. Does that sound right to you? I think this issue should be revisited and corrected as it will cause a fill up of the secondary storage and ultimately cause all sorts of issues with creation of snapshots. Andrei - Original Message - > From: "Suresh Anaparti" > To: "users" > Sent: Thursday, 17 June, 2021 11:16:59 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Have you checked the 'status' and 'removed' timestamp in snapshots table, and > 'state' in snapshot_store_ref table for these snapshots. > > Similar issue logged (by Ed, as mentioned in his email) here: > https://github.com/apache/cloudstack/issues/4797. Is it the same issue? > > Regards, > Suresh > >On 17/06/21, 2:18 PM, "Andrei Mikhailovsky" wrote: > >Hi Suresh, Please see below the answers to your questions. > > > > > - Original Message - >> From: "Suresh Anaparti" >> To: "users" >> Sent: Thursday, 17 June, 2021 06:36:27 >> Subject: Re: Snapshots are not working after upgrading to 4.15.0 > >> Hi Andrei, >> >> Can you check if the storage garbage collector is enabled or not in your > env >> (specified using the global setting 'storage.cleanup.enabled'). If it is >> enabled, check the interval & delay setting: 'storage.cleanup.interval' > and >> 'storage.cleanup.delay', and see the logs to confirm cleanup is > performed or >> not. > >storage.cleanup.enabled is true >storage.cleanup.interval is 3600 >storage.cleanup.delay is 360086400 > >> >> Also, check the snapshot status / state in snapshots & > snapshot_store_ref tables >> for the snapshots that are not deleted during the cleanup. Is 'removed' >> timestamp set for them in snapshots table? >> > > >The table snapshots has N
Re: Snapshots are not working after upgrading to 4.15.0
Hi Andrei, Have you checked the 'status' and 'removed' timestamp in snapshots table, and 'state' in snapshot_store_ref table for these snapshots. Similar issue logged (by Ed, as mentioned in his email) here: https://github.com/apache/cloudstack/issues/4797. Is it the same issue? Regards, Suresh On 17/06/21, 2:18 PM, "Andrei Mikhailovsky" wrote: Hi Suresh, Please see below the answers to your questions. - Original Message - > From: "Suresh Anaparti" > To: "users" > Sent: Thursday, 17 June, 2021 06:36:27 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Can you check if the storage garbage collector is enabled or not in your env > (specified using the global setting 'storage.cleanup.enabled'). If it is > enabled, check the interval & delay setting: 'storage.cleanup.interval' and > 'storage.cleanup.delay', and see the logs to confirm cleanup is performed or > not. storage.cleanup.enabled is true storage.cleanup.interval is 3600 storage.cleanup.delay is 360086400 > > Also, check the snapshot status / state in snapshots & snapshot_store_ref tables > for the snapshots that are not deleted during the cleanup. Is 'removed' > timestamp set for them in snapshots table? > The table snapshots has NULL under the removed column in all snapshots that I've removed. The table snapshot_store_ref has no such column, but the state shown as Destroyed. > Regards, > Suresh > >On 16/06/21, 9:46 PM, "Andrei Mikhailovsky" wrote: > >Hello, > >I've done some more investigation and indeed, the snapshots were not taken >because the secondary storage was over 90% used. I have started cleaning some >of the older volumes and noticed another problem. After removing snapshots, >they do not seem to be removed from the secondary storage. I've removed all >snapshots over 24 hours ago and it looks like the disk space hasn't been freed >up at all. > >Looks like there are issues with snapshotting function after all. > >Andrei > > > > > > > - Original Message - > > From: "Harikrishna Patnala" >> To: "users" >> Sent: Tuesday, 8 June, 2021 03:33:57 >> Subject: Re: Snapshots are not working after upgrading to 4.15.0 > >> Hi Andrei, >> >> Can you check the following things and let us know? >> >> >> 1. Can you try creating a new volume and then create snapshot of that, to check >> if this an issue with old entries >> 2. For the snapshots which are failing can you check if you are seeing any >> error messages like this "Can't find an image storage in zone with less than". >> This is to check if secondary storage free space check failed. >> 3. For the snapshots which are failing and if it is delta snapshot can you >> check if its parent's snapshot entry exists in "snapshot_store_ref" table with >> 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This is >> to find the secondary storage where the parent snapshot backup is located. >> >> Regards, >> Harikrishna >> >> From: Andrei Mikhailovsky >> Sent: Monday, June 7, 2021 7:00 PM >> To: users >> Subject: Snapshots are not working after upgrading to 4.15.0 >> >> Hello everyone, >> >> I am having an issue with volume snapshots since I've upgraded to 4.15.0. None >> of the volumes are being snapshotted regardless if the snapshot is initiated >> manually or from the schedule. The strange thing is that if I manually take the >> snapshot, the GUI shows Success status, but the Storage>Snapshots show an Error >> status. Here is what I see in the management server logs: >> >> 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] >> (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done >> executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 >> 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] >> (Work-Job-Executor-81:ctx-08dd4222 job
Re: Snapshots are not working after upgrading to 4.15.0
Hi Suresh, Please see below the answers to your questions. - Original Message - > From: "Suresh Anaparti" > To: "users" > Sent: Thursday, 17 June, 2021 06:36:27 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Can you check if the storage garbage collector is enabled or not in your env > (specified using the global setting 'storage.cleanup.enabled'). If it is > enabled, check the interval & delay setting: 'storage.cleanup.interval' and > 'storage.cleanup.delay', and see the logs to confirm cleanup is performed or > not. storage.cleanup.enabled is true storage.cleanup.interval is 3600 storage.cleanup.delay is 360086400 > > Also, check the snapshot status / state in snapshots & snapshot_store_ref > tables > for the snapshots that are not deleted during the cleanup. Is 'removed' > timestamp set for them in snapshots table? > The table snapshots has NULL under the removed column in all snapshots that I've removed. The table snapshot_store_ref has no such column, but the state shown as Destroyed. > Regards, > Suresh > >On 16/06/21, 9:46 PM, "Andrei Mikhailovsky" wrote: > >Hello, > >I've done some more investigation and indeed, the snapshots were not taken >because the secondary storage was over 90% used. I have started cleaning > some >of the older volumes and noticed another problem. After removing snapshots, >they do not seem to be removed from the secondary storage. I've removed all >snapshots over 24 hours ago and it looks like the disk space hasn't been > freed >up at all. > >Looks like there are issues with snapshotting function after all. > >Andrei > > > > > > > - Original Message - >> From: "Harikrishna Patnala" >> To: "users" >> Sent: Tuesday, 8 June, 2021 03:33:57 >> Subject: Re: Snapshots are not working after upgrading to 4.15.0 > >> Hi Andrei, >> >> Can you check the following things and let us know? >> >> >> 1. Can you try creating a new volume and then create snapshot of that, > to check >> if this an issue with old entries >> 2. For the snapshots which are failing can you check if you are seeing > any >> error messages like this "Can't find an image storage in zone with less > than". >> This is to check if secondary storage free space check failed. >> 3. For the snapshots which are failing and if it is delta snapshot can > you >> check if its parent's snapshot entry exists in "snapshot_store_ref" > table with >> 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". > This is >> to find the secondary storage where the parent snapshot backup is > located. >> >> Regards, >> Harikrishna >> >> From: Andrei Mikhailovsky >> Sent: Monday, June 7, 2021 7:00 PM >> To: users >> Subject: Snapshots are not working after upgrading to 4.15.0 >> >> Hello everyone, >> >> I am having an issue with volume snapshots since I've upgraded to > 4.15.0. None >> of the volumes are being snapshotted regardless if the snapshot is > initiated >> manually or from the schedule. The strange thing is that if I manually > take the >> snapshot, the GUI shows Success status, but the Storage>Snapshots show > an Error >> status. Here is what I see in the management server logs: >> >> 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] >> (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Done >> executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 >> 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] >> (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Remove >> job-86143 from job monitoring >> 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] >> (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy > snapshot >> com.cloud.utils.exception.CloudRuntimeException: can not find an image > stores >> at >> > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) >> at >> > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) >> at >> > com.clo
Re: Snapshots are not working after upgrading to 4.15.0
Hi Guys, I have already logged this as a big under reference: 4797 Ed On Thu, 17 Jun 2021 at 06:37, Suresh Anaparti wrote: > Hi Andrei, > > Can you check if the storage garbage collector is enabled or not in your > env (specified using the global setting 'storage.cleanup.enabled'). If it > is enabled, check the interval & delay setting: 'storage.cleanup.interval' > and 'storage.cleanup.delay', and see the logs to confirm cleanup is > performed or not. > > Also, check the snapshot status / state in snapshots & snapshot_store_ref > tables for the snapshots that are not deleted during the cleanup. Is > 'removed' timestamp set for them in snapshots table? > > Regards, > Suresh > > On 16/06/21, 9:46 PM, "Andrei Mikhailovsky" > wrote: > > Hello, > > I've done some more investigation and indeed, the snapshots were not > taken because the secondary storage was over 90% used. I have started > cleaning some of the older volumes and noticed another problem. After > removing snapshots, they do not seem to be removed from the secondary > storage. I've removed all snapshots over 24 hours ago and it looks like > the disk space hasn't been freed up at all. > > Looks like there are issues with snapshotting function after all. > > Andrei > > > > > > > - Original Message - > > From: "Harikrishna Patnala" > > To: "users" > > Sent: Tuesday, 8 June, 2021 03:33:57 > > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > > > Hi Andrei, > > > > Can you check the following things and let us know? > > > > > > 1. Can you try creating a new volume and then create snapshot of > that, to check > > if this an issue with old entries > > 2. For the snapshots which are failing can you check if you are > seeing any > > error messages like this "Can't find an image storage in zone with > less than". > > This is to check if secondary storage free space check failed. > > 3. For the snapshots which are failing and if it is delta snapshot > can you > > check if its parent's snapshot entry exists in "snapshot_store_ref" > table with > > 'parent_snapshot_id' of the current snapshot with 'store_role' > "Image". This is > > to find the secondary storage where the parent snapshot backup is > located. > > > > Regards, > > Harikrishna > > > > From: Andrei Mikhailovsky > > Sent: Monday, June 7, 2021 7:00 PM > > To: users > > Subject: Snapshots are not working after upgrading to 4.15.0 > > > > Hello everyone, > > > > I am having an issue with volume snapshots since I've upgraded to > 4.15.0. None > > of the volumes are being snapshotted regardless if the snapshot is > initiated > > manually or from the schedule. The strange thing is that if I > manually take the > > snapshot, the GUI shows Success status, but the Storage>Snapshots > show an Error > > status. Here is what I see in the management server logs: > > > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) > (logid:be34ce01) Done > > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) > (logid:be34ce01) Remove > > job-86143 from job monitoring > > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy > snapshot > > com.cloud.utils.exception.CloudRuntimeException: can not find an > image stores > > at > > > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > > at > > > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > > at > > > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > > at > > > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > > at > > > org.apache.cloudstack.managed.context.i
Re: Snapshots are not working after upgrading to 4.15.0
Hi Andrei, Can you check if the storage garbage collector is enabled or not in your env (specified using the global setting 'storage.cleanup.enabled'). If it is enabled, check the interval & delay setting: 'storage.cleanup.interval' and 'storage.cleanup.delay', and see the logs to confirm cleanup is performed or not. Also, check the snapshot status / state in snapshots & snapshot_store_ref tables for the snapshots that are not deleted during the cleanup. Is 'removed' timestamp set for them in snapshots table? Regards, Suresh On 16/06/21, 9:46 PM, "Andrei Mikhailovsky" wrote: Hello, I've done some more investigation and indeed, the snapshots were not taken because the secondary storage was over 90% used. I have started cleaning some of the older volumes and noticed another problem. After removing snapshots, they do not seem to be removed from the secondary storage. I've removed all snapshots over 24 hours ago and it looks like the disk space hasn't been freed up at all. Looks like there are issues with snapshotting function after all. Andrei - Original Message - > From: "Harikrishna Patnala" > To: "users" > Sent: Tuesday, 8 June, 2021 03:33:57 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Can you check the following things and let us know? > > > 1. Can you try creating a new volume and then create snapshot of that, to check > if this an issue with old entries > 2. For the snapshots which are failing can you check if you are seeing any > error messages like this "Can't find an image storage in zone with less than". > This is to check if secondary storage free space check failed. > 3. For the snapshots which are failing and if it is delta snapshot can you > check if its parent's snapshot entry exists in "snapshot_store_ref" table with > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This is > to find the secondary storage where the parent snapshot backup is located. > > Regards, > Harikrishna > ________ > From: Andrei Mikhailovsky > Sent: Monday, June 7, 2021 7:00 PM > To: users > Subject: Snapshots are not working after upgrading to 4.15.0 > > Hello everyone, > > I am having an issue with volume snapshots since I've upgraded to 4.15.0. None > of the volumes are being snapshotted regardless if the snapshot is initiated > manually or from the schedule. The strange thing is that if I manually take the > snapshot, the GUI shows Success status, but the Storage>Snapshots show an Error > status. Here is what I see in the management server logs: > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Remove > job-86143 from job monitoring > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot > com.cloud.utils.exception.CloudRuntimeException: can not find an image stores > at > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > at > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > at > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) > at > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) > at > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExec
Re: Snapshots are not working after upgrading to 4.15.0
Hello, I've done some more investigation and indeed, the snapshots were not taken because the secondary storage was over 90% used. I have started cleaning some of the older volumes and noticed another problem. After removing snapshots, they do not seem to be removed from the secondary storage. I've removed all snapshots over 24 hours ago and it looks like the disk space hasn't been freed up at all. Looks like there are issues with snapshotting function after all. Andrei - Original Message - > From: "Harikrishna Patnala" > To: "users" > Sent: Tuesday, 8 June, 2021 03:33:57 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Can you check the following things and let us know? > > > 1. Can you try creating a new volume and then create snapshot of that, to > check > if this an issue with old entries > 2. For the snapshots which are failing can you check if you are seeing any > error messages like this "Can't find an image storage in zone with less > than". > This is to check if secondary storage free space check failed. > 3. For the snapshots which are failing and if it is delta snapshot can you > check if its parent's snapshot entry exists in "snapshot_store_ref" table > with > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This > is > to find the secondary storage where the parent snapshot backup is located. > > Regards, > Harikrishna > ________ > From: Andrei Mikhailovsky > Sent: Monday, June 7, 2021 7:00 PM > To: users > Subject: Snapshots are not working after upgrading to 4.15.0 > > Hello everyone, > > I am having an issue with volume snapshots since I've upgraded to 4.15.0. None > of the volumes are being snapshotted regardless if the snapshot is initiated > manually or from the schedule. The strange thing is that if I manually take > the > snapshot, the GUI shows Success status, but the Storage>Snapshots show an > Error > status. Here is what I see in the management server logs: > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Remove > job-86143 from job monitoring > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot > com.cloud.utils.exception.CloudRuntimeException: can not find an image stores > at > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > at > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > at > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) > at > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) > at > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) > at > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) > at > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) > at java.base/java.lang.Thread.run(Thread.java:829) > 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of snapshot > failed, for snapshot with ID 53531, left with 2 more attempts > > > I've checked and the Secondary storage is configured and visible in the GUI. I > can also mount it manually from the management server and a couple of host > servers that I've tested. In addition, I can successfully upload an ISO image > and that registers just fine and I can create new VMs using the newly uploaded > ISO image. > > I've had no such problems with 4.13.x ACS, so the issue seems to have been > introduced after doing the upgrade to 4.15.0. > > Could you please let me know how do I fix the issue? > > Cheers > > andrei
Re: Snapshots are not working after upgrading to 4.15.0
Thanks Slavka, I will attempt to clean up some old snaps and see if the backups start working again. Andrei - Original Message - > From: "Slavka Peleva" > To: "users" > Sent: Monday, 14 June, 2021 13:28:01 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi, Andrei, > > After a quick check of the code, I guess that the backup of the snapshot > fails because the secondary storage space should be less than 90%, and you > pointed that yours is 90.6% full. > > Regards, > Slavka > > On Mon, Jun 14, 2021 at 1:10 PM Andrei Mikhailovsky > wrote: > >> Hi Harikrishna, >> >> I've done some more testing testing just now. Please see below the answers >> to your questions/comments: >> >> >> >> - Original Message ----- >> > From: "Harikrishna Patnala" >> > To: "users" >> > Sent: Tuesday, 8 June, 2021 03:33:57 >> > Subject: Re: Snapshots are not working after upgrading to 4.15.0 >> >> > Hi Andrei, >> > >> > Can you check the following things and let us know? >> > >> > >> > 1. Can you try creating a new volume and then create snapshot of that, >> to check >> > if this an issue with old entries >> >> AM: I've done testing with a newly created image which was attached to a >> vm. The same problem happens with new images as well as old images. Please >> see the link below for the management server log. >> >> >> https://zerobin.net/?5781e4b65d9e3605#+GtIC7JBtp70Q0cw65cypJDiSyba/r/JldRsAyOI8l4= >> >> >> > 2. For the snapshots which are failing can you check if you are seeing >> any >> > error messages like this "Can't find an image storage in zone with less >> than". >> > This is to check if secondary storage free space check failed. >> >> >> AM: I do not see any such message in the logs. tried grepping the logs for >> a couple of weeks and nothing comes up. Having said this, the secondary >> storage is about 90.6% full. >> >> >> > 3. For the snapshots which are failing and if it is delta snapshot can >> you >> > check if its parent's snapshot entry exists in "snapshot_store_ref" >> table with >> > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". >> This is >> > to find the secondary storage where the parent snapshot backup is >> located. >> > >> >> AM: all snapshots are failing, not just selective few. Some volumes are >> brand new, as I've indicated above, others do have previous snapshots. I >> only have a single secondary storage, so all snaps should be in one place. >> >> >> >> >> > Regards, >> > Harikrishna >> > >> > From: Andrei Mikhailovsky >> > Sent: Monday, June 7, 2021 7:00 PM >> > To: users >> > Subject: Snapshots are not working after upgrading to 4.15.0 >> > >> > Hello everyone, >> > >> > I am having an issue with volume snapshots since I've upgraded to >> 4.15.0. None >> > of the volumes are being snapshotted regardless if the snapshot is >> initiated >> > manually or from the schedule. The strange thing is that if I manually >> take the >> > snapshot, the GUI shows Success status, but the Storage>Snapshots show >> an Error >> > status. Here is what I see in the management server logs: >> > >> > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] >> > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) >> Done >> > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 >> > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] >> > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) >> Remove >> > job-86143 from job monitoring >> > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] >> > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy >> snapshot >> > com.cloud.utils.exception.CloudRuntimeException: can not find an image >> stores >> > at >> > >> org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) >> > at >> > >> org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) >> > at >> > >> com.cloud.storage.snapshot.Snapsho
Re: Snapshots are not working after upgrading to 4.15.0
Hi, Andrei, After a quick check of the code, I guess that the backup of the snapshot fails because the secondary storage space should be less than 90%, and you pointed that yours is 90.6% full. Regards, Slavka On Mon, Jun 14, 2021 at 1:10 PM Andrei Mikhailovsky wrote: > Hi Harikrishna, > > I've done some more testing testing just now. Please see below the answers > to your questions/comments: > > > > - Original Message - > > From: "Harikrishna Patnala" > > To: "users" > > Sent: Tuesday, 8 June, 2021 03:33:57 > > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > > > Hi Andrei, > > > > Can you check the following things and let us know? > > > > > > 1. Can you try creating a new volume and then create snapshot of that, > to check > > if this an issue with old entries > > AM: I've done testing with a newly created image which was attached to a > vm. The same problem happens with new images as well as old images. Please > see the link below for the management server log. > > > https://zerobin.net/?5781e4b65d9e3605#+GtIC7JBtp70Q0cw65cypJDiSyba/r/JldRsAyOI8l4= > > > > 2. For the snapshots which are failing can you check if you are seeing > any > > error messages like this "Can't find an image storage in zone with less > than". > > This is to check if secondary storage free space check failed. > > > AM: I do not see any such message in the logs. tried grepping the logs for > a couple of weeks and nothing comes up. Having said this, the secondary > storage is about 90.6% full. > > > > 3. For the snapshots which are failing and if it is delta snapshot can > you > > check if its parent's snapshot entry exists in "snapshot_store_ref" > table with > > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". > This is > > to find the secondary storage where the parent snapshot backup is > located. > > > > AM: all snapshots are failing, not just selective few. Some volumes are > brand new, as I've indicated above, others do have previous snapshots. I > only have a single secondary storage, so all snaps should be in one place. > > > > > > Regards, > > Harikrishna > > > > From: Andrei Mikhailovsky > > Sent: Monday, June 7, 2021 7:00 PM > > To: users > > Subject: Snapshots are not working after upgrading to 4.15.0 > > > > Hello everyone, > > > > I am having an issue with volume snapshots since I've upgraded to > 4.15.0. None > > of the volumes are being snapshotted regardless if the snapshot is > initiated > > manually or from the schedule. The strange thing is that if I manually > take the > > snapshot, the GUI shows Success status, but the Storage>Snapshots show > an Error > > status. Here is what I see in the management server logs: > > > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Done > > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Remove > > job-86143 from job monitoring > > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy > snapshot > > com.cloud.utils.exception.CloudRuntimeException: can not find an image > stores > > at > > > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > > at > > > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > > at > > > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > > at > > > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) > > at > > > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) > > at > > > java.base/java.ut
Re: Snapshots are not working after upgrading to 4.15.0
Hi Harikrishna, I've done some more testing testing just now. Please see below the answers to your questions/comments: - Original Message - > From: "Harikrishna Patnala" > To: "users" > Sent: Tuesday, 8 June, 2021 03:33:57 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Can you check the following things and let us know? > > > 1. Can you try creating a new volume and then create snapshot of that, to > check > if this an issue with old entries AM: I've done testing with a newly created image which was attached to a vm. The same problem happens with new images as well as old images. Please see the link below for the management server log. https://zerobin.net/?5781e4b65d9e3605#+GtIC7JBtp70Q0cw65cypJDiSyba/r/JldRsAyOI8l4= > 2. For the snapshots which are failing can you check if you are seeing any > error messages like this "Can't find an image storage in zone with less > than". > This is to check if secondary storage free space check failed. AM: I do not see any such message in the logs. tried grepping the logs for a couple of weeks and nothing comes up. Having said this, the secondary storage is about 90.6% full. > 3. For the snapshots which are failing and if it is delta snapshot can you > check if its parent's snapshot entry exists in "snapshot_store_ref" table > with > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This > is > to find the secondary storage where the parent snapshot backup is located. > AM: all snapshots are failing, not just selective few. Some volumes are brand new, as I've indicated above, others do have previous snapshots. I only have a single secondary storage, so all snaps should be in one place. > Regards, > Harikrishna > ________ > From: Andrei Mikhailovsky > Sent: Monday, June 7, 2021 7:00 PM > To: users > Subject: Snapshots are not working after upgrading to 4.15.0 > > Hello everyone, > > I am having an issue with volume snapshots since I've upgraded to 4.15.0. None > of the volumes are being snapshotted regardless if the snapshot is initiated > manually or from the schedule. The strange thing is that if I manually take > the > snapshot, the GUI shows Success status, but the Storage>Snapshots show an > Error > status. Here is what I see in the management server logs: > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Remove > job-86143 from job monitoring > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot > com.cloud.utils.exception.CloudRuntimeException: can not find an image stores > at > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > at > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > at > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) > at > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) > at > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) > at > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) > at > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) > at java.base/java.lang.Thread.run(Thread.java:829) > 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of snapshot > failed, for snapshot with ID 53531, left with 2 more attempts > > > I've checked and the Secondary storage is
Re: Snapshots are not working after upgrading to 4.15.0
Oh, should have mentioned that in the original post. I am using KVM hypervisor with ceph/rbd as the primary storage and nfs as the secondary storage. Andrei > From: "Andrija Panic" > To: "users" , "andrei" > Sent: Tuesday, 8 June, 2021 21:38:28 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > [ mailto:and...@arhont.com | @Andrei Mikhailovsky ] can you advise which > hypervisor (and version) you are using, what primary storage - let's see if > the > same is true in 4.15.1 (voting happening right now - feel free to test and > vote > as well, please) > Best, > On Tue, 8 Jun 2021 at 14:25, Andrei Mikhailovsky > wrote: >> Thanks for the suggestions, Harikrishna. I will check it and revert back. >> Andrei >> - Original Message - >>> From: "Harikrishna Patnala" < [ mailto:harikrishna.patn...@shapeblue.com | >> > harikrishna.patn...@shapeblue.com ] > >>> To: "users" < [ mailto:users@cloudstack.apache.org | >>> users@cloudstack.apache.org >> > ] > >> > Sent: Tuesday, 8 June, 2021 03:33:57 >> > Subject: Re: Snapshots are not working after upgrading to 4.15.0 >> > Hi Andrei, >> > Can you check the following things and let us know? >> > 1. Can you try creating a new volume and then create snapshot of that, to >> > check >> > if this an issue with old entries >> > 2. For the snapshots which are failing can you check if you are seeing any >> > error messages like this "Can't find an image storage in zone with less >> > than". >> > This is to check if secondary storage free space check failed. >> > 3. For the snapshots which are failing and if it is delta snapshot can you >> > check if its parent's snapshot entry exists in "snapshot_store_ref" table >> > with >> > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". >> > This is >> > to find the secondary storage where the parent snapshot backup is located. >> > Regards, >> > Harikrishna >> > >> > From: Andrei Mikhailovsky >> > Sent: Monday, June 7, 2021 7:00 PM >>> To: users < [ mailto:users@cloudstack.apache.org | >>> users@cloudstack.apache.org ] >> > > >> > Subject: Snapshots are not working after upgrading to 4.15.0 >> > Hello everyone, >> > I am having an issue with volume snapshots since I've upgraded to 4.15.0. >> > None >> > of the volumes are being snapshotted regardless if the snapshot is >> > initiated >> > manually or from the schedule. The strange thing is that if I manually >> > take the >> > snapshot, the GUI shows Success status, but the Storage>Snapshots show an >> > Error >> > status. Here is what I see in the management server logs: >> > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] >> > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) >> > Done >> > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 >> > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] >> > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) >> > Remove >> > job-86143 from job monitoring >> > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] >> > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy >> > snapshot >> > com.cloud.utils.exception.CloudRuntimeException: can not find an image >> > stores >> > at >> > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) >> > at >> > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) >> > at >> > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) >> > at >> > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) >> > at >> > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) >> > at >> > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) >> > at >> > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) >> > at >> > org.apache.cloudstack.managed.context.ManagedC
Re: Snapshots are not working after upgrading to 4.15.0
@Andrei Mikhailovsky can you advise which hypervisor (and version) you are using, what primary storage - let's see if the same is true in 4.15.1 (voting happening right now - feel free to test and vote as well, please) Best, On Tue, 8 Jun 2021 at 14:25, Andrei Mikhailovsky wrote: > Thanks for the suggestions, Harikrishna. I will check it and revert back. > > Andrei > > > > - Original Message - > > From: "Harikrishna Patnala" > > To: "users" > > Sent: Tuesday, 8 June, 2021 03:33:57 > > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > > > Hi Andrei, > > > > Can you check the following things and let us know? > > > > > > 1. Can you try creating a new volume and then create snapshot of that, > to check > > if this an issue with old entries > > 2. For the snapshots which are failing can you check if you are seeing > any > > error messages like this "Can't find an image storage in zone with less > than". > > This is to check if secondary storage free space check failed. > > 3. For the snapshots which are failing and if it is delta snapshot can > you > > check if its parent's snapshot entry exists in "snapshot_store_ref" > table with > > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". > This is > > to find the secondary storage where the parent snapshot backup is > located. > > > > Regards, > > Harikrishna > > > > From: Andrei Mikhailovsky > > Sent: Monday, June 7, 2021 7:00 PM > > To: users > > Subject: Snapshots are not working after upgrading to 4.15.0 > > > > Hello everyone, > > > > I am having an issue with volume snapshots since I've upgraded to > 4.15.0. None > > of the volumes are being snapshotted regardless if the snapshot is > initiated > > manually or from the schedule. The strange thing is that if I manually > take the > > snapshot, the GUI shows Success status, but the Storage>Snapshots show > an Error > > status. Here is what I see in the management server logs: > > > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Done > > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Remove > > job-86143 from job monitoring > > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy > snapshot > > com.cloud.utils.exception.CloudRuntimeException: can not find an image > stores > > at > > > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > > at > > > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > > at > > > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > > at > > > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) > > at > > > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) > > at > > > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) > > at > > > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) > > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) > > at > > > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) > > at > > > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) > > at > > > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) > > at java.base/java.lang.Thread.run(Thread.java:829) > > 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl] > > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of > snapshot > > failed, for snapshot with ID 53531, left with 2 more attempts > > > > > > I've checked and the Secondary storage is configured and visible in the > GUI. I > > can also mount it manually from the management server and a couple of > host > > servers that I've tested. In addition, I can successfully upload an ISO > image > > and that registers just fine and I can create new VMs using the newly > uploaded > > ISO image. > > > > I've had no such problems with 4.13.x ACS, so the issue seems to have > been > > introduced after doing the upgrade to 4.15.0. > > > > Could you please let me know how do I fix the issue? > > > > Cheers > > > > andrei > -- Andrija Panić
Re: Snapshots are not working after upgrading to 4.15.0
Thanks for the suggestions, Harikrishna. I will check it and revert back. Andrei - Original Message - > From: "Harikrishna Patnala" > To: "users" > Sent: Tuesday, 8 June, 2021 03:33:57 > Subject: Re: Snapshots are not working after upgrading to 4.15.0 > Hi Andrei, > > Can you check the following things and let us know? > > > 1. Can you try creating a new volume and then create snapshot of that, to > check > if this an issue with old entries > 2. For the snapshots which are failing can you check if you are seeing any > error messages like this "Can't find an image storage in zone with less > than". > This is to check if secondary storage free space check failed. > 3. For the snapshots which are failing and if it is delta snapshot can you > check if its parent's snapshot entry exists in "snapshot_store_ref" table > with > 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This > is > to find the secondary storage where the parent snapshot backup is located. > > Regards, > Harikrishna > ________ > From: Andrei Mikhailovsky > Sent: Monday, June 7, 2021 7:00 PM > To: users > Subject: Snapshots are not working after upgrading to 4.15.0 > > Hello everyone, > > I am having an issue with volume snapshots since I've upgraded to 4.15.0. None > of the volumes are being snapshotted regardless if the snapshot is initiated > manually or from the schedule. The strange thing is that if I manually take > the > snapshot, the GUI shows Success status, but the Storage>Snapshots show an > Error > status. Here is what I see in the management server logs: > > 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done > executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 > 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] > (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) > Remove > job-86143 from job monitoring > 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot > com.cloud.utils.exception.CloudRuntimeException: can not find an image stores > at > org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) > at > org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) > at > com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) > at > org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) > at > org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) > at > java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) > at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) > at > java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) > at > java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) > at > java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) > at java.base/java.lang.Thread.run(Thread.java:829) > 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl] > (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of snapshot > failed, for snapshot with ID 53531, left with 2 more attempts > > > I've checked and the Secondary storage is configured and visible in the GUI. I > can also mount it manually from the management server and a couple of host > servers that I've tested. In addition, I can successfully upload an ISO image > and that registers just fine and I can create new VMs using the newly uploaded > ISO image. > > I've had no such problems with 4.13.x ACS, so the issue seems to have been > introduced after doing the upgrade to 4.15.0. > > Could you please let me know how do I fix the issue? > > Cheers > > andrei
Re: Snapshots are not working after upgrading to 4.15.0
Hi Andrei, Can you check the following things and let us know? 1. Can you try creating a new volume and then create snapshot of that, to check if this an issue with old entries 2. For the snapshots which are failing can you check if you are seeing any error messages like this "Can't find an image storage in zone with less than". This is to check if secondary storage free space check failed. 3. For the snapshots which are failing and if it is delta snapshot can you check if its parent's snapshot entry exists in "snapshot_store_ref" table with 'parent_snapshot_id' of the current snapshot with 'store_role' "Image". This is to find the secondary storage where the parent snapshot backup is located. Regards, Harikrishna From: Andrei Mikhailovsky Sent: Monday, June 7, 2021 7:00 PM To: users Subject: Snapshots are not working after upgrading to 4.15.0 Hello everyone, I am having an issue with volume snapshots since I've upgraded to 4.15.0. None of the volumes are being snapshotted regardless if the snapshot is initiated manually or from the schedule. The strange thing is that if I manually take the snapshot, the GUI shows Success status, but the Storage>Snapshots show an Error status. Here is what I see in the management server logs: 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Remove job-86143 from job monitoring 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot com.cloud.utils.exception.CloudRuntimeException: can not find an image stores at org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) at org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) at com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) at java.base/java.lang.Thread.run(Thread.java:829) 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl] (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of snapshot failed, for snapshot with ID 53531, left with 2 more attempts I've checked and the Secondary storage is configured and visible in the GUI. I can also mount it manually from the management server and a couple of host servers that I've tested. In addition, I can successfully upload an ISO image and that registers just fine and I can create new VMs using the newly uploaded ISO image. I've had no such problems with 4.13.x ACS, so the issue seems to have been introduced after doing the upgrade to 4.15.0. Could you please let me know how do I fix the issue? Cheers andrei
Snapshots are not working after upgrading to 4.15.0
Hello everyone, I am having an issue with volume snapshots since I've upgraded to 4.15.0. None of the volumes are being snapshotted regardless if the snapshot is initiated manually or from the schedule. The strange thing is that if I manually take the snapshot, the GUI shows Success status, but the Storage>Snapshots show an Error status. Here is what I see in the management server logs: 2021-06-07 13:55:20,022 DEBUG [o.a.c.f.j.i.AsyncJobManagerImpl] (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Done executing com.cloud.vm.VmWorkTakeVolumeSnapshot for job-86143 2021-06-07 13:55:20,024 INFO [o.a.c.f.j.i.AsyncJobMonitor] (Work-Job-Executor-81:ctx-08dd4222 job-86141/job-86143) (logid:be34ce01) Remove job-86143 from job monitoring 2021-06-07 13:55:20,094 DEBUG [o.a.c.s.s.SnapshotServiceImpl] (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Failed to copy snapshot com.cloud.utils.exception.CloudRuntimeException: can not find an image stores at org.apache.cloudstack.storage.snapshot.SnapshotServiceImpl.backupSnapshot(SnapshotServiceImpl.java:271) at org.apache.cloudstack.storage.snapshot.DefaultSnapshotStrategy.backupSnapshot(DefaultSnapshotStrategy.java:171) at com.cloud.storage.snapshot.SnapshotManagerImpl$BackupSnapshotTask.runInContext(SnapshotManagerImpl.java:1238) at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102) at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52) at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45) at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515) at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264) at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:304) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128) at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628) at java.base/java.lang.Thread.run(Thread.java:829) 2021-06-07 13:55:20,152 DEBUG [c.c.s.s.SnapshotManagerImpl] (BackupSnapshotTask-3:ctx-744796da) (logid:607dbb0e) Backing up of snapshot failed, for snapshot with ID 53531, left with 2 more attempts I've checked and the Secondary storage is configured and visible in the GUI. I can also mount it manually from the management server and a couple of host servers that I've tested. In addition, I can successfully upload an ISO image and that registers just fine and I can create new VMs using the newly uploaded ISO image. I've had no such problems with 4.13.x ACS, so the issue seems to have been introduced after doing the upgrade to 4.15.0. Could you please let me know how do I fix the issue? Cheers andrei