On 27.07.2020 19:48, Dr. David Alan Gilbert wrote:
* Denis Plotnikov (dplotni...@virtuozzo.com) wrote:
...
+static void page_fault_thread_stop(void)
+{
+    if (page_fault_fd) {
+        close(page_fault_fd);
+        page_fault_fd = 0;
+    }
I think you need to do that after you've done the quit and join,
otherwise the fault thread might still be reading this.

Yes, it seems so; I'll fix that.

+    if (thread_quit_fd) {
+        uint64_t val = 1;
+        int ret;
+
+        ret = write(thread_quit_fd, &val, sizeof(val));
+        assert(ret == sizeof(val));
+
+        qemu_thread_join(&page_fault_thread);
+        close(thread_quit_fd);
+        thread_quit_fd = 0;
+    }
+}
...
  /**
   * ram_find_and_save_block: finds a dirty page and sends it to f
   *
@@ -1782,6 +2274,7 @@ static int ram_find_and_save_block(RAMState *rs, bool last_stage)
      pss.block = rs->last_seen_block;
      pss.page = rs->last_page;
      pss.complete_round = false;
+    pss.page_copy = NULL;
if (!pss.block) {
          pss.block = QLIST_FIRST_RCU(&ram_list.blocks);
@@ -1794,11 +2287,30 @@ static int ram_find_and_save_block(RAMState *rs, bool last_stage)
          if (!found) {
              /* priority queue empty, so just search for something dirty */
              found = find_dirty_block(rs, &pss, &again);
+
+            if (found && migrate_background_snapshot()) {
+                /*
+                 * make a copy of the page and
+                 * pass it to the page search status
+                 */
+                int ret;
+                ret = ram_copy_page(pss.block, pss.page, &pss.page_copy);
I'm a bit confused about why we hit this; the way I'd thought about your
code was we turn on the write faulting, do one big save and then fixup
the faults as the save is happening (doing the copies) as the writes
hit; so when does this case hit?

To make it more clear, let me draw the whole picture:

When we do a background snapshot, the vm is paused until all vmstate EXCEPT RAM has been saved. No RAM is written at this point. That vmstate part is saved into a temporary buffer.

Then all the RAM is marked read-only and the vm is un-paused. Note that at this moment all the vm's vCPUs are running and can touch any part of memory.

After that, the migration thread starts writing the RAM content. Once a memory chunk has been written, write protection is removed from that chunk. If a vCPU wants to write to a memory page which is still write-protected (i.e. hasn't been written yet), the write is intercepted: the memory page is copied and queued for writing, and then write access to the page is restored. The intention behind this is to allow the vCPU to resume working with the memory page as soon as possible.

Once all the RAM has been written, the rest of the vmstate is written from the buffer. It has to be this way because some of the emulated devices saved in that buffered vmstate part expect the RAM content to be available first when they are loaded.

I hope this description makes things clearer.
If not, please let me know and I will add more details.

Denis

--
Dr. David Alan Gilbert / dgilb...@redhat.com / Manchester, UK


