Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-10-12 Thread Stefan Hajnoczi
On Tue, Oct 11, 2011 at 04:22:11PM +0200, Kevin Wolf wrote:
> On 30.09.2011 17:49, Amit Shah wrote:
> > On (Fri) 30 Sep 2011 [16:23:30], Stefan Hajnoczi wrote:
> >> On Fri, Sep 30, 2011 at 12:27 PM, Amit Shah  wrote:
> >>> On (Fri) 30 Sep 2011 [11:39:11], Stefan Hajnoczi wrote:
> >>>> QED's metadata caching strategy allows two parallel requests to race for
> >>>> metadata lookup.  The first one to complete will populate the metadata
> >>>> cache and the second one will drop the data it just read in favor of the
> >>>> cached data.
> >>>>
> >>>> There is a use-after-free in qed_read_l2_table_cb() and
> >>>> qed_commit_l2_update() where l2_table->offset was used after the
> >>>> l2_table may have been freed due to a metadata lookup race.  Fix this by
> >>>> keeping the l2_offset in a local variable and not reaching into the
> >>>> possibly freed l2_table.
> >>>>
> >>>> Reported-by: Amit Shah
> >>>> Signed-off-by: Stefan Hajnoczi
> >>>> ---
> >>>> Hi Amit,
> >>>> Thanks for reporting the assertion failure you saw at
> >>>> http://fpaste.org/CDuv/.
> >>>> Does this patch fix the problem?
> >>>
> >>> Yes, this fixes it.
> >>
> >> Were you able to reliably reproduce the assertion failure before?
> > 
> > Absolutely.
> > 
> > I even reverted the patch and tried the same image; same segfault
> > again.
> > 
> >> I wonder because this only happens when two metadata lookups race
> >> (which is rare enough on my setup that I've never seen this failure).
> >> It might be worth trying a few times.
> > 
> > Get the F16 beta-rc LXDE live iso, install guest.  It doesn't cleanly
> > reboot, you have to kill the VM.  Next start of the VM produces this
> > segfault.
> > 
> > https://alt.fedoraproject.org/pub/alt/stage/16-Beta.RC2/Live/x86_64/Fedora-16-Beta-x86_64-Live-LXDE.iso
> 
> Can we try to artificially produce it in a qemu-iotests case?

I will take a look.

Stefan



Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-10-11 Thread Kevin Wolf
On 30.09.2011 17:49, Amit Shah wrote:
> On (Fri) 30 Sep 2011 [16:23:30], Stefan Hajnoczi wrote:
>> On Fri, Sep 30, 2011 at 12:27 PM, Amit Shah  wrote:
>>> On (Fri) 30 Sep 2011 [11:39:11], Stefan Hajnoczi wrote:
>>>> QED's metadata caching strategy allows two parallel requests to race for
>>>> metadata lookup.  The first one to complete will populate the metadata
>>>> cache and the second one will drop the data it just read in favor of the
>>>> cached data.
>>>>
>>>> There is a use-after-free in qed_read_l2_table_cb() and
>>>> qed_commit_l2_update() where l2_table->offset was used after the
>>>> l2_table may have been freed due to a metadata lookup race.  Fix this by
>>>> keeping the l2_offset in a local variable and not reaching into the
>>>> possibly freed l2_table.
>>>>
>>>> Reported-by: Amit Shah
>>>> Signed-off-by: Stefan Hajnoczi
>>>> ---
>>>> Hi Amit,
>>>> Thanks for reporting the assertion failure you saw at
>>>> http://fpaste.org/CDuv/.
>>>> Does this patch fix the problem?
>>>
>>> Yes, this fixes it.
>>
>> Were you able to reliably reproduce the assertion failure before?
> 
> Absolutely.
> 
> I even reverted the patch and tried the same image; same segfault
> again.
> 
>> I wonder because this only happens when two metadata lookups race
>> (which is rare enough on my setup that I've never seen this failure).
>> It might be worth trying a few times.
> 
> Get the F16 beta-rc LXDE live iso, install guest.  It doesn't cleanly
> reboot, you have to kill the VM.  Next start of the VM produces this
> segfault.
> 
> https://alt.fedoraproject.org/pub/alt/stage/16-Beta.RC2/Live/x86_64/Fedora-16-Beta-x86_64-Live-LXDE.iso

Can we try to artificially produce it in a qemu-iotests case?

Kevin



Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-10-05 Thread Anthony Liguori

On 09/30/2011 05:39 AM, Stefan Hajnoczi wrote:

> QED's metadata caching strategy allows two parallel requests to race for
> metadata lookup.  The first one to complete will populate the metadata
> cache and the second one will drop the data it just read in favor of the
> cached data.
>
> There is a use-after-free in qed_read_l2_table_cb() and
> qed_commit_l2_update() where l2_table->offset was used after the
> l2_table may have been freed due to a metadata lookup race.  Fix this by
> keeping the l2_offset in a local variable and not reaching into the
> possibly freed l2_table.
>
> Reported-by: Amit Shah
> Signed-off-by: Stefan Hajnoczi


Applied.  Thanks.

Regards,

Anthony Liguori


> ---
> Hi Amit,
> Thanks for reporting the assertion failure you saw at http://fpaste.org/CDuv/.
> Does this patch fix the problem?
>
> If not, please send details on your setup and how to reproduce the issue.
>
> Thanks,
> Stefan
>
>  block/qed-table.c |    6 +++---
>  block/qed.c       |    4 ++--
>  2 files changed, 5 insertions(+), 5 deletions(-)
>
> diff --git a/block/qed-table.c b/block/qed-table.c
> index d96afa8..f31f9ff 100644
> --- a/block/qed-table.c
> +++ b/block/qed-table.c
> @@ -222,21 +222,21 @@ static void qed_read_l2_table_cb(void *opaque, int ret)
>      QEDRequest *request = read_l2_table_cb->request;
>      BDRVQEDState *s = read_l2_table_cb->s;
>      CachedL2Table *l2_table = request->l2_table;
> +    uint64_t l2_offset = read_l2_table_cb->l2_offset;
>
>      if (ret) {
>          /* can't trust loaded L2 table anymore */
>          qed_unref_l2_cache_entry(l2_table);
>          request->l2_table = NULL;
>      } else {
> -        l2_table->offset = read_l2_table_cb->l2_offset;
> +        l2_table->offset = l2_offset;
>
>          qed_commit_l2_cache_entry(&s->l2_cache, l2_table);
>
>          /* This is guaranteed to succeed because we just committed the entry
>           * to the cache.
>           */
> -        request->l2_table = qed_find_l2_cache_entry(&s->l2_cache,
> -                                                    l2_table->offset);
> +        request->l2_table = qed_find_l2_cache_entry(&s->l2_cache, l2_offset);
>          assert(request->l2_table != NULL);
>      }
>
> diff --git a/block/qed.c b/block/qed.c
> index 624e261..e87dc4d 100644
> --- a/block/qed.c
> +++ b/block/qed.c
> @@ -911,14 +911,14 @@ static void qed_commit_l2_update(void *opaque, int ret)
>      QEDAIOCB *acb = opaque;
>      BDRVQEDState *s = acb_to_s(acb);
>      CachedL2Table *l2_table = acb->request.l2_table;
> +    uint64_t l2_offset = l2_table->offset;
>
>      qed_commit_l2_cache_entry(&s->l2_cache, l2_table);
>
>      /* This is guaranteed to succeed because we just committed the entry to the
>       * cache.
>       */
> -    acb->request.l2_table = qed_find_l2_cache_entry(&s->l2_cache,
> -                                                    l2_table->offset);
> +    acb->request.l2_table = qed_find_l2_cache_entry(&s->l2_cache, l2_offset);
>      assert(acb->request.l2_table != NULL);
>
>      qed_aio_next_io(opaque, ret);





Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-10-03 Thread Anthony Liguori

On 09/30/2011 10:26 AM, Stefan Hajnoczi wrote:

> On Fri, Sep 30, 2011 at 11:39 AM, Stefan Hajnoczi wrote:
>> QED's metadata caching strategy allows two parallel requests to race for
>> metadata lookup.  The first one to complete will populate the metadata
>> cache and the second one will drop the data it just read in favor of the
>> cached data.
>>
>> There is a use-after-free in qed_read_l2_table_cb() and
>> qed_commit_l2_update() where l2_table->offset was used after the
>> l2_table may have been freed due to a metadata lookup race.  Fix this by
>> keeping the l2_offset in a local variable and not reaching into the
>> possibly freed l2_table.
>>
>> Reported-by: Amit Shah
>> Signed-off-by: Stefan Hajnoczi
>
> Please consider this for -stable.  It's a pretty small/simple fix and
> can prevent an assertion failure.

Justin is looking to cut a stable release this week.  Since Kevin is out on
holiday, unless anyone objects, I'll commit this so that it can make the next
stable release.

Regards,

Anthony Liguori

> Stefan







Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-09-30 Thread Amit Shah
On (Fri) 30 Sep 2011 [16:23:30], Stefan Hajnoczi wrote:
> On Fri, Sep 30, 2011 at 12:27 PM, Amit Shah  wrote:
> > On (Fri) 30 Sep 2011 [11:39:11], Stefan Hajnoczi wrote:
> >> QED's metadata caching strategy allows two parallel requests to race for
> >> metadata lookup.  The first one to complete will populate the metadata
> >> cache and the second one will drop the data it just read in favor of the
> >> cached data.
> >>
> >> There is a use-after-free in qed_read_l2_table_cb() and
> >> qed_commit_l2_update() where l2_table->offset was used after the
> >> l2_table may have been freed due to a metadata lookup race.  Fix this by
> >> keeping the l2_offset in a local variable and not reaching into the
> >> possibly freed l2_table.
> >>
> >> Reported-by: Amit Shah 
> >> Signed-off-by: Stefan Hajnoczi 
> >> ---
> >> Hi Amit,
> >> Thanks for reporting the assertion failure you saw at 
> >> http://fpaste.org/CDuv/.
> >> Does this patch fix the problem?
> >
> > Yes, this fixes it.
> 
> Were you able to reliably reproduce the assertion failure before?

Absolutely.

I even reverted the patch and tried the same image; same segfault
again.

> I wonder because this only happens when two metadata lookups race
> (which is rare enough on my setup that I've never seen this failure).
> It might be worth trying a few times.

Get the F16 beta-rc LXDE live iso, install guest.  It doesn't cleanly
reboot, you have to kill the VM.  Next start of the VM produces this
segfault.

https://alt.fedoraproject.org/pub/alt/stage/16-Beta.RC2/Live/x86_64/Fedora-16-Beta-x86_64-Live-LXDE.iso


Amit



Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-09-30 Thread Stefan Hajnoczi
On Fri, Sep 30, 2011 at 11:39 AM, Stefan Hajnoczi wrote:
> QED's metadata caching strategy allows two parallel requests to race for
> metadata lookup.  The first one to complete will populate the metadata
> cache and the second one will drop the data it just read in favor of the
> cached data.
>
> There is a use-after-free in qed_read_l2_table_cb() and
> qed_commit_l2_update() where l2_table->offset was used after the
> l2_table may have been freed due to a metadata lookup race.  Fix this by
> keeping the l2_offset in a local variable and not reaching into the
> possibly freed l2_table.
>
> Reported-by: Amit Shah 
> Signed-off-by: Stefan Hajnoczi 

Please consider this for -stable.  It's a pretty small/simple fix and
can prevent an assertion failure.

Stefan



Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-09-30 Thread Stefan Hajnoczi
On Fri, Sep 30, 2011 at 12:27 PM, Amit Shah  wrote:
> On (Fri) 30 Sep 2011 [11:39:11], Stefan Hajnoczi wrote:
>> QED's metadata caching strategy allows two parallel requests to race for
>> metadata lookup.  The first one to complete will populate the metadata
>> cache and the second one will drop the data it just read in favor of the
>> cached data.
>>
>> There is a use-after-free in qed_read_l2_table_cb() and
>> qed_commit_l2_update() where l2_table->offset was used after the
>> l2_table may have been freed due to a metadata lookup race.  Fix this by
>> keeping the l2_offset in a local variable and not reaching into the
>> possibly freed l2_table.
>>
>> Reported-by: Amit Shah 
>> Signed-off-by: Stefan Hajnoczi 
>> ---
>> Hi Amit,
>> Thanks for reporting the assertion failure you saw at 
>> http://fpaste.org/CDuv/.
>> Does this patch fix the problem?
>
> Yes, this fixes it.

Were you able to reliably reproduce the assertion failure before?

I wonder because this only happens when two metadata lookups race
(which is rare enough on my setup that I've never seen this failure).
It might be worth trying a few times.

Stefan



Re: [Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-09-30 Thread Amit Shah
On (Fri) 30 Sep 2011 [11:39:11], Stefan Hajnoczi wrote:
> QED's metadata caching strategy allows two parallel requests to race for
> metadata lookup.  The first one to complete will populate the metadata
> cache and the second one will drop the data it just read in favor of the
> cached data.
> 
> There is a use-after-free in qed_read_l2_table_cb() and
> qed_commit_l2_update() where l2_table->offset was used after the
> l2_table may have been freed due to a metadata lookup race.  Fix this by
> keeping the l2_offset in a local variable and not reaching into the
> possibly freed l2_table.
> 
> Reported-by: Amit Shah 
> Signed-off-by: Stefan Hajnoczi 
> ---
> Hi Amit,
> Thanks for reporting the assertion failure you saw at http://fpaste.org/CDuv/.
> Does this patch fix the problem?

Yes, this fixes it.

Thanks,

Amit



[Qemu-devel] [PATCH] qed: fix use-after-free during l2 cache commit

2011-09-30 Thread Stefan Hajnoczi
QED's metadata caching strategy allows two parallel requests to race for
metadata lookup.  The first one to complete will populate the metadata
cache and the second one will drop the data it just read in favor of the
cached data.

There is a use-after-free in qed_read_l2_table_cb() and
qed_commit_l2_update() where l2_table->offset was used after the
l2_table may have been freed due to a metadata lookup race.  Fix this by
keeping the l2_offset in a local variable and not reaching into the
possibly freed l2_table.

Reported-by: Amit Shah 
Signed-off-by: Stefan Hajnoczi 
---
Hi Amit,
Thanks for reporting the assertion failure you saw at http://fpaste.org/CDuv/.
Does this patch fix the problem?

If not, please send details on your setup and how to reproduce the issue.

Thanks,
Stefan

 block/qed-table.c |    6 +++---
 block/qed.c       |    4 ++--
 2 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/block/qed-table.c b/block/qed-table.c
index d96afa8..f31f9ff 100644
--- a/block/qed-table.c
+++ b/block/qed-table.c
@@ -222,21 +222,21 @@ static void qed_read_l2_table_cb(void *opaque, int ret)
     QEDRequest *request = read_l2_table_cb->request;
     BDRVQEDState *s = read_l2_table_cb->s;
     CachedL2Table *l2_table = request->l2_table;
+    uint64_t l2_offset = read_l2_table_cb->l2_offset;
 
     if (ret) {
         /* can't trust loaded L2 table anymore */
         qed_unref_l2_cache_entry(l2_table);
         request->l2_table = NULL;
     } else {
-        l2_table->offset = read_l2_table_cb->l2_offset;
+        l2_table->offset = l2_offset;
 
         qed_commit_l2_cache_entry(&s->l2_cache, l2_table);
 
         /* This is guaranteed to succeed because we just committed the entry
          * to the cache.
          */
-        request->l2_table = qed_find_l2_cache_entry(&s->l2_cache,
-                                                    l2_table->offset);
+        request->l2_table = qed_find_l2_cache_entry(&s->l2_cache, l2_offset);
         assert(request->l2_table != NULL);
     }
 
diff --git a/block/qed.c b/block/qed.c
index 624e261..e87dc4d 100644
--- a/block/qed.c
+++ b/block/qed.c
@@ -911,14 +911,14 @@ static void qed_commit_l2_update(void *opaque, int ret)
     QEDAIOCB *acb = opaque;
     BDRVQEDState *s = acb_to_s(acb);
     CachedL2Table *l2_table = acb->request.l2_table;
+    uint64_t l2_offset = l2_table->offset;
 
     qed_commit_l2_cache_entry(&s->l2_cache, l2_table);
 
     /* This is guaranteed to succeed because we just committed the entry to the
      * cache.
      */
-    acb->request.l2_table = qed_find_l2_cache_entry(&s->l2_cache,
-                                                    l2_table->offset);
+    acb->request.l2_table = qed_find_l2_cache_entry(&s->l2_cache, l2_offset);
     assert(acb->request.l2_table != NULL);
 
     qed_aio_next_io(opaque, ret);
-- 
1.7.6.3
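
For readers without the QEMU tree at hand, here is a minimal sketch of the
pattern the patch applies. It is not QEMU code: ToyEntry, ToyCache, toy_alloc,
toy_unref, toy_find, toy_commit and toy_commit_and_refind are hypothetical
stand-ins. The commit behaviour is an assumption modelled on the commit
message above: if an entry with the same offset is already cached, the newly
read entry is dropped and may be freed. The point is only that the lookup key
is copied into a local before the commit, so the entry is never dereferenced
afterwards.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical, simplified stand-in for a cached L2 table. */
typedef struct ToyEntry {
    uint64_t offset;          /* lookup key */
    int ref;                  /* reference count */
    struct ToyEntry *next;    /* singly linked cache list */
} ToyEntry;

/* Hypothetical, simplified stand-in for the L2 cache. */
typedef struct {
    ToyEntry *head;
} ToyCache;

/* Allocate an entry holding one reference owned by the caller. */
ToyEntry *toy_alloc(uint64_t offset)
{
    ToyEntry *e = calloc(1, sizeof(*e));
    assert(e != NULL);
    e->offset = offset;
    e->ref = 1;
    return e;
}

/* Drop one reference; free the entry when the count reaches zero. */
void toy_unref(ToyEntry *e)
{
    if (e && --e->ref == 0) {
        free(e);
    }
}

/* Return a new reference to the cached entry for offset, or NULL. */
ToyEntry *toy_find(ToyCache *c, uint64_t offset)
{
    for (ToyEntry *e = c->head; e; e = e->next) {
        if (e->offset == offset) {
            e->ref++;
            return e;
        }
    }
    return NULL;
}

/*
 * Hand the caller's reference to the cache.  Assumed behaviour: if the
 * offset is already cached, this entry lost the race and is dropped,
 * which frees it when the caller held the only reference.
 */
void toy_commit(ToyCache *c, ToyEntry *e)
{
    ToyEntry *existing = toy_find(c, e->offset);
    if (existing) {
        toy_unref(existing);  /* release the lookup reference */
        toy_unref(e);         /* loser of the race: may be freed here */
        return;
    }
    e->next = c->head;
    c->head = e;              /* the cache now owns the caller's reference */
}

/* The pattern from the patch: save the key first, then commit, then find. */
ToyEntry *toy_commit_and_refind(ToyCache *c, ToyEntry *e)
{
    uint64_t offset = e->offset;  /* copy before e can be freed */

    toy_commit(c, e);
    /* e must not be dereferenced from here on */
    return toy_find(c, offset);
}
```

Committing two separately allocated entries with the same offset simulates
the lookup race: the second commit drops its entry, and the re-find through
the saved offset returns the first entry instead of touching freed memory.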