Re: BTRFS concerns

2011-06-06 Thread Bryn M. Reeves
On 06/02/2011 08:28 PM, Richard W.M. Jones wrote:
 Maybe I'm not understanding your question correctly, but a filesystem
 is more general than LVM.  You can create directories corresponding to
 your current VGs and files for your LVs, with the advantage that you
 can nest directories which you can't do with LVM VGs.

If you mean nesting in the sense that an LVM2 LV is used as a PV then the tools
support this just fine:

# lvcreate -n nested_pv1_lv -L5G bmr_vg1
  Logical volume nested_pv1_lv created
# pvcreate /dev/bmr_vg1/nested_pv1_lv
  Physical volume /dev/bmr_vg1/nested_pv1_lv successfully created
# vgcreate bmr_nested_vg0 /dev/bmr_vg1/nested_pv1_lv
  Volume group bmr_nested_vg0 successfully created
# lvcreate -n nested_pv2_lv -L5G bmr_nested_vg0
  Insufficient free extents (1279) in volume group bmr_nested_vg0: 1280 required
# lvcreate -n nested_pv2_lv -l 1279 bmr_nested_vg0
  Logical volume nested_pv2_lv created
# pvcreate /dev/bmr_nested_vg0/nested_pv2_lv
  Physical volume /dev/bmr_nested_vg0/nested_pv2_lv successfully created
# vgcreate bmr_nested_vg1 /dev/bmr_nested_vg0/nested_pv2_lv
  Volume group bmr_nested_vg1 successfully created
# lvcreate -n l0 -l 1278 bmr_nested_vg1
  Logical volume l0 created

The 1-extent-per-nesting-level size decrement is to allow for the MDA allocated
on each PV. The pvcreates are also strictly speaking unnecessary on recent tool
versions since a vgcreate will now label a new device automatically but I'm
old-fashioned ;-)

You can view the device nesting via dmsetup:

# dmsetup ls --tree bmr_nested_vg1-l0
[...]
bmr_nested_vg1-l0 (253:10)
 └─bmr_nested_vg0-nested_pv2_lv (253:9)
└─bmr_vg1-nested_pv1_lv (253:8)
   └─ (9:1)
[...]

# vgs
  VG #PV #LV #SN Attr   VSize   VFree
  bmr_nested_vg0   1   1   0 wz--n-   5.00g 0
  bmr_nested_vg1   1   1   0 wz--n-   4.99g 0
  bmr_vg0  1   4   0 wz--n- 222.82g 25.68g
  bmr_vg1  1   3   0 wz--n-  19.53g  6.62g

If things aren't activating properly on boot/reboot then it's likely an
initscripts limitation or bug. Udev based activation should simplify that but
you can also work around it depending on your distro via e.g. rc.local.

Regards,
Bryn.
-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel

BTRFS concerns (was: Re: Plan for tomorrow's FESCo meeting (2011-06-01))

2011-06-02 Thread Richard W.M. Jones
On Wed, Jun 01, 2011 at 04:15:59PM +0100, Peter Robinson wrote:
 I will be unable to attend tomorrow but I have concerns of making btrfs
 default without a well tested fsck. I'm aware one is due soon but I don't
 believe 3-4 months is enough time to test it well enough. On 2.6.38.x I
 still get regular kernel abrt crashes on resume. Is it even marked stable in
 the upstream kernel yet?

Another concern is whether btrfs is going to work well to store
virtual machine disk images (ie. to replace LVM for that purpose,
where LVM is known to work very efficiently).

Last time I looked -- which I admit was a really long time ago -- it
behaved fairly pathologically with these huge monolithic files that
are rewritten in-place.

(Edit: just noticed this bug:
https://bugzilla.redhat.com/show_bug.cgi?id=689127 )

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
virt-top is 'top' for virtual machines.  Tiny program with many
powerful monitoring features, net stats, disk stats, logging, etc.
http://et.redhat.com/~rjones/virt-top
-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel


Re: BTRFS concerns (was: Re: Plan for tomorrow's FESCo meeting (2011-06-01))

2011-06-02 Thread Josef Bacik
On Thu, Jun 2, 2011 at 11:46 AM, Richard W.M. Jones rjo...@redhat.com wrote:
 On Wed, Jun 01, 2011 at 04:15:59PM +0100, Peter Robinson wrote:
 I will be unable to attend tomorrow but I have concerns of making btrfs
 default without a well tested fsck. I'm aware one is due soon but I don't
 believe 3-4 months is enough time to test it well enough. On 2.6.38.x I
 still get regular kernel abrt crashes on resume. Is it even marked stable in
 the upstream kernel yet?

 Another concern is whether btrfs is going to work well to store
 virtual machine disk images (ie. to replace LVM for that purpose,
 where LVM is known to work very efficiently).

 Last time I looked -- which I admit was a really long time ago -- it
 behaved fairly pathologically with these huge monolithic files that
 are rewritten in-place.

 (Edit: just noticed this bug:
 https://bugzilla.redhat.com/show_bug.cgi?id=689127 )


These sort of issues are my priority and I've spent the last 2 months
specifically working on the kvm performance differences between ext4
and btrfs.  Now we're not on par with ext4 yet, but we aren't 2-3
times slower any more, maybe at the most we're 20% slower.  Thanks,

Josef
-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel


Re: BTRFS concerns

2011-06-02 Thread Genes MailLists
On 06/02/2011 01:14 PM, Josef Bacik wrote:


 
 These sort of issues are my priority and I've spent the last 2 months
 specifically working on the kvm performance differences between ext4
 and btrfs.  Now we're not on par with ext4 yet, but we aren't 2-3
 times slower any more, maybe at the most we're 20% slower.  Thanks,
 
 Josef


  Wondering - Are all the pieces (user space included) in place for RAID
5 yet ?
-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel


Re: BTRFS concerns (was: Re: Plan for tomorrow's FESCo meeting (2011-06-01))

2011-06-02 Thread Richard W.M. Jones
On Thu, Jun 02, 2011 at 12:44:46PM -0500, Chris Adams wrote:
 Once upon a time, Josef Bacik jo...@toxicpanda.com said:
  These sort of issues are my priority and I've spent the last 2 months
  specifically working on the kvm performance differences between ext4
  and btrfs.  Now we're not on par with ext4 yet, but we aren't 2-3
  times slower any more, maybe at the most we're 20% slower.  Thanks,
 
 How does it compare to straight LVM for virtual images?  I create a big
 LV and then only use part of it for the host OS VG; when I create VMs, I
 create a VG for each (or I can snapshot an existing base VG).
 
 It is my understanding that one goal for btrfs is to take LVM out of the
 picture for the common case; i.e. btrfs can do its own logical volume
 management.  If that's the case, there needs to be something comparable
 to the VM-on-VG setup (in terms of ease-of-management and performance).

Maybe I'm not understanding your question correctly, but a filesystem
is more general than LVM.  You can create directories corresponding to
your current VGs and files for your LVs, with the advantage that you
can nest directories which you can't do with LVM VGs.

However the performance issue will be critical -- even 5% slower
really matters for VMs.  But I hope btrfs can close this gap because
the filesystem design is really nice.

Rich.

-- 
Richard Jones, Virtualization Group, Red Hat http://people.redhat.com/~rjones
Read my programming blog: http://rwmj.wordpress.com
Fedora now supports 80 OCaml packages (the OPEN alternative to F#)
http://cocan.org/getting_started_with_ocaml_on_red_hat_and_fedora
-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel


Re: BTRFS concerns (was: Re: Plan for tomorrow's FESCo meeting (2011-06-01))

2011-06-02 Thread Chris Adams
Once upon a time, Richard W.M. Jones rjo...@redhat.com said:
 Maybe I'm not understanding your question correctly, but a filesystem
 is more general than LVM.  You can create directories corresponding to
 your current VGs and files for your LVs, with the advantage that you
 can nest directories which you can't do with LVM VGs.
 
 However the performance issue will be critical -- even 5% slower
 really matters for VMs.  But I hope btrfs can close this gap because
 the filesystem design is really nice.

That was really my original point (that I didn't really state clearly I
guess); btrfs performance with VM disk images should be compared against
LVM VGs as well against ext4.

-- 
Chris Adams cmad...@hiwaay.net
Systems and Network Administrator - HiWAAY Internet Services
I don't speak for anybody but myself - that's enough trouble.
-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel