Re: [Gluster-users] Slow write times to gluster disk

Ravishankar N Thu, 13 Apr 2017 21:58:22 -0700

I'm not sure if the version you are running (glusterfs 3.7.11) works with NFS-Ganesha, as the link seems to suggest version >=3.8 as a prerequisite. Adding Soumya for help. If it is not supported, then you might have to go the plain gluster NFS way.

Regards,
Ravi

On 04/14/2017 03:48 AM, Pat Haley wrote:

Hi Ravi (and list),
We are planning on testing the NFS route to see what kind of speed-up we get. A little research led us to the following:
https://gluster.readthedocs.io/en/latest/Administrator%20Guide/NFS-Ganesha%20GlusterFS%20Integration/
Is this the correct path to take to mount 2 xfs volumes as a single gluster file system volume? If not, what would be a better path?
Pat



On 04/11/2017 12:21 AM, Ravishankar N wrote:
On 04/11/2017 12:42 AM, Pat Haley wrote:
Hi Ravi,
Thanks for the reply. And yes, we are using the gluster native (fuse) mount. Since this is not my area of expertise I have a few questions (mostly clarifications).
Is a factor of 20 slow-down typical when comparing a fuse-mounted filesystem with an NFS-mounted filesystem, or should we also be looking for additional issues? (Note the first dd test described below was run on the server that hosts the file systems, so no network communication was involved.)
Though both the gluster bricks and the mounts are on the same physical machine in your setup, the I/O still passes through different layers of the kernel/user-space fuse stack, although I don't know if a 20x slow-down on gluster vs. an NFS share is normal. Why don't you try doing a gluster NFS mount on the machine, run the dd test there, and compare it with the gluster fuse mount results?
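
One way the suggested comparison could be set up is sketched below. The server name, volume name and mount point are placeholders (the thread does not give the volume name), and the script only prints the commands rather than running them. Gluster's built-in NFS server speaks NFSv3, hence vers=3. The conv=fdatasync flag is an addition not used in the original tests: it makes dd flush the data before reporting a rate, so the page cache does not inflate the number.

```shell
#!/bin/sh
# Placeholders: adjust to the real setup before use.
SERVER="${SERVER:-localhost}"       # node whose gnfs server is used for the mount
VOLNAME="${VOLNAME:-gdata-vol}"     # hypothetical volume name
MNT="${MNT:-/mnt/gnfs-test}"        # hypothetical mount point

# Build the NFSv3 mount command for a gluster volume exported by gnfs.
gnfs_mount_cmd() {
    echo "mount -t nfs -o vers=3 $1:/$2 $3"
}

# Print the steps: mount via gnfs, then repeat the dd write test on that mount.
echo "mkdir -p $MNT"
gnfs_mount_cmd "$SERVER" "$VOLNAME" "$MNT"
echo "dd if=/dev/zero of=$MNT/zero1 bs=1M count=1000 conv=fdatasync"
```

Running the same dd line against the fuse mount (/gdata) gives the other half of the comparison.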
You also mention tweaking "write-behind xlator settings". Would you expect better speed improvements from switching the mounting from fuse to gnfs or from tweaking the settings? Also, are these mutually exclusive, or would there be additional benefits from both switching to gnfs and tweaking?
You should test these out and find the answers yourself. :-)
My next question is to make sure I'm clear on the comment "if the gluster node containing the gnfs server goes down, all mounts done using that node will fail". If you have 2 servers, each with 1 brick in the overall gluster FS, and one server fails, then for gnfs nothing on either server is visible to other nodes, while under fuse only the files on the dead server are not visible. Is this what you meant?
Yes, for gnfs mounts, all I/O from the various mounts goes to the gnfs server process (on the machine whose IP was used at the time of mounting), which then sends the I/O to the brick processes. For fuse, the gluster fuse mount itself talks directly to the bricks.
Finally, you mention "even for gnfs mounts, you can achieve fail-over by using CTDB". Do you know if CTDB would have any performance impact (i.e. in a worst case scenario, could adding CTDB to gnfs erase the speed benefits of going to gnfs in the first place)?
I don't think it would. You can even achieve load balancing via CTDB to use different gnfs servers for different clients. But I don't know if this is needed/helpful in your current setup, where everything (bricks and clients) seems to be on just one server.
-Ravi
Thanks

Pat


On 04/08/2017 12:58 AM, Ravishankar N wrote:
Hi Pat,
I'm assuming you are using the gluster native (fuse) mount. If it helps, you could try mounting it via gluster NFS (gnfs) and then see if there is an improvement in speed. Fuse mounts are slower than gnfs mounts but you get the benefit of avoiding a single point of failure. (Unlike fuse mounts, if the gluster node containing the gnfs server goes down, all mounts done using that node will fail.) For fuse mounts, you could try tweaking the write-behind xlator settings to see if it helps. See the performance.write-behind and performance.write-behind-window-size options in `gluster volume set help`. Of course, even for gnfs mounts, you can achieve fail-over by using CTDB.
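
A minimal sketch of the write-behind tuning mentioned above, under stated assumptions: the volume name is a placeholder (the thread does not give it), and the 4MB window is only an illustrative value, not a recommendation from the thread. By default the script prints the commands instead of running them; set DRY_RUN=0 on a gluster node to apply them.

```shell
#!/bin/sh
# Placeholder volume name; list the real ones with `gluster volume list`.
VOLNAME="${VOLNAME:-gdata-vol}"
# Illustrative window size; pick a value after testing.
WINDOW="${WINDOW:-4MB}"

# Print the command when DRY_RUN=1 (the default), otherwise execute it.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Make sure the write-behind translator is enabled, then enlarge its window.
run gluster volume set "$VOLNAME" performance.write-behind on
run gluster volume set "$VOLNAME" performance.write-behind-window-size "$WINDOW"

# Show the volume's reconfigured options to confirm the change.
run gluster volume info "$VOLNAME"
```

Re-running the same dd tests after each change shows whether the setting makes a measurable difference.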
Thanks,
Ravi

On 04/08/2017 12:07 AM, Pat Haley wrote:
Hi,
We noticed a dramatic slowness when writing to a gluster disk when compared to writing to an NFS disk. Specifically when using dd (data duplicator) to write a 4.3 GB file of zeros:
  * on NFS disk (/home): 9.5 Gb/s
  * on gluster disk (/gdata): 508 Mb/s
The gluster disk is 2 bricks joined together, no replication or anything else. The hardware is (literally) the same:
  * one server with 70 hard disks and a hardware RAID card.
  * 4 disks in a RAID-6 group (the NFS disk)
  * 32 disks in a RAID-6 group (the max allowed by the card,
    /mnt/brick1)
  * 32 disks in another RAID-6 group (/mnt/brick2)
  * 2 hot spares
Some additional information and more test results (after changing the log level):
glusterfs 3.7.11 built on Apr 27 2016 14:09:22
CentOS release 6.8 (Final)
RAID bus controller: LSI Logic / Symbios Logic MegaRAID SAS-3 3108 [Invader] (rev 02)
*Create the file to /gdata (gluster)*
[root@mseas-data2 gdata]# dd if=/dev/zero of=/gdata/zero1 bs=1M count=1000
1000+0 records in
1000+0 records out
1048576000 bytes (1.0 GB) copied, 1.91876 s, *546 MB/s*

*Create the file to /home (ext4)*
[root@mseas-data2 gdata]# dd if=/dev/zero of=/home/zero1 bs=1M count=1000
1000+0 records in
1000+0 records out
1048576000 bytes (1.0 GB) copied, 0.686021 s, *1.5 GB/s* - 3 times as fast

*Copy from /gdata to /gdata (gluster to gluster)*
[root@mseas-data2 gdata]# dd if=/gdata/zero1 of=/gdata/zero2
2048000+0 records in
2048000+0 records out
1048576000 bytes (1.0 GB) copied, 101.052 s, *10.4 MB/s* - realllyyy slooowww

*Copy from /gdata to /gdata, 2nd time (gluster to gluster)*
[root@mseas-data2 gdata]# dd if=/gdata/zero1 of=/gdata/zero2
2048000+0 records in
2048000+0 records out
1048576000 bytes (1.0 GB) copied, 92.4904 s, *11.3 MB/s* - realllyyy slooowww again

*Copy from /home to /home (ext4 to ext4)*
[root@mseas-data2 gdata]# dd if=/home/zero1 of=/home/zero2
2048000+0 records in
2048000+0 records out
1048576000 bytes (1.0 GB) copied, 3.53263 s, *297 MB/s* - 30 times as fast

*Copy from /home to /home (ext4 to ext4)*
[root@mseas-data2 gdata]# dd if=/home/zero1 of=/home/zero3
2048000+0 records in
2048000+0 records out
1048576000 bytes (1.0 GB) copied, 4.1737 s, *251 MB/s* - 30 times as fast

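
A quick check of the figures above, offered as a sketch rather than a diagnosis: the copy tests were run without a bs= argument, and 2048000 records for 1048576000 bytes works out to dd's default 512-byte block size. Very small requests like these may account for part of the gap on the fuse mount, so repeating the copy with bs=1M would make the copy tests comparable with the bs=1M write tests. The arithmetic, using only numbers from the output above:

```shell
#!/bin/sh
# Figures taken from the dd output above.
records=2048000          # "2048000+0 records in"
default_bs=512           # GNU dd's default block size in bytes

# Records times block size should reproduce the byte count dd reported.
total_bytes=$((records * default_bs))
echo "bytes copied: $total_bytes"

# With bs=1M the same payload would need far fewer read/write calls.
requests_1m=$((total_bytes / (1024 * 1024)))
echo "requests needed with bs=1M: $requests_1m"

# Cross-check the reported rate of the first gluster-to-gluster copy (101.052 s).
rate=$(awk -v b="$total_bytes" -v s=101.052 'BEGIN { printf "%.1f", b / s / 1000000 }')
echo "rate: $rate MB/s"

# Suggested re-run (printed only, not executed here).
echo "dd if=/gdata/zero1 of=/gdata/zero2 bs=1M"
```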
As a test, can we copy data directly to the xfs mountpoint (/mnt/brick1) and bypass gluster?
Any help you could give us would be appreciated.

Thanks

--

-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Pat Haley                          Email:[email protected]
Center for Ocean Engineering       Phone:  (617) 253-6824
Dept. of Mechanical Engineering    Fax:    (617) 253-8125
MIT, Room 5-213                    http://web.mit.edu/phaley/www/
77 Massachusetts Avenue
Cambridge, MA  02139-4301


_______________________________________________
Gluster-users mailing list
[email protected]
http://lists.gluster.org/mailman/listinfo/gluster-users