Re: [openstack-dev] [Ceph] Why performance of benchmarks with small blocks is extremely small?
On Sat, Sep 27, 2014 at 8:14 AM, Timur Nurlygayanov wrote:
> Hello all,
>
> I installed OpenStack with Glance + Ceph OSD with replication factor 2 and
> now I can see the write operations are extremely slow.
> For example, I can see only 0.04 MB/s write speed when I run rados bench
> with 512b blocks:
>
> rados bench -p test 60 write --no-cleanup -t 1 -b 512
>
> [benchmark output, ceph.conf/cinder.conf tuning, mount options, and sysctl
> settings trimmed; see the original message below]
>
> Do we have other ways to significantly improve CEPH storage performance?
> Any feedback and comments are welcome!

This is entirely latency dominated, and OpenStack configuration changes aren't
going to be able to do much: you're getting 80 sequential ops a second out of
a system that has to do two round trips over a network and hit two hard drives
on every operation. You might want to spend some time looking at how latency,
bandwidth, and concurrency are (often not) related. :)
-Greg
Software Engineer #42 @ http://inktank.com | http://ceph.com

___
OpenStack-dev mailing list
OpenStack-dev@lists.openstack.org
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
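[Editor's note] Greg's 80-ops-a-second figure follows directly from the
single-op latencies in the posted bench output. A minimal sanity-check sketch,
using the "avg lat" value from the sec-1 row of the original run (the
variable names here are illustrative, not from any tool):

```python
# With one outstanding write (-t 1), throughput is simply 1 / per-op latency
# (Little's law with concurrency = 1). Numbers come from the posted
# rados bench run.
avg_latency_s = 0.0120985   # "avg lat" column at sec 1
object_bytes = 512

ops_per_sec = 1 / avg_latency_s                        # ~83 sequential ops/s
mb_per_sec = ops_per_sec * object_bytes / (1024 * 1024)

print(f"{ops_per_sec:.0f} ops/s -> {mb_per_sec:.4f} MB/s")
# -> 83 ops/s -> 0.0404 MB/s
```

This lands right on the ~0.04 MB/s the benchmark reported, which is why no
amount of OSD or TCP tuning moves the needle much: the bottleneck is
per-operation round-trip time, not disk or network bandwidth.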
Re: [openstack-dev] [Ceph] Why performance of benchmarks with small blocks is extremely small?
I also have limited experience with Ceph and rados bench, but it looks like
you're setting the number of worker threads to only 1 (-t 1)? I think the
default is 16, and most distributed storage systems designed for concurrency
are going to do a bit better if you exercise more concurrent workers, so you
might try turning that up until you see diminishing returns. Be sure to watch
for resource contention on the load-generating server.

-Clay

On Mon, Sep 29, 2014 at 4:49 AM, Pasquale Porreca <
pasquale.porr...@dektech.com.au> wrote:
> Hello
>
> I have no experience with Ceph and this specific benchmark tool, but I do
> have experience with several other performance benchmark tools and file
> systems, and results are always very low when the file size is too small
> (i.e. < 1 MB).
>
> I saw that the default object size for rados is 4 MB, did you try your
> test without the option "-b 512"? I think the results should differ by
> several orders of magnitude.
>
> [remainder of quoted message and quoted original post trimmed; see below]
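[Editor's note] Clay's "turn -t up until diminishing returns" can be sketched
numerically. The ~12 ms per-op round trip is taken from the posted run; the
300-IOPS server-side ceiling is a made-up illustrative figure for a 2-OSD HDD
cluster, not a measured limit:

```python
# Why raising -t helps only up to a point: in-flight ops overlap their
# round trips, so IOPS grow with concurrency until the backend saturates.
PER_OP_LATENCY_S = 0.012      # from the posted rados bench run
SERVER_IOPS_CEILING = 300.0   # hypothetical backend limit, for illustration

def expected_iops(concurrency: int) -> float:
    # Little's law: in-flight ops / latency, capped by the backend ceiling.
    return min(concurrency / PER_OP_LATENCY_S, SERVER_IOPS_CEILING)

for t in (1, 2, 4, 8, 16):
    print(f"-t {t:2d}: ~{expected_iops(t):.0f} ops/s")
```

With these (assumed) numbers, scaling is linear from -t 1 to -t 2 and flattens
by -t 4, which is the "diminishing returns" point to look for; on real
hardware the ceiling has to be found empirically.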
Re: [openstack-dev] [Ceph] Why performance of benchmarks with small blocks is extremely small?
Hello

I have no experience with Ceph and this specific benchmark tool, but I do have
experience with several other performance benchmark tools and file systems,
and I can say results are always very low when the file size is too small
(i.e. < 1 MB).

My suspicion is that benchmark tools are not reliable at file sizes this
small, since the time to write is so short that the overhead introduced by the
test itself is not at all negligible.

I saw that the default object size for rados is 4 MB; did you try your test
without the option "-b 512"? I think the results should differ by several
orders of magnitude.

BR

On 09/27/14 17:14, Timur Nurlygayanov wrote:
> Hello all,
>
> I installed OpenStack with Glance + Ceph OSD with replication factor 2 and
> now I can see the write operations are extremely slow.
>
> [remainder of quoted message trimmed; see the original message below]

--
Pasquale Porreca

DEK Technologies
Via dei Castelli Romani, 22
00040 Pomezia (Roma)

Mobile +39 3394823805
Skype paskporr
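[Editor's note] Pasquale's object-size point is easy to quantify. A
back-of-envelope sketch: with a fixed per-op round trip, reported bandwidth
scales with object size until the link (or the disks) saturates. The 12 ms
latency comes from the posted run; the ~110 MB/s cap is the usual 1 GbE
ceiling and is an assumption here, as is ignoring the extra transfer time a
4 MiB write itself adds:

```python
# Bandwidth reported by a single-threaded bench as a function of object size,
# assuming a fixed ~12 ms round trip per op and a 1 GbE link cap.
PER_OP_LATENCY_S = 0.012   # from the posted rados bench run
LINK_CAP_MB_S = 110.0      # assumed 1 Gb/s Ethernet ceiling

def bench_mb_per_sec(object_bytes: int) -> float:
    per_op_mb = object_bytes / (1024 * 1024)
    return min(per_op_mb / PER_OP_LATENCY_S, LINK_CAP_MB_S)

print(f"512 B objects: {bench_mb_per_sec(512):.3f} MB/s")
print(f"4 MiB objects: {bench_mb_per_sec(4 * 1024 * 1024):.1f} MB/s")
```

So moving from 512 B to the default 4 MiB objects plausibly changes the
headline MB/s figure by three to four orders of magnitude, even though the
cluster is doing roughly the same number of operations per second.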
[openstack-dev] [Ceph] Why performance of benchmarks with small blocks is extremely small?
Hello all,

I installed OpenStack with Glance + Ceph OSD with replication factor 2 and now
I can see the write operations are extremely slow.
For example, I can see only 0.04 MB/s write speed when I run rados bench with
512b blocks:

rados bench -p test 60 write --no-cleanup -t 1 -b 512

Maintaining 1 concurrent writes of 512 bytes for up to 60 seconds or 0 objects
Object prefix: benchmark_data_node-17.domain.tld_15862
  sec Cur ops   started  finished  avg MB/s  cur MB/s  last lat   avg lat
    0       0         0         0         0         0         -         0
    1       1        83        82 0.0400341 0.0400391  0.008465 0.0120985
    2       1       169       168 0.0410111 0.0419922  0.080433 0.0118995
    3       1       240       239 0.0388959  0.034668  0.008052 0.0125385
    4       1       356       355 0.0433309 0.0566406   0.00837 0.0112662
    5       1       472       471 0.0459919 0.0566406  0.008343 0.0106034
    6       1       550       549 0.0446735 0.0380859  0.036639 0.0108791
    7       1       581       580 0.0404538 0.0151367  0.008614 0.0120654

My test environment configuration:
Hardware servers with 1Gb network interfaces, 64Gb RAM and 16 CPU cores per
node, HDDs WDC WD5003ABYX-01WERA0.
OpenStack with 1 controller, 1 compute and 2 ceph nodes (ceph on separate
nodes).
CentOS 6.5, kernel 2.6.32-431.el6.x86_64.

I tested several config options for optimizations, like in
/etc/ceph/ceph.conf:

[default]
...
osd_pool_default_pg_num = 1024
osd_pool_default_pgp_num = 1024
osd_pool_default_flag_hashpspool = true
...
[osd]
osd recovery max active = 1
osd max backfills = 1
filestore max sync interval = 30
filestore min sync interval = 29
filestore flusher = false
filestore queue max ops = 1
filestore op threads = 16
osd op threads = 16
...
[client]
rbd_cache = true
rbd_cache_writethrough_until_flush = true

and in /etc/cinder/cinder.conf:

[DEFAULT]
volume_tmp_dir=/tmp

but as a result performance increased by only ~30%, which does not look like a
huge success.

Non-default mount options and TCP optimization increase the speed by only
about 1%:

[root@node-17 ~]# mount | grep ceph
/dev/sda4 on /var/lib/ceph/osd/ceph-0 type xfs
(rw,noexec,nodev,noatime,nodiratime,user_xattr,data=writeback,barrier=0)

[root@node-17 ~]# cat /etc/sysctl.conf
net.core.rmem_max = 16777216
net.core.wmem_max = 16777216
net.ipv4.tcp_rmem = 4096 87380 16777216
net.ipv4.tcp_wmem = 4096 65536 16777216
net.ipv4.tcp_window_scaling = 1
net.ipv4.tcp_timestamps = 1
net.ipv4.tcp_sack = 1

Do we have other ways to significantly improve CEPH storage performance?
Any feedback and comments are welcome!

Thank you!

--
Timur,
QA Engineer
OpenStack Projects
Mirantis Inc