Re: MDS placement

2013-01-23 Thread Wido den Hollander
On 01/22/2013 10:19 PM, Gandalf Corvotempesta wrote: Where should MDS server be placed, in the cluster network or in the public network with MONs ? It should be located in the public network since clients need to be able to interact with it. Wido -- To unsubscribe from this list: send the

Re: /etc/init.d/ceph bug for multi-host when using -a option

2013-01-23 Thread Dan Mick
On 01/22/2013 11:18 PM, Chen, Xiaoxi wrote: Hi List, Here is part of /etc/init.d/ceph script: case $command in start) # Increase max_open_files, if the configuration calls for it. get_conf max_open_files 8192 max open files if [

Re: MDS placement

2013-01-23 Thread Wido den Hollander
On 01/23/2013 10:44 AM, Gandalf Corvotempesta wrote: 2013/1/23 Wido den Hollander w...@widodh.nl: It should be located in the public network since clients need to be able to interact with it. Ok. Cluster network is only needed by OSDs ? No other devices should be able to access it ?

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Jens Kristian Søgaard
Hi Sage, I think the problem now is just that 'osd target transaction size' is too big (default is 300). Recommended 50.. let's see how that goes. Even smaller (20 or 25) would probably be fine. I set it to 50, and that seems to have solved all my problems. After a day or so my cluster got

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Wido den Hollander
On 01/23/2013 01:14 PM, Jens Kristian Søgaard wrote: Hi Sage, I think the problem now is just that 'osd target transaction size' is too big (default is 300). Recommended 50.. let's see how that goes. Even smaller (20 or 25) would probably be fine. Going through the code and reading that

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Jens Kristian Søgaard
Hi Wido, Going through the code and reading that this solved it for Jens, could this issue be traced back to less powerful CPUs? Depends on what you mean by less powerful. All my OSD servers are equipped with Xeon E5606 CPUs. That is a quad-core 2.13 Ghz CPU. They are not used for

Re: Will multi-monitor speed up pg initializing?

2013-01-23 Thread Sage Weil
On Wed, 23 Jan 2013, Chen, Xiaoxi wrote: Hi list, When first time I start my ceph cluster,it takes more than 15 minutes to get all the pg activeclean. It's fast at first (say 100pg/s) but quite slow when only hundreds of PG left peering. Is it a common situation? Since

Using a Data Pool

2013-01-23 Thread Paul Sherriffs
Hello All; I have been trying to associate a directory to a data pool (both called 'Media') according to a previous thread on this list. It all works except the last line: ceph osd pool create Media 500 500 ceph mds add_data_pool 3 added data pool 3 to mdsmap mkdir /mnt/ceph/Media cephfs

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Sage Weil
On Wed, 23 Jan 2013, Wido den Hollander wrote: On 01/23/2013 01:14 PM, Jens Kristian S?gaard wrote: Hi Sage, I think the problem now is just that 'osd target transaction size' is too big (default is 300). Recommended 50.. let's see how that goes. Even smaller (20 or 25) would

Re: Understanding Ceph

2013-01-23 Thread Sam Lang
On Sun, Jan 20, 2013 at 10:39 AM, Dimitri Maziuk dmaz...@bmrb.wisc.edu wrote: On 1/19/2013 12:16 PM, Sage Weil wrote: We generally recommend the KVM+librbd route, as it is easier to manage the dependencies, and is well integrated with libvirt. FWIW this is what OpenStack and CloudStack

Re: Understanding Ceph

2013-01-23 Thread Patrick McGarry
Dimitri, For what it's worth I also stepped through the process of spinning up Ceph and OpenStack on a single EC2 node in a recent blog entry: http://ceph.com/howto/building-a-public-ami-with-ceph-and-openstack/ It has some shortcuts (read: not meant to be production) but it may help give you a

Re: [PATCH] net/ceph/osdmap.c: fix undefined behavior when using snprintf()

2013-01-23 Thread Alex Elder
On 01/22/2013 01:20 PM, Cong Ding wrote: The variable str is used as both the source and destination in function snprintf(), which is undefined behavior based on C11. The original description in C11 is: If copying takes place between objects that overlap, the behavior is

Re: [PATCH 01/25] mds: fix end check in Server::handle_client_readdir()

2013-01-23 Thread Sage Weil
Hi Yan, I pushed this one to next, thanks. BTW what are you using to reproduce this? We'd like to continue to improve the coverage of ceph-qa-suite.git/suites/fs. Thanks! sage On Wed, 23 Jan 2013, Yan, Zheng wrote: From: Yan, Zheng zheng.z@intel.com commit 1174dd3188 (don't retry

Re: [PATCH] net/ceph/osdmap.c: fix undefined behavior when using snprintf()

2013-01-23 Thread Cong Ding
On Wed, Jan 23, 2013 at 10:48:07AM -0600, Alex Elder wrote: On 01/22/2013 01:20 PM, Cong Ding wrote: The variable str is used as both the source and destination in function snprintf(), which is undefined behavior based on C11. The original description in C11 is: If copying takes

Re: [PATCH] net/ceph/osdmap.c: fix undefined behavior when using snprintf()

2013-01-23 Thread Alex Elder
On 01/23/2013 11:41 AM, Cong Ding wrote: On Wed, Jan 23, 2013 at 10:48:07AM -0600, Alex Elder wrote: On 01/22/2013 01:20 PM, Cong Ding wrote: The variable str is used as both the source and destination in function snprintf(), which is undefined behavior based on C11. The original description

[PATCH 0/2] Fix some autoconf issues

2013-01-23 Thread Danny Al-Gaaf
These patches contains some autoconf fixes/cleanups. Danny Al-Gaaf (2): configure: fix RPM_RELEASE configure: remove -m4_include(m4/acx_pthread.m4) configure.ac | 10 -- 1 file changed, 8 insertions(+), 2 deletions(-) -- 1.8.1.1 -- To unsubscribe from this list: send the line

[PATCH 1/2] configure: fix RPM_RELEASE

2013-01-23 Thread Danny Al-Gaaf
Use git to get RPM_RELEASE only if this is a git repo clone and if the git command is available on the system. Signed-off-by: Danny Al-Gaaf danny.al-g...@bisect.de --- configure.ac | 9 - 1 file changed, 8 insertions(+), 1 deletion(-) diff --git a/configure.ac b/configure.ac index

Re: radosgw

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 9:56 AM, Gandalf Corvotempesta gandalf.corvotempe...@gmail.com wrote: I'm trying to configure RadosGW but actually, radosgw doesn't start. I've followed this guide: http://ceph.com/docs/master/radosgw/config/ Apache configuration is OK and i've added the following

Re: radosgw

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 10:02 AM, Gandalf Corvotempesta gandalf.corvotempe...@gmail.com wrote: 2013/1/23 Yehuda Sadeh yeh...@inktank.com: try # radosgw -n client radosgw.gateway Still doesn't work. whoops, was missing a period there. # radosgw -n client radosgw.gateway error parsing

Re: Consistently reading/writing rados objects via command line

2013-01-23 Thread Nick Bartos
This seems to be working ok for the most part, but I noticed that using large files gives errors getting them (but not putting them). The problems start after 2GB which, as you said, is larger than should be used in this method. It shouldn't affect us since we shouldn't be using this for files

Re: Installing Rados Gateway from sources

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 11:53 AM, Cesar Mello cme...@gmail.com wrote: Hi, Sorry if this question is too off-topic. I am a Windows guy with no much knowledge about Linux. I have successfully installed a Rados Gateway through apt-get in a virtual machine. Now I would like to build and

Re: Installing Rados Gateway from sources

2013-01-23 Thread Cesar Mello
Thanks so much Yehuda! Best regards! Mello On Wed, Jan 23, 2013 at 6:02 PM, Yehuda Sadeh yeh...@inktank.com wrote: On Wed, Jan 23, 2013 at 11:53 AM, Cesar Mello cme...@gmail.com wrote: Hi, Sorry if this question is too off-topic. I am a Windows guy with no much knowledge about Linux. I

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Jens Kristian Søgaard
Hi Sage, I think the problem now is just that 'osd target transaction size' is I set it to 50, and that seems to have solved all my problems. After a day or so my cluster got to a HEALTH_OK state again. It has been running for a few days now without any crashes! Hmm, one of the OSDs crashed

Re: radosgw

2013-01-23 Thread Cesar Mello
Now that I built from the sources I think I got stuck in this too. The logs: root@l3:/etc/ceph# cat /var/log/ceph/radosgw.log 2013-01-23 19:05:42.233438 7ff3dae2c780 0 ceph version 0.56-395-g371e6fb (371e6fbed624ececb385663a59dad907e9153d6a), process radosgw, pid 3811 2013-01-23 19:05:43.937851

python examples for librados

2013-01-23 Thread Nick Bartos
I just recently found out that ceph has some python bindings (yay!). I see there are a couple of examples for using the rbd bindings here: http://ceph.com/docs/master/rbd/librbdpy/ But that doesn't really include much about the librados bindings. Are there any examples for that? For example I'm

Re: radosgw

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 1:08 PM, Cesar Mello cme...@gmail.com wrote: Now that I built from the sources I think I got stuck in this too. The logs: root@l3:/etc/ceph# cat /var/log/ceph/radosgw.log 2013-01-23 19:05:42.233438 7ff3dae2c780 0 ceph version 0.56-395-g371e6fb

Re: radosgw

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 1:25 PM, Gandalf Corvotempesta gandalf.corvotempe...@gmail.com wrote: 2013/1/23 Gandalf Corvotempesta gandalf.corvotempe...@gmail.com: I've solved. Ceph primary keyring should be also placed on radosgw. On radosgw these two files are needed: /etc/ceph/ceph.conf (the

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Andrey Korolyov
On Thu, Jan 24, 2013 at 12:59 AM, Jens Kristian Søgaard j...@mermaidconsulting.dk wrote: Hi Sage, I think the problem now is just that 'osd target transaction size' is I set it to 50, and that seems to have solved all my problems. After a day or so my cluster got to a HEALTH_OK state again.

Re: Using a Data Pool

2013-01-23 Thread Gregory Farnum
On Wednesday, January 23, 2013 at 5:01 AM, Paul Sherriffs wrote: Hello All; I have been trying to associate a directory to a data pool (both called 'Media') according to a previous thread on this list. It all works except the last line: ceph osd pool create Media 500 500 ceph

Re: some questions about ceph

2013-01-23 Thread Gregory Farnum
On Wednesday, January 23, 2013 at 3:35 PM, Yue Li wrote: Hi, i have some questions about ceph. ceph provide a POSIX client for users. for aio-read/write, it still use page cache on client side (seems to me). How long will the page cache expire (in case the data on server side has

Re: Understanding Ceph

2013-01-23 Thread Dimitri Maziuk
On 01/23/2013 10:19 AM, Patrick McGarry wrote: http://ceph.com/howto/building-a-public-ami-with-ceph-and-openstack/ On Wed, Jan 23, 2013 at 10:13 AM, Sam Lang sam.l...@inktank.com wrote: http://ceph.com/docs/master/rbd/rbd-openstack/ These are both great, I'm sure, but Patrick's page says I

Re: Understanding Ceph

2013-01-23 Thread John Nielsen
On Jan 23, 2013, at 5:10 PM, Dimitri Maziuk dmaz...@bmrb.wisc.edu wrote: On 01/23/2013 10:19 AM, Patrick McGarry wrote: http://ceph.com/howto/building-a-public-ami-with-ceph-and-openstack/ On Wed, Jan 23, 2013 at 10:13 AM, Sam Lang sam.l...@inktank.com wrote:

Re: [PATCH 2/2] configure: remove -m4_include(m4/acx_pthread.m4)

2013-01-23 Thread Gary Lowell
Hi Danny - These two patches are now in the wip-rpm-update-2 branch. Will merge into master after build test. Thanks, Gary On Jan 23, 2013, at 9:57 AM, Danny Al-Gaaf wrote: Since we use already AC_CONFIG_MACRO_DIR, no need to include m4/acx_pthread.m4 extra. Signed-off-by: Danny Al-Gaaf

radosgw: MethodNotAllowed response for AWS C# Sample

2013-01-23 Thread Cesar Mello
Hi, When running the PutObject sample from http://ceph.com/docs/master/radosgw/s3/csharp/ I get a MethodNotAllowed response. Please has anyone successfully run this sample? I have tested with a current local build (0.56). Thank you a lot for the attention! Best regards Mello -- To unsubscribe

Re: radosgw: MethodNotAllowed response for AWS C# Sample

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 4:46 PM, Cesar Mello cme...@gmail.com wrote: Hi, When running the PutObject sample from http://ceph.com/docs/master/radosgw/s3/csharp/ I get a MethodNotAllowed response. Please has anyone successfully run this sample? I have tested with a current local build (0.56).

Re: radosgw: MethodNotAllowed response for AWS C# Sample

2013-01-23 Thread Marcus Sorensen
Re-sending this due to plain text limitations: Are you just testing a build following the 5 min guide? I did this today, and the method not allowed just meant that apache wasn't acutally set up to call rados gateway via rewrite. I verified this by placing an index.html in /var/www and seeing that

Re: radosgw: MethodNotAllowed response for AWS C# Sample

2013-01-23 Thread Cesar Mello
Yes the radosgw log shows the request. The handler-get_op call is returning null at rgw/rgw_main.cc (RGWProcess::handle_request). I've confirmed putting this log: if (!op) { req-log(s, get_op failed!); abort_early(s, -ERR_METHOD_NOT_ALLOWED); goto done; } I'm pasting a copy of

Re: [PATCH 01/25] mds: fix end check in Server::handle_client_readdir()

2013-01-23 Thread Yan, Zheng
On 01/24/2013 01:17 AM, Sage Weil wrote: Hi Yan, I pushed this one to next, thanks. BTW what are you using to reproduce this? We'd like to continue to improve the coverage of ceph-qa-suite.git/suites/fs. My scripts delete the test directory after finishing a round of fsstress

Re: radosgw: MethodNotAllowed response for AWS C# Sample

2013-01-23 Thread Yehuda Sadeh
On Wed, Jan 23, 2013 at 6:13 PM, Cesar Mello cme...@gmail.com wrote: Yes the radosgw log shows the request. The handler-get_op call is returning null at rgw/rgw_main.cc (RGWProcess::handle_request). I've confirmed putting this log: if (!op) { req-log(s, get_op failed!);

Re: radosgw: MethodNotAllowed response for AWS C# Sample

2013-01-23 Thread Cesar Mello
Oh man now it works perfectly! Thank you so much!!! Just added the line 'rgw dns name=l3' to the [client.radosgw.gateway] section of ceph.conf. Best regards Mello On Thu, Jan 24, 2013 at 12:26 AM, Yehuda Sadeh yeh...@inktank.com wrote: On Wed, Jan 23, 2013 at 6:13 PM, Cesar Mello

Re: Understanding Ceph

2013-01-23 Thread Dimitri Maziuk
On 01/23/2013 06:17 PM, John Nielsen wrote: ... http://ceph.com/docs/master/install/rpm/ http://ceph.com/docs/master/start/quick-start/ Between those two links my own quick-start on CentOS 6.3 was maybe 6 minutes. YMMV. It does, obviously, since Deploy the configuration ... 2. Execute the

Re: Consistently reading/writing rados objects via command line

2013-01-23 Thread Sage Weil
Hi Nick- The problem here looks to just be that do_get() in rados.cc isn't making any attempt to read large objects in chunks. I'm not sure where the 2GB limit is, but it well beyond non-optimal before it gets to that point. That function needs to read in chunks of a few MB and keep going

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Sage Weil
On Wed, 23 Jan 2013, Jens Kristian S?gaard wrote: Hi Sage, I think the problem now is just that 'osd target transaction size' is I set it to 50, and that seems to have solved all my problems. After a day or so my cluster got to a HEALTH_OK state again. It has been running for a few

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Sage Weil
On Thu, 24 Jan 2013, Andrey Korolyov wrote: On Thu, Jan 24, 2013 at 12:59 AM, Jens Kristian S?gaard j...@mermaidconsulting.dk wrote: Hi Sage, I think the problem now is just that 'osd target transaction size' is I set it to 50, and that seems to have solved all my problems. After a

Re: Consistently reading/writing rados objects via command line

2013-01-23 Thread Sage Weil
Try wip-rados-get On Wed, 23 Jan 2013, Sage Weil wrote: Hi Nick- The problem here looks to just be that do_get() in rados.cc isn't making any attempt to read large objects in chunks. I'm not sure where the 2GB limit is, but it well beyond non-optimal before it gets to that point.

Re: Hit suicide timeout after adding new osd

2013-01-23 Thread Andrey Korolyov
On Thu, Jan 24, 2013 at 8:39 AM, Sage Weil s...@inktank.com wrote: On Thu, 24 Jan 2013, Andrey Korolyov wrote: On Thu, Jan 24, 2013 at 12:59 AM, Jens Kristian S?gaard j...@mermaidconsulting.dk wrote: Hi Sage, I think the problem now is just that 'osd target transaction size' is I set