I've built and installed RPMs for ceph for RHEL6beta.
I've placed the below ceph.conf in /etc/ceph on both of my test nodes
(test10, test11).
I build a ceph filesystem and hide the key
mkcephfs -c /etc/ceph/ceph.conf -a --mkbtrfs -k /etc/ceph/keyring.bin
cauthtool --print-key /etc/ceph/keyring.bin > /etc/ceph/secret
chmod 600 /etc/ceph/secret
scp -p /etc/ceph/secret test11:/etc/ceph
Then I start the daemons on each node:
service ceph start
My daemons start up on both nodes with 'service ceph -a start'
root 3199 1 0 17:18 ? 00:00:00 /usr/local/bin/cmon -i
0 -c /tmp/ceph.conf.5365
root 3228 1 0 17:18 ? 00:00:00 /usr/local/bin/cmds -i
test10 -c /tmp/ceph.conf.5365
root 3285 1 0 17:18 ? 00:00:00 /usr/local/bin/cosd -i
0 -c /tmp/ceph.conf.5365
(similar output on other node)
I attempt to mount the ceph filesystem on test10 (using test11's IP):
mount -t ceph -o name=admin,secretfile=/etc/ceph/keyring.bin
10.200.98.111:/ /mnt/ceph
mount error 5 = Input/output error
/var/log/messages seems to show me what the problem is:
Dec 7 17:45:19 test10 kernel: libceph: mon0 10.200.98.111:6789
connection failed
(a few more of those before mount fails)
on test11, the daemon is up and listening on that port:
tcp 0 0 10.200.98.111:6789 10.200.98.111:56805
ESTABLISHED 7781/cmon
And here's /var/log/ceph/mon.1.log on test11 (the .111 node)
2010-12-07 17:36:20.136921 --- 7780 opened log /var/log/ceph/mon.1.log ---
ceph version 0.24~rc (commit:378d13df9505e4ea9a32f42cb713cdcf7aaccda0)
2010-12-07 17:36:20.137164 7f80c51e3720 store(/data/mon1) mount
2010-12-07 17:36:20.138241 7f80c51e3720 mo...@1(starting) e1 init fsid
1b4cabdb-30d2-752d-005f-517a7fa982f8
2010-12-07 17:36:20.165407 7f80c51e3720 log [INF] : mon.1 calling new
monitor election
2010-12-07 17:36:20.192343 7f80c51e1710 -- 10.200.98.111:6789/0 >>
10.200.98.110:6789/0 pipe(0x1cafd20 sd=6 pgs=0 cs=0 l=0).fault first
fault
And for test10 (the .110 node)
2010-12-07 17:36:49.183357 --- 5767 opened log /var/log/ceph/mon.0.log ---
ceph version 0.24~rc (commit:378d13df9505e4ea9a32f42cb713cdcf7aaccda0)
2010-12-07 17:36:49.183545 7ff24669c720 store(/data/mon0) mount
2010-12-07 17:36:49.184556 7ff24669c720 mo...@0(starting) e1 init fsid
1b4cabdb-30d2-752d-005f-517a7fa982f8
2010-12-07 17:36:49.600650 7ff24669c720 log [INF] : mon.0 calling new
monitor election
2010-12-07 17:36:49.645875 7ff24669a710 -- 10.200.98.110:6789/0 >>
10.200.98.111:6789/0 pipe(0xac7d20 sd=6 pgs=0 cs=0 l=0).fault first
fault
Does this mean my cmon on 111 is getting into a state where it's not
receiving incoming connections?
Any suggestions on where to go from here?
thanks,
Brian Chrisman
----- ceph.conf in /etc/ceph -----
; From sample:
[global]
auth supported = cephx
[mon]
mon data = /data/mon$id
; logging, for debugging monitor crashes, in order of
; their likelihood of being helpful :)
;debug ms = 1
;debug mon = 20
;debug paxos = 20
;debug auth = 20
[mon0]
host = test10
mon addr = 10.200.98.110:6789
[mon1]
host = test11
mon addr = 10.200.98.111:6789
[mds]
keyring = /data/keyring.$name
;debug ms = 1
;debug mds = 20
[mds.test10]
host = test10
[mds.test11]
host = test11
[osd]
osd data = /data/osd$id
osd journal = /data/osd$id/journal
osd journal size = 1000 ; journal size, in megabytes
;debug ms = 1
;debug osd = 20
;debug filestore = 20
;debug journal = 20
[osd0]
host = test10
btrfs devs = /dev/sdd4
[osd1]
host = test11
btrfs devs = /dev/sdd4
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to [email protected]
More majordomo info at http://vger.kernel.org/majordomo-info.html