Thanks Tim,

You are correct: /mnt/lustre is the Lustre share, and /share/mnt is an NFS share where the CTDB_RECOVERY_LOCK file is stored for the 3 ctdb services running on the 3 clients.

I think the problem could have something to do with the --with-privatedir option.
I wasn't sure what 'private dir' should point to in a cluster environment.

My ctdb tdb's are located at /usr/local/var/ctdb/
My Samba secrets.tdb is located at /etc/samba/, so should --with-privatedir point to /etc/samba?
And am I correct in using --with-lockdir=/usr/local/var/ctdb when configuring Samba?

Regards

Darren George
Linux Server Administrator
Tim Wickberg wrote:
It's a little hard to tell, but I'm assuming that /mnt/lustre is the Lustre mount, and /share/mnt is either local to each machine (though the name suggests otherwise) or shared storage that all three machines can see (but that isn't the Lustre pool?).

Either way, I'm guessing that either:

- Your --with-privatedir isn't very private (and is on shared storage at /share/mnt/ )
- Or, your CTDB_RECOVERY_LOCK is too private.

The --with-privatedir directory needs to be local to the machine - if all three ctdb's are trying to maintain their local state on a shared FS things would probably not behave normally.

CTDB_RECOVERY_LOCK needs to be on shared storage (preferably on the Lustre pool) so that the three ctdbd's can communicate. (Is ctdbd running on each? Any debug messages from it?)
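
As a rough way to check which of the directories discussed above sit on shared storage and which are local, something like the following could be run on each node. It is only a sketch: classify_fs and fs_type_of are helper names made up for this example, the list of "shared" filesystem types is an assumption, and older versions of GNU stat may report Lustre as UNKNOWN rather than by name.

```shell
#!/bin/sh
# Classify a filesystem type name as "shared" or "local".
# The set of shared types below is an assumption; extend it for your site.
classify_fs() {
    case "$1" in
        nfs|nfs4|lustre|gpfs|gfs|gfs2|ocfs2) echo shared ;;
        *) echo local ;;
    esac
}

# Print the filesystem type backing a path (GNU coreutils stat).
# Falls back to "unknown" if stat cannot report it.
fs_type_of() {
    stat -f -c %T "$1" 2>/dev/null || echo unknown
}

# Paths taken from this thread; any that do not exist on this host are skipped.
for p in /share/mnt /usr/local/var/ctdb /etc/samba /mnt/lustre; do
    [ -e "$p" ] || continue
    t=$(fs_type_of "$p")
    echo "$p: $t ($(classify_fs "$t"))"
done
```

Going by the advice above, the directory given to --with-privatedir should come out as local, and the directory holding CTDB_RECOVERY_LOCK as shared.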

You may also want to watch out for

https://bugzilla.lustre.org/show_bug.cgi?id=5135

Hope this helps,
- Tim

Darren George wrote:
    Hi,

I'm having an issue with file locking whilst using lustre+ctdb+samba

My Lustre Setup
mds/mgs
3 oss
3 Clients each with ctdb and samba services running.

All servers and clients running CentOS 5 - Kernel 2.6.19-92
All servers and clients with Lustre 1.6.6
ctdb 3.0
Samba 3.2.1

CTDB Configuration:

./configure --bindir=/usr/bin --sbindir=/usr/sbin --libdir=/usr/lib/ctdb

/etc/sysconfig/ctdb

CTDB_RECOVERY_LOCK="/share/mnt/recovery"
CTDB_PUBLIC_ADDRESSES=/etc/ctdb/public_addresses
CTDB_MANAGES_SAMBA=yes
CTDB_LOGFILE=/var/log/log.ctdb

/etc/ctdb/public_addresses

192.168.3.200/24 eth0

/etc/ctdb/nodes

192.168.3.173
192.168.2.3
192.168.1.3

SAMBA Configuration

./configure --with-ctdb=/etc/ctdb --with-cluster-support --enable-pie=no --bindir=/usr/bin --sbindir=/usr/sbin --libdir=/usr/lib/samba --with-privatedir=/share/mnt/ctdb --with-lockdir=/usr/local/var/ctdb --with-piddir=/var/run --with-swatdir=/usr/share/swat  --with-logfilebase=/var/log/samba --with-configdir=/etc/samba

Load smb config files from /etc/samba/smb.conf
Processing section "[homes]"
Processing section "[filepool]"
Processing section "[printers]"
Loaded services file OK.
Server role: ROLE_STANDALONE
Press enter to see a dump of your service definitions
 
[global]
        workgroup = MYGROUP
        server string = Samba Server Version %v
        passdb backend = tdbsam
        use mmap = No
        clustering = Yes
        idmap backend = tdb2
        fileid:mapping = fsname
        ea support = Yes
        cups options = raw
 
[filepool]
        comment = The Shared Area
        path = /mnt/lustre
        read
        create mask = 0777
        directory mask = 0777
 
All lustre clients have been mounted with the '-o flock' option.
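
One way to confirm this on each client is to look for the option in /proc/mounts. The snippet below is a sketch only: it assumes the Lustre client lists flock among its mount options there, and has_mount_opt is a helper name made up for this example.

```shell
#!/bin/sh
# Return success if the comma-separated option string $1 contains option $2
# as a whole word (so "localflock" does not match "flock").
has_mount_opt() {
    case ",$1," in
        *",$2,"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Walk /proc/mounts and report the flock state of every Lustre mount.
if [ -r /proc/mounts ]; then
    while read -r dev mnt type opts rest; do
        [ "$type" = "lustre" ] || continue
        if has_mount_opt "$opts" flock; then
            echo "$mnt: flock enabled"
        else
            echo "$mnt: flock NOT enabled"
        fi
    done < /proc/mounts
fi
```

Matching on whole options matters here because Lustre also has a localflock option, which only gives locking that is consistent within a single client.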

Everything from a Linux perspective is working fine: my Linux clients can read/write files on the file share and file locking is active.

From a Windows share perspective, clients can read/write files on the file share but file locking is not active.

The only errors I can find from samba logs are:

[2009/01/11 02:50:25,  3] locking/locking.c:fetch_share_mode_unlocked(857)
  fill_share_mode_lock failed
[2009/01/11 02:50:25, 10] smbd/statcache.c:stat_cache_lookup(235)
  stat_cache_lookup: lookup failed for name [SG]
[2009/01/11 02:50:33,  5] lib/util_sock.c:read_socket_with_timeout(928)
  read_socket_with_timeout: blocking read. EOF from client.
[2009/01/11 02:50:33,  3] smbd/process.c:smbd_process(2027)
  receive_message_or_smb failed: NT_STATUS_END_OF_FILE, exiting

Has anyone dealt with a similar set-up and can assist with the file locking from a ctdb+samba environment ?


