alright.. I'll try to do that tonight and see if it solves the problem.
if I were able to identify the exact things that were causing this
I'd pass them along, but right now I just have general ideas.
For example, squirrilmail webmail just breaks hard with RC4 (single
server mode), whereas in RC2 it works. At the same time, roundcube
web mail works fine. But still I have no way of knowing specifically
where within squirrilmail it gets stuck so I can't identify a
situation to debug.
Keith
At 06:07 AM 3/18/2009, you wrote:
Hi.
Try the split approach - I didn't notice this here on RC4, during my testing.
Regards.
2009/3/18 Keith Freedman
<<mailto:[email protected]>[email protected]>
I rolled back to RC2, and no longer have this problem.
i don't see these error messages, and processes that were hanging
before are not hanging now?
I'm using single process AFR. should I split it out and try RC4, or
does it not seem logical that this should be a problem?
Keith
At 09:57 AM 3/16/2009, Keith Freedman wrote:
Also, I'm wondering if this is related to the fact that I have
single process client/server.
which used to be the recommended method and now is not.
if I split those out, will that solve my problem?
At 09:50 AM 3/16/2009, Keith Freedman wrote:
At 04:06 AM 3/16/2009, Vikas Gorur wrote:
2009/3/14 Keith Freedman
<<mailto:[email protected]>[email protected]>:
> all of a sudden, I'm getting messages such as this:
>
> 2009-03-13 23:14:06 C [posix.c:709:pl_forget] posix-locks-home1: Pending
> fcntl locks found!
>
> and some processes are hanging waiting presumably for the locks?
> any way to find out what files are being locked and unlock them.
> restarting gluster doesn't seem to solve the problem.
Are you using any applications that hold POSIX fcntl locks? Try
running the server in debug mode --- then you can find out which files
are being locked/not unlocked, etc.
well, I'm sure I am, I've no idea really, there are some php scripts
which seem to hang and some python programs.
however, this problem only manifested itself when I upgraded to rc4
and the new fuse-2.7.4glfs11
so something must be significantly different about how those (or
that combination) handles locks.
Also, debug mode wont really solve the problem, cause knowing what
exact file is the problem, isn't going to help because that wont
really tell me how to prevent this from happening. Clearly one side
should get the lock and one should wait, rather than both servers in
the replicate pair just hanging on the same file?
in addition, ERROR mode logging should log enough related
information to know this stuff (this is an enhancement request :) )
Vikas
--
Engineer - Z Research
<http://gluster.com/>http://gluster.com/
_______________________________________________
Gluster-users mailing list
<mailto:[email protected]>[email protected]
http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users
_______________________________________________
Gluster-users mailing list
<mailto:[email protected]>[email protected]
http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users
_______________________________________________
Gluster-users mailing list
<mailto:[email protected]>[email protected]
http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users
_______________________________________________
Gluster-users mailing list
[email protected]
http://zresearch.com/cgi-bin/mailman/listinfo/gluster-users