Re: rsync error after server restore
Wondering if this cpould happen if a file or a directory have a space on it? Something like Nouveau\ dossier? Thanx! Manuel Kissoyan wrote: Hi, We have this backup server running for a long time without problem, it run the rsync command from cron and connect to many other servers geting the files from those servers. One of these servers crashed a week before and we restored it, since then the rsync that connect to this server is giving the following erro after it get some files: rsync error: timeout in data send/receive (code 30) at io.c(233) [sender=3.0.0pre2] _exit_cleanup(code=30, file=io.c, line=233): about to call exit(30) rsync: connection unexpectedly closed (83654700 bytes received so far) [receiver] rsync error: error in rsync protocol data stream (code 12) at io.c(596) [receiver=3.0.0pre2] _exit_cleanup(code=12, file=io.c, line=596): about to call exit(12) rsync: connection unexpectedly closed (14185 bytes received so far) [generator] rsync error: timeout in data send/receive (code 30) at io.c(596) [generator=3.0.0pre2] _exit_cleanup(code=12, file=io.c, line=596): about to call exit(30) We have installed the same version as we had before the crash, is a Centos 5 64 bit. and the rsync version is rsync version 3.0.0pre2 protocol version 30.PR10 Wondering what could cause the error. Thank you in advance! Manuel -- Please use reply-all for most replies to avoid omitting the mailing list. To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
Re: rsync error after server restore
On Wed, Jun 04, 2008 at 05:25:06PM -0300, Manuel Kissoyan wrote: rsync error: timeout in data send/receive (code 30) at io.c(233) [sender=3.0.0pre2] Are you using a --timeout option or a timeout parameter in a daemon config? If so, try increasing that value. If not, you should check to see if both sides are reporting a connection-closed error, or if one side is dying/crashing. One way to do that by attaching an strace to each of the 3 programs (e.g. use strace -o FILE -p PID after it is running if you don't want to tweak the way you start the copy going). ..wayne.. -- Please use reply-all for most replies to avoid omitting the mailing list. To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
Re: rsync error after server restore
Hi Wayne, Yes, im using timeout=500 in the cron/command as we were using before the crash and also as im using in the other servers. Ill do the strace to see what is happening and ill post here. btw, what exactly do you mean when you say to attach it of each of the 3 programs? Thank you in advance! Manuel Wayne Davison wrote: On Wed, Jun 04, 2008 at 05:25:06PM -0300, Manuel Kissoyan wrote: rsync error: timeout in data send/receive (code 30) at io.c(233) [sender=3.0.0pre2] Are you using a --timeout option or a timeout parameter in a daemon config? If so, try increasing that value. If not, you should check to see if both sides are reporting a connection-closed error, or if one side is dying/crashing. One way to do that by attaching an strace to each of the 3 programs (e.g. use strace -o FILE -p PID after it is running if you don't want to tweak the way you start the copy going). ..wayne.. -- Please use reply-all for most replies to avoid omitting the mailing list. To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
Re: rsync error after server restore
Hi Wayne, Here is the strace result. This is the one from the server where we are running the rsync command and where we backup the other server: --- select(4, [3], [], NULL, {47, 471000}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 0 (Timeout) select(4, [3], [], NULL, {60, 0}) = 1 (in [3], left {0, 0}) --- SIGCHLD (Child exited) @ 0 (0) --- waitpid(-1, [{WIFSIGNALED(s) WTERMSIG(s) == SIGUSR1}], WNOHANG) = 18568 waitpid(-1, [{WIFEXITED(s) WEXITSTATUS(s) == 30}], WNOHANG) = 18569 waitpid(-1, 0xbff58724, WNOHANG)= -1 ECHILD (No child processes) sigreturn() = ? (mask now []) read(3, , 8184) = 0 write(2, rsync: connection unexpectedly c..., 79) = 79 write(2, \n, 1) = 1 rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0 rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0 waitpid(18568, 0xbff58a18, WNOHANG) = -1 ECHILD (No child processes) getpid()= 18567 kill(18568, SIGUSR1)= -1 ESRCH (No such process) kill(18569, SIGUSR1)= -1 ESRCH (No such process) write(2, rsync error: error in rsync prot..., 93) = 93 write(2, \n, 1) = 1 exit_group(12) = ? Process 18567 detached --- And this is from the server that we are backing up: -- Process 19247 attached - interrupt to quit select(1, [0], [], NULL, {6, 227000}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 0 (Timeout) select(1, [0], [], NULL, {60, 0}) = 1 (in [0], left {0, 31000}) read(0, , 4) = 0 select(2, NULL, [1], [1], {60, 0}) = 1 (out [1], left {60, 0}) write(1, M\0\0\10rsync: connection unexpected..., 81) = -1 EPIPE (Broken pipe) --- SIGPIPE (Broken pipe) @ 0 (0) --- write(2, rsync: writefd_unbuffered failed..., 77) = -1 EPIPE (Broken pipe) --- SIGPIPE (Broken pipe) @ 0 (0) --- rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0 rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0 write(2, rsync error: errors with program..., 87) = -1 EPIPE (Broken pipe) --- SIGPIPE (Broken pipe) @ 0 (0) --- rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0 rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0 select(0, NULL, NULL, NULL, {0, 10}) = 0 (Timeout) exit_group(13) = ? Process 19247 detached - Thank you! Manuel Wayne Davison wrote: On Wed, Jun 04, 2008 at 05:25:06PM -0300, Manuel Kissoyan wrote: rsync error: timeout in data send/receive (code 30) at io.c(233) [sender=3.0.0pre2] Are you using a --timeout option or a timeout parameter in a daemon config? If so, try increasing that value. If not, you should check to see if both sides are reporting a connection-closed error, or if one side is dying/crashing. One way to do that by attaching an strace to each of the 3 programs (e.g. use strace -o FILE -p PID after it is running if you don't want to tweak the way you start the copy going). ..wayne.. -- Please use reply-all for most replies to avoid omitting the mailing list. To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
Re: rsync error after server restore
On Fri, Jun 06, 2008 at 04:14:45PM -0300, Manuel Kissoyan wrote: This is the one from the server where we are running the rsync command and where we backup the other server: That's just the client. You also need to see what the server is doing. You should attach to all 3 rsync processes, 2 on the receiving side, and 1 on the sending side. ..wayne.. -- Please use reply-all for most replies to avoid omitting the mailing list. To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html
Re: rsync error after server restore
ok, here are the last lines from the receiving side: - time(NULL) = 1212801884 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212801944 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802004 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802064 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802124 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802184 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802244 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802304 select(6, [5], [], NULL, {60, 0}) = 0 (Timeout) time(NULL) = 1212802364 select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {6, 938000}) read(5, T\0\0\10, 4) = 4 time(NULL) = 1212802417 select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {60, 0}) read(5, rsync error: timeout in data sen..., 84) = 84 time(NULL) = 1212802417 write(2, rsync error: timeout in data sen..., 83) = 83 write(2, \n, 1) = 1 select(6, [5], [], NULL, {60, 0}) = 1 (in [5], left {59, 895000}) read(5, , 4) = 0 write(2, rsync: connection unexpectedly c..., 81) = 81 write(2, \n, 1) = 1 rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0 rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0 waitpid(18766, 0xbff0a6f8, WNOHANG) = -1 ECHILD (No child processes) kill(18766, SIGUSR1)= 0 write(2, rsync error: error in rsync prot..., 92) = 92 write(2, \n, 1) = 1 exit_group(12) = ? Process 18767 detached --- read(3, \263\272\213?5\314\344\244\226M\4\215\305W)\1II\305\31..., 8192) = 128 close(5)= 0 close(4)= 0 close(6)= 0 select(7, [3], [3], NULL, NULL) = 1 (out [3]) write(3, \243\17K,.sR\227^\6\W\377\267\27\200\34\254O\331\304\34..., 32) = 32 ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid argument) fcntl64(0, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK) fcntl64(0, F_SETFL, O_RDWR) = 0 ioctl(1, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid argument) fcntl64(1, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK) fcntl64(1, F_SETFL, O_RDWR) = 0 ioctl(2, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo ...}) = 0 gettimeofday({1212802417, 879075}, NULL) = 0 shutdown(3, 2 /* send and receive */) = 0 close(3)= 0 brk(0x9c65000) = 0x9c65000 brk(0x9c5b000) = 0x9c5b000 brk(0x9c3d000) = 0x9c3d000 brk(0x9c23000) = 0x9c23000 exit_group(30) = ? Process 18766 detached read(3, \263\272\213?5\314\344\244\226M\4\215\305W)\1II\305\31..., 8192) = 128 close(5)= 0 close(4)= 0 close(6)= 0 select(7, [3], [3], NULL, NULL) = 1 (out [3]) write(3, \243\17K,.sR\227^\6\W\377\267\27\200\34\254O\331\304\34..., 32) = 32 ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid argument) fcntl64(0, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK) fcntl64(0, F_SETFL, O_RDWR) = 0 ioctl(1, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid argument) fcntl64(1, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK) fcntl64(1, F_SETFL, O_RDWR) = 0 ioctl(2, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo ...}) = 0 gettimeofday({1212802417, 879075}, NULL) = 0 shutdown(3, 2 /* send and receive */) = 0 close(3)= 0 brk(0x9c65000) = 0x9c65000 brk(0x9c5b000) = 0x9c5b000 brk(0x9c3d000) = 0x9c3d000 brk(0x9c23000) = 0x9c23000 exit_group(30) = ? Process 18766 detached And this from the sending side: select(1, [0], [], NULL, {60, 0}) = 1 (in [0], left {60, 0}) read(0, \1\0\0\7, 4) = 4 select(1, [0], [], NULL, {60, 0}) = 1 (in [0], left {60, 0}) read(0, \0, 1)= 1 select(2, NULL, [1], [1], {60, 0}) = 1 (out [1], left {60, 0}) write(1, \1\0\0\7\0, 5) = 5 select(1, [0], [], NULL, {60, 0}) = 1 (in [0], left {55, 486000}) read(0, \1\0\0\7, 4) = 4 select(1,
rsync error after server restore
Hi, We have this backup server running for a long time without problem, it run the rsync command from cron and connect to many other servers geting the files from those servers. One of these servers crashed a week before and we restored it, since then the rsync that connect to this server is giving the following erro after it get some files: rsync error: timeout in data send/receive (code 30) at io.c(233) [sender=3.0.0pre2] _exit_cleanup(code=30, file=io.c, line=233): about to call exit(30) rsync: connection unexpectedly closed (83654700 bytes received so far) [receiver] rsync error: error in rsync protocol data stream (code 12) at io.c(596) [receiver=3.0.0pre2] _exit_cleanup(code=12, file=io.c, line=596): about to call exit(12) rsync: connection unexpectedly closed (14185 bytes received so far) [generator] rsync error: timeout in data send/receive (code 30) at io.c(596) [generator=3.0.0pre2] _exit_cleanup(code=12, file=io.c, line=596): about to call exit(30) We have installed the same version as we had before the crash, is a Centos 5 64 bit. and the rsync version is rsync version 3.0.0pre2 protocol version 30.PR10 Wondering what could cause the error. Thank you in advance! Manuel -- Please use reply-all for most replies to avoid omitting the mailing list. To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html