Re: rsync error after server restore

2008-06-06 Thread Manuel Kissoyan

Wondering if this cpould happen if a file or a directory have a space on it?

Something like Nouveau\ dossier?

Thanx!






Manuel Kissoyan wrote:

Hi,

We have this backup server running for a long time without problem, it 
run the rsync command from cron and connect to many other servers 
geting the files from those servers.
One of these servers crashed a week before and we restored it, since 
then the rsync that connect to this server is giving the following 
erro after it get some files:


rsync error: timeout in data send/receive (code 30) at io.c(233) 
[sender=3.0.0pre2]

_exit_cleanup(code=30, file=io.c, line=233): about to call exit(30)
rsync: connection unexpectedly closed (83654700 bytes received so far) 
[receiver]
rsync error: error in rsync protocol data stream (code 12) at 
io.c(596) [receiver=3.0.0pre2]

_exit_cleanup(code=12, file=io.c, line=596): about to call exit(12)
rsync: connection unexpectedly closed (14185 bytes received so far) 
[generator]
rsync error: timeout in data send/receive (code 30) at io.c(596) 
[generator=3.0.0pre2]

_exit_cleanup(code=12, file=io.c, line=596): about to call exit(30)


We have installed the same version as we had before the crash, is a 
Centos 5 64 bit. and the rsync version is rsync  version 3.0.0pre2  
protocol version 30.PR10


Wondering what could cause the error.

Thank you in advance!

Manuel


--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html


Re: rsync error after server restore

2008-06-06 Thread Wayne Davison
On Wed, Jun 04, 2008 at 05:25:06PM -0300, Manuel Kissoyan wrote:
 rsync error: timeout in data send/receive (code 30) at io.c(233) 
 [sender=3.0.0pre2]

Are you using a --timeout option or a timeout parameter in a daemon
config?  If so, try increasing that value.  If not, you should check to
see if both sides are reporting a connection-closed error, or if one
side is dying/crashing.  One way to do that by attaching an strace to
each of the 3 programs (e.g. use strace -o FILE -p PID after it is
running if you don't want to tweak the way you start the copy going).

..wayne..
-- 
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html


Re: rsync error after server restore

2008-06-06 Thread Manuel Kissoyan

Hi Wayne,

Yes, im using timeout=500 in the cron/command as we were using before 
the crash and also as im using in the other servers.


Ill do the strace to see what is happening and ill post here. btw, what 
exactly do you mean when you say to attach it of each of the 3 programs?


Thank you in advance!

Manuel




Wayne Davison wrote:

On Wed, Jun 04, 2008 at 05:25:06PM -0300, Manuel Kissoyan wrote:
  

rsync error: timeout in data send/receive (code 30) at io.c(233) 
[sender=3.0.0pre2]



Are you using a --timeout option or a timeout parameter in a daemon
config?  If so, try increasing that value.  If not, you should check to
see if both sides are reporting a connection-closed error, or if one
side is dying/crashing.  One way to do that by attaching an strace to
each of the 3 programs (e.g. use strace -o FILE -p PID after it is
running if you don't want to tweak the way you start the copy going).

..wayne..

  


--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html


Re: rsync error after server restore

2008-06-06 Thread Manuel Kissoyan

Hi Wayne,

Here is the strace result.

This is the one from the server where we are running the rsync command 
and where we backup the other server:


---
select(4, [3], [], NULL, {47, 471000})  = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 0 (Timeout)
select(4, [3], [], NULL, {60, 0})   = 1 (in [3], left {0, 0})
--- SIGCHLD (Child exited) @ 0 (0) ---
waitpid(-1, [{WIFSIGNALED(s)  WTERMSIG(s) == SIGUSR1}], WNOHANG) = 18568
waitpid(-1, [{WIFEXITED(s)  WEXITSTATUS(s) == 30}], WNOHANG) = 18569
waitpid(-1, 0xbff58724, WNOHANG)= -1 ECHILD (No child processes)
sigreturn() = ? (mask now [])
read(3, , 8184)   = 0
write(2, rsync: connection unexpectedly c..., 79) = 79
write(2, \n, 1)   = 1
rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0
rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0
waitpid(18568, 0xbff58a18, WNOHANG) = -1 ECHILD (No child processes)
getpid()= 18567
kill(18568, SIGUSR1)= -1 ESRCH (No such process)
kill(18569, SIGUSR1)= -1 ESRCH (No such process)
write(2, rsync error: error in rsync prot..., 93) = 93
write(2, \n, 1)   = 1
exit_group(12)  = ?
Process 18567 detached

---





And this is from the server that we are backing up:

--
Process 19247 attached - interrupt to quit
select(1, [0], [], NULL, {6, 227000})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 0 (Timeout)
select(1, [0], [], NULL, {60, 0})   = 1 (in [0], left {0, 31000})
read(0, , 4)  = 0
select(2, NULL, [1], [1], {60, 0})  = 1 (out [1], left {60, 0})
write(1, M\0\0\10rsync: connection unexpected..., 81) = -1 EPIPE 
(Broken pipe)

--- SIGPIPE (Broken pipe) @ 0 (0) ---
write(2, rsync: writefd_unbuffered failed..., 77) = -1 EPIPE (Broken pipe)
--- SIGPIPE (Broken pipe) @ 0 (0) ---
rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0
rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0
write(2, rsync error: errors with program..., 87) = -1 EPIPE (Broken pipe)
--- SIGPIPE (Broken pipe) @ 0 (0) ---
rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0
rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0
select(0, NULL, NULL, NULL, {0, 10}) = 0 (Timeout)
exit_group(13)  = ?
Process 19247 detached
-

Thank you!

Manuel



Wayne Davison wrote:

On Wed, Jun 04, 2008 at 05:25:06PM -0300, Manuel Kissoyan wrote:
  

rsync error: timeout in data send/receive (code 30) at io.c(233) 
[sender=3.0.0pre2]



Are you using a --timeout option or a timeout parameter in a daemon
config?  If so, try increasing that value.  If not, you should check to
see if both sides are reporting a connection-closed error, or if one
side is dying/crashing.  One way to do that by attaching an strace to
each of the 3 programs (e.g. use strace -o FILE -p PID after it is
running if you don't want to tweak the way you start the copy going).

..wayne..
  

--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html


Re: rsync error after server restore

2008-06-06 Thread Wayne Davison
On Fri, Jun 06, 2008 at 04:14:45PM -0300, Manuel Kissoyan wrote:
 This is the one from the server where we are running the rsync command
 and where we backup the other server:

That's just the client.  You also need to see what the server is doing.
You should attach to all 3 rsync processes, 2 on the receiving side,
and 1 on the sending side.

..wayne..
-- 
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html


Re: rsync error after server restore

2008-06-06 Thread Manuel Kissoyan

ok, here are the last lines from the receiving side:


-
time(NULL)  = 1212801884
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212801944
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802004
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802064
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802124
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802184
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802244
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802304
select(6, [5], [], NULL, {60, 0})   = 0 (Timeout)
time(NULL)  = 1212802364
select(6, [5], [], NULL, {60, 0})   = 1 (in [5], left {6, 938000})
read(5, T\0\0\10, 4)  = 4
time(NULL)  = 1212802417
select(6, [5], [], NULL, {60, 0})   = 1 (in [5], left {60, 0})
read(5, rsync error: timeout in data sen..., 84) = 84
time(NULL)  = 1212802417
write(2, rsync error: timeout in data sen..., 83) = 83
write(2, \n, 1)   = 1
select(6, [5], [], NULL, {60, 0})   = 1 (in [5], left {59, 895000})
read(5, , 4)  = 0
write(2, rsync: connection unexpectedly c..., 81) = 81
write(2, \n, 1)   = 1
rt_sigaction(SIGUSR1, {SIG_IGN}, NULL, 8) = 0
rt_sigaction(SIGUSR2, {SIG_IGN}, NULL, 8) = 0
waitpid(18766, 0xbff0a6f8, WNOHANG) = -1 ECHILD (No child processes)
kill(18766, SIGUSR1)= 0
write(2, rsync error: error in rsync prot..., 92) = 92
write(2, \n, 1)   = 1
exit_group(12)  = ?
Process 18767 detached


---


read(3, \263\272\213?5\314\344\244\226M\4\215\305W)\1II\305\31..., 
8192) = 128

close(5)= 0
close(4)= 0
close(6)= 0
select(7, [3], [3], NULL, NULL) = 1 (out [3])
write(3, \243\17K,.sR\227^\6\W\377\267\27\200\34\254O\331\304\34..., 
32) = 32
ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid 
argument)

fcntl64(0, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK)
fcntl64(0, F_SETFL, O_RDWR) = 0
ioctl(1, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid 
argument)

fcntl64(1, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK)
fcntl64(1, F_SETFL, O_RDWR) = 0
ioctl(2, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo 
...}) = 0

gettimeofday({1212802417, 879075}, NULL) = 0
shutdown(3, 2 /* send and receive */)   = 0
close(3)= 0
brk(0x9c65000)  = 0x9c65000
brk(0x9c5b000)  = 0x9c5b000
brk(0x9c3d000)  = 0x9c3d000
brk(0x9c23000)  = 0x9c23000
exit_group(30)  = ?
Process 18766 detached





read(3, \263\272\213?5\314\344\244\226M\4\215\305W)\1II\305\31..., 
8192) = 128

close(5)= 0
close(4)= 0
close(6)= 0
select(7, [3], [3], NULL, NULL) = 1 (out [3])
write(3, \243\17K,.sR\227^\6\W\377\267\27\200\34\254O\331\304\34..., 
32) = 32
ioctl(0, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid 
argument)

fcntl64(0, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK)
fcntl64(0, F_SETFL, O_RDWR) = 0
ioctl(1, SNDCTL_TMR_TIMEBASE or TCGETS, 0xbfe18678) = -1 EINVAL (Invalid 
argument)

fcntl64(1, F_GETFL) = 0x802 (flags O_RDWR|O_NONBLOCK)
fcntl64(1, F_SETFL, O_RDWR) = 0
ioctl(2, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo 
...}) = 0

gettimeofday({1212802417, 879075}, NULL) = 0
shutdown(3, 2 /* send and receive */)   = 0
close(3)= 0
brk(0x9c65000)  = 0x9c65000
brk(0x9c5b000)  = 0x9c5b000
brk(0x9c3d000)  = 0x9c3d000
brk(0x9c23000)  = 0x9c23000
exit_group(30)  = ?
Process 18766 detached






And this from the sending side:


select(1, [0], [], NULL, {60, 0})   = 1 (in [0], left {60, 0})
read(0, \1\0\0\7, 4)  = 4
select(1, [0], [], NULL, {60, 0})   = 1 (in [0], left {60, 0})
read(0, \0, 1)= 1
select(2, NULL, [1], [1], {60, 0})  = 1 (out [1], left {60, 0})
write(1, \1\0\0\7\0, 5)   = 5
select(1, [0], [], NULL, {60, 0})   = 1 (in [0], left {55, 486000})
read(0, \1\0\0\7, 4)  = 4
select(1, 

rsync error after server restore

2008-06-04 Thread Manuel Kissoyan

Hi,

We have this backup server running for a long time without problem, it 
run the rsync command from cron and connect to many other servers geting 
the files from those servers.
One of these servers crashed a week before and we restored it, since 
then the rsync that connect to this server is giving the following erro 
after it get some files:


rsync error: timeout in data send/receive (code 30) at io.c(233) 
[sender=3.0.0pre2]

_exit_cleanup(code=30, file=io.c, line=233): about to call exit(30)
rsync: connection unexpectedly closed (83654700 bytes received so far) 
[receiver]
rsync error: error in rsync protocol data stream (code 12) at io.c(596) 
[receiver=3.0.0pre2]

_exit_cleanup(code=12, file=io.c, line=596): about to call exit(12)
rsync: connection unexpectedly closed (14185 bytes received so far) 
[generator]
rsync error: timeout in data send/receive (code 30) at io.c(596) 
[generator=3.0.0pre2]

_exit_cleanup(code=12, file=io.c, line=596): about to call exit(30)


We have installed the same version as we had before the crash, is a 
Centos 5 64 bit. and the rsync version is rsync  version 3.0.0pre2  
protocol version 30.PR10


Wondering what could cause the error.

Thank you in advance!

Manuel

--
Please use reply-all for most replies to avoid omitting the mailing list.
To unsubscribe or change options: https://lists.samba.org/mailman/listinfo/rsync
Before posting, read: http://www.catb.org/~esr/faqs/smart-questions.html