Re: [Bacula-users] fd lose connection

2023-11-15 Thread Lionel PLASSE
Going further, I check my remote machine provider and it seems that  there tcp  
services monitoring was the real problem for my brutal connection termination
So they won't now perturbate my connections.

I too  activated tcp_bbr  and sfq for queuing, and I reactivated psk passphrase 
and tls verify peer.

I now have a  stable backup 

So, sorry for all the issues I mentioned, which seem to be coming from the 
cloud services provider that has apparently been the source of these issues

Be aware too  that /sys/class/net/enp1s0/power/control is "auto" by default in 
Debian 12 and should be set  to "on" to prevent device  from being power 
managed during backup (even in linux ; ) (my eth device is not IEEE802.3az 
compliant but it could be a problem if it is)
 
thanks


-Message d'origine-
De : Josh Fisher via Bacula-users  
Envoyé : jeudi 9 novembre 2023 01:16
À : Martin Simmons 
Cc : bacula-users@lists.sourceforge.net
Objet : Re: [Bacula-users] fd lose connection


On 11/8/23 13:32, Martin Simmons wrote:
>>>>>> On Wed, 8 Nov 2023 11:09:44 -0500, Josh Fisher via Bacula-users said:
>> On 11/7/23 19:26, Lionel PLASSE wrote:
>>> I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
>>> backup my servers :) .
>>> I was talking about ssh (scp) transfer just to Just to show out  I have no 
>>> problem when uploading big continuous data using other tools through this 
>>> wan. The wan connection is quite stable.
>>>
>>> "So it is fine when the NIC is up. Since this is Windows,"
>>> no windows. I discarder windows problem hypothesis by using a 
>>> migration job, so from linux sd to linux sd
>> OK. I see that now. You also tried without compression and without 
>> encryption. Have you tried reducing Maximum Network Buffer Size back 
>> to the default 32768?
> Are you sure it is 32768?
>
> I thought the default comes from this in bacula/src/baconfig.h:
>
> #define DEFAULT_NETWORK_BUFFER_SIZE (64 * 1024)


In the docs it says the default is 32768, but if it's in the source, then 
that's what it is. :)





___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


smime.p7s
Description: S/MIME cryptographic signature
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-08 Thread Josh Fisher via Bacula-users


On 11/8/23 13:32, Martin Simmons wrote:

On Wed, 8 Nov 2023 11:09:44 -0500, Josh Fisher via Bacula-users said:

On 11/7/23 19:26, Lionel PLASSE wrote:

I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
backup my servers :) .
I was talking about ssh (scp) transfer just to Just to show out  I have no 
problem when uploading big continuous data using other tools through this wan. 
The wan connection is quite stable.

"So it is fine when the NIC is up. Since this is Windows,"
no windows. I discarder windows problem hypothesis by using a migration job, so 
from linux sd to linux sd

OK. I see that now. You also tried without compression and without
encryption. Have you tried reducing Maximum Network Buffer Size back to
the default 32768?

Are you sure it is 32768?

I thought the default comes from this in bacula/src/baconfig.h:

#define DEFAULT_NETWORK_BUFFER_SIZE (64 * 1024)



In the docs it says the default is 32768, but if it's in the source, 
then that's what it is. :)






___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-08 Thread Lionel PLASSE
Ok , Without encryption (TlsPskEnable = false  and TlsVerifyPeer=false)  and 
lzo-md5 the job  finished well  and all job that were in error gone ok too 
today.
Just to say , on Debian 12 , libssl  is on the new v3.0 version so I have to 
retroport  it  from the bullseye repo in order to install Bacula which depends 
on libssl1.1
Perhaps it could be why I encounter this problem. 
(for info too, the ethernet card is  :  

> OK. I see that now. You also tried without compression and without 
> encryption. Have you tried reducing Maximum Network Buffer Size back 
> to the default 32768?
I also have  tested with 32768 max net buffer size, but it was from window's  
fd to Debian 12 remote sd and it went in error (but really not sure  if at this 
time  all my confs were properly set in  both side) : 
  Error: lib/bsock.c:397 Wrote 32772 bytes to Storage 
daemon:192.168.0.18:9103, but only 32768 accepted.

I now use without encr (and with  lzo-md5 comp for data), Its ok, I can work 
like this.
As the last past jobs are now ok I won't go further.

Thank you for considering my problem. 
Great thanks for the Bacula and bbr explanation I will study this with great 
attention. Sure it could give me much more efficience. 


-Message d'origine-
De : Martin Simmons  
Envoyé : mercredi 8 novembre 2023 19:32
À : Josh Fisher 
Cc : bacula-users@lists.sourceforge.net
Objet : Re: [Bacula-users] fd lose connection

>>>>> On Wed, 8 Nov 2023 11:09:44 -0500, Josh Fisher via Bacula-users said:
> 
> On 11/7/23 19:26, Lionel PLASSE wrote:
> > I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
> > backup my servers :) .
> > I was talking about ssh (scp) transfer just to Just to show out  I have no 
> > problem when uploading big continuous data using other tools through this 
> > wan. The wan connection is quite stable.
> >
> > "So it is fine when the NIC is up. Since this is Windows,"
> > no windows. I discarder windows problem hypothesis by using a 
> > migration job, so from linux sd to linux sd
> 
> OK. I see that now. You also tried without compression and without 
> encryption. Have you tried reducing Maximum Network Buffer Size back 
> to the default 32768?

Are you sure it is 32768?

I thought the default comes from this in bacula/src/baconfig.h:

#define DEFAULT_NETWORK_BUFFER_SIZE (64 * 1024)

>There must be some reason why the client seems to 
> be sending 30 bytes more than its Maximum Network Buffer Size. Bacula 
> first tries the Maximum Network Buffer Size, but if the OS does not 
> accept that size, then it adjusts the value down until the OS accepts 
> it. Maybe the actual buffer size gets calculated differently on Debian 
> 12? Why is the send size exceeding the buffer size? Or could there be 
> a typo in the Maximum Network Buffer Size setting on one side?

It's an interesting question.  OTOH, if the connection is using TLS, then the 
underlying number of bytes transmitted doesn't match the application layer 
anyway.

Also, can you explain why the network buffer size would cause data loss on a 
TCP connection?

> 
> 
> >
> > Thanks for all, I will find out a solution Best regards
> >
> > PLASSE Lionel | Networks & Systems Administrator
> > 221 Allée de Fétan
> > 01600 TREVOUX - FRANCE
> > Tel : +33(0)4.37.49.91.39
> > pla...@cofiem.fr
> > www.cofiem.fr | www.cofiem-robotics.fr
> >
> >   
> >
> >
> >
> >
> >
> > -Message d'origine-
> > De : Josh Fisher via Bacula-users 
> > 
> > Envoyé : mardi 7 novembre 2023 18:01 À : 
> > bacula-users@lists.sourceforge.net
> > Objet : Re: [Bacula-users] fd lose connection
> >
> >
> > On 11/7/23 04:34, Lionel PLASSE wrote:
> >> Hello ,
> >>
> >> Could  Encryption have any impact in my problem.
> >>
> >> I am testing without any encryption between SD/DIR/BConsole or FD 
> >> and it seems to be more stable. (short sized  job right done , 
> >> longest job already running : 750Gb to migrate)
> >>
> >> My WAN connection seems  to be quite good,  I  achieve transferring big 
> >> and small  raw files by scp ssh and don't have ping latency or troubles 
> >> with the ipsec connection.
> >
> > So it is fine when the NIC is up. Since this is Windows, the first thing to 
> > do is turn off power saving for the network interface device in Device 
> > Manager. Make sure that the NIC doesn't ever power down its PHY.
> > If any switch, router, or VPN doesn't handle energy-efficient internet in 
> > the same way, then it can look like a dropped connection to the other side.
> >
> > Also, you don't s

Re: [Bacula-users] fd lose connection

2023-11-08 Thread Martin Simmons
>>>>> On Wed, 8 Nov 2023 11:09:44 -0500, Josh Fisher via Bacula-users said:
> 
> On 11/7/23 19:26, Lionel PLASSE wrote:
> > I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
> > backup my servers :) .
> > I was talking about ssh (scp) transfer just to Just to show out  I have no 
> > problem when uploading big continuous data using other tools through this 
> > wan. The wan connection is quite stable.
> >
> > "So it is fine when the NIC is up. Since this is Windows,"
> > no windows. I discarder windows problem hypothesis by using a migration 
> > job, so from linux sd to linux sd
> 
> OK. I see that now. You also tried without compression and without 
> encryption. Have you tried reducing Maximum Network Buffer Size back to 
> the default 32768?

Are you sure it is 32768?

I thought the default comes from this in bacula/src/baconfig.h:

#define DEFAULT_NETWORK_BUFFER_SIZE (64 * 1024)

>There must be some reason why the client seems to be 
> sending 30 bytes more than its Maximum Network Buffer Size. Bacula first 
> tries the Maximum Network Buffer Size, but if the OS does not accept 
> that size, then it adjusts the value down until the OS accepts it. Maybe 
> the actual buffer size gets calculated differently on Debian 12? Why is 
> the send size exceeding the buffer size? Or could there be a typo in the 
> Maximum Network Buffer Size setting on one side?

It's an interesting question.  OTOH, if the connection is using TLS, then the
underlying number of bytes transmitted doesn't match the application layer
anyway.

Also, can you explain why the network buffer size would cause data loss on a
TCP connection?

> 
> 
> >
> > Thanks for all, I will find out a solution
> > Best regards
> >
> > PLASSE Lionel | Networks & Systems Administrator
> > 221 Allée de Fétan
> > 01600 TREVOUX - FRANCE
> > Tel : +33(0)4.37.49.91.39
> > pla...@cofiem.fr
> > www.cofiem.fr | www.cofiem-robotics.fr
> >
> >   
> >
> >
> >
> >
> >
> > -Message d'origine-
> > De : Josh Fisher via Bacula-users 
> > Envoyé : mardi 7 novembre 2023 18:01
> > À : bacula-users@lists.sourceforge.net
> > Objet : Re: [Bacula-users] fd lose connection
> >
> >
> > On 11/7/23 04:34, Lionel PLASSE wrote:
> >> Hello ,
> >>
> >> Could  Encryption have any impact in my problem.
> >>
> >> I am testing without any encryption between SD/DIR/BConsole or FD and
> >> it seems to be more stable. (short sized  job right done , longest job
> >> already running : 750Gb to migrate)
> >>
> >> My WAN connection seems  to be quite good,  I  achieve transferring big 
> >> and small  raw files by scp ssh and don't have ping latency or troubles 
> >> with the ipsec connection.
> >
> > So it is fine when the NIC is up. Since this is Windows, the first thing to 
> > do is turn off power saving for the network interface device in Device 
> > Manager. Make sure that the NIC doesn't ever power down its PHY.
> > If any switch, router, or VPN doesn't handle energy-efficient internet in 
> > the same way, then it can look like a dropped connection to the other side.
> >
> > Also, you don't say what type of WAN connection this is. Many wireless 
> > services, 5G, etc. can and will drop open sockets due to inactivity (or 
> > perceived inactivity) to free up channels.
> >
> >
> >> I tried too with NAT,  by not using IPSEC and setting  Bacula SD & DIR  
> >> directly in front of the WAN.
> >> And the same occurs  (wrote X byte  but only Y accepted)
> >>
> >> I Tried too to make a migration job to migrate from  SD  to SD through WAN 
> >>  instead of SD-> FD through WAN and the result was the same. (to see if 
> >> win32 FD could be involved)
> >>- DIR and SD in the same LAN.
> >>- Backup remote  FD through  remote SD, the two are in the same LAN to 
> >> fast backup : step OK .
> >> - Then  migration from remote SD to the SD that is in the DIR area
> >> through WAN to outsource volumes physical support : step nok The final 
> >> goal: outsourcing volumes.
> >> I then discard the gzip compression (just in case)
> >>
> >> The errors are quite disturbing :
> >> *   Error: lib/bsock.c:397 Wrote 65566 bytes to 
> >> client:192.168.0.4:9102, but only 65536 accepted
> >>   Fatal error: filed/backup.c:1008 Network send error to SD.
> >> ERR=Input/output error Or  (when increasing MaximumNetworkBuf

Re: [Bacula-users] fd lose connection

2023-11-08 Thread Heitor Faria
Hello Lionel, 


On 11/7/23 19:26, Lionel PLASSE wrote:
> I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
> backup my servers :) .
> I was talking about ssh (scp) transfer just to Just to show out I have no 
> problem when uploading big continuous data using other tools through this 
> wan. The wan connection is quite stable.
>
> "So it is fine when the NIC is up. Since this is Windows,"
> no windows. I discarder windows problem hypothesis by using a migration job, 
> so from linux sd to linux sd

"OK. I see that now. You also tried without compression and without
encryption. Have you tried reducing Maximum Network Buffer Size back to
the default 32768? There must be some reason why the client seems to be
sending 30 bytes more than its Maximum Network Buffer Size. Bacula first
tries the Maximum Network Buffer Size, but if the OS does not accept
that size, then it adjusts the value down until the OS accepts it. Maybe
the actual buffer size gets calculated differently on Debian 12? Why is
the send size exceeding the buffer size? Or could there be a typo in the
Maximum Network Buffer Size setting on one side?"

In addition to the Josh considerations, I would consider trying BBR: 
https://www.bacula.lat/bacula-e-prototoclo-bbr-para-backups-via-internet-perdas-de-pacote-desconecoes-etc/?lang=en
I have a mixed OnP+cloud customer +1000 machines situations, where this was the 
only solution for the network interruptions.
The BBR is not only more resilient to transmission contingencies (NATs, 
firewall drops, switches/router resets etc.), but also provides much faster 
throughput recovery in case of failure.
Underlying VPN is also something that, by my experience, are prone to 
disconnections that have huge impact on Linux default congestion protocol.

Rgds.___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-08 Thread Josh Fisher via Bacula-users


On 11/7/23 19:26, Lionel PLASSE wrote:

I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
backup my servers :) .
I was talking about ssh (scp) transfer just to Just to show out  I have no 
problem when uploading big continuous data using other tools through this wan. 
The wan connection is quite stable.

"So it is fine when the NIC is up. Since this is Windows,"
no windows. I discarder windows problem hypothesis by using a migration job, so 
from linux sd to linux sd


OK. I see that now. You also tried without compression and without 
encryption. Have you tried reducing Maximum Network Buffer Size back to 
the default 32768? There must be some reason why the client seems to be 
sending 30 bytes more than its Maximum Network Buffer Size. Bacula first 
tries the Maximum Network Buffer Size, but if the OS does not accept 
that size, then it adjusts the value down until the OS accepts it. Maybe 
the actual buffer size gets calculated differently on Debian 12? Why is 
the send size exceeding the buffer size? Or could there be a typo in the 
Maximum Network Buffer Size setting on one side?





Thanks for all, I will find out a solution
Best regards

PLASSE Lionel | Networks & Systems Administrator
221 Allée de Fétan
01600 TREVOUX - FRANCE
Tel : +33(0)4.37.49.91.39
pla...@cofiem.fr
www.cofiem.fr | www.cofiem-robotics.fr

  






-Message d'origine-
De : Josh Fisher via Bacula-users 
Envoyé : mardi 7 novembre 2023 18:01
À : bacula-users@lists.sourceforge.net
Objet : Re: [Bacula-users] fd lose connection


On 11/7/23 04:34, Lionel PLASSE wrote:

Hello ,

Could  Encryption have any impact in my problem.

I am testing without any encryption between SD/DIR/BConsole or FD and
it seems to be more stable. (short sized  job right done , longest job
already running : 750Gb to migrate)

My WAN connection seems  to be quite good,  I  achieve transferring big and 
small  raw files by scp ssh and don't have ping latency or troubles with the 
ipsec connection.


So it is fine when the NIC is up. Since this is Windows, the first thing to do 
is turn off power saving for the network interface device in Device Manager. 
Make sure that the NIC doesn't ever power down its PHY.
If any switch, router, or VPN doesn't handle energy-efficient internet in the 
same way, then it can look like a dropped connection to the other side.

Also, you don't say what type of WAN connection this is. Many wireless 
services, 5G, etc. can and will drop open sockets due to inactivity (or 
perceived inactivity) to free up channels.



I tried too with NAT,  by not using IPSEC and setting  Bacula SD & DIR  
directly in front of the WAN.
And the same occurs  (wrote X byte  but only Y accepted)

I Tried too to make a migration job to migrate from  SD  to SD through WAN  
instead of SD-> FD through WAN and the result was the same. (to see if win32 FD 
could be involved)
   - DIR and SD in the same LAN.
   - Backup remote  FD through  remote SD, the two are in the same LAN to fast 
backup : step OK .
- Then  migration from remote SD to the SD that is in the DIR area
through WAN to outsource volumes physical support : step nok The final goal: 
outsourcing volumes.
I then discard the gzip compression (just in case)

The errors are quite disturbing :
*   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
but only 65536 accepted
  Fatal error: filed/backup.c:1008 Network send error to SD.
ERR=Input/output error Or  (when increasing MaximumNetworkBuffer)
*   Error: lib/bsock.c:397 Wrote 130277 bytes to client:192.168.0.17:9102, 
but only 98304 accepted.
  Fatal error: filed/backup.c:1008 Network send error to SD.
ERR=Input/output error Or (Migration job)
*   Fatal error: append.c:319 Network error reading from FD. ERR=Erreur 
d'entrée/sortie
  Error: bsock.c:571 Read expected 131118 got 114684 from
Storage daemon:192.168.10.54:9103

It's look like there is a gap between send and receive buffer and looking at 
the source code, encryption could affect buffer size due to encryption.
So I  think Bacula-SD could be in cause (maybe).
Could it be a bug?
What could I do to determine the problem (activating debug in sd
daemon ? )

I use Bacula 13.0.3 on Debian 12 , with ssl 1.1

Thank for helping. Backup scenarios must have an a step of relocating the 
backup media to be reliable.

___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users



___
Bacula-users mail

Re: [Bacula-users] fd lose connection

2023-11-07 Thread Lionel PLASSE
I’m sorry but no wi-fi nor 5G in our factory, and don't use my phone too to 
backup my servers :) . 
I was talking about ssh (scp) transfer just to Just to show out  I have no 
problem when uploading big continuous data using other tools through this wan. 
The wan connection is quite stable.

"So it is fine when the NIC is up. Since this is Windows,"
no windows. I discarder windows problem hypothesis by using a migration job, so 
from linux sd to linux sd

Thanks for all, I will find out a solution 
Best regards

PLASSE Lionel | Networks & Systems Administrator 
221 Allée de Fétan
01600 TREVOUX - FRANCE 
Tel : +33(0)4.37.49.91.39
pla...@cofiem.fr
www.cofiem.fr | www.cofiem-robotics.fr

 





-Message d'origine-
De : Josh Fisher via Bacula-users  
Envoyé : mardi 7 novembre 2023 18:01
À : bacula-users@lists.sourceforge.net
Objet : Re: [Bacula-users] fd lose connection


On 11/7/23 04:34, Lionel PLASSE wrote:
> Hello ,
>
> Could  Encryption have any impact in my problem.
>
> I am testing without any encryption between SD/DIR/BConsole or FD and 
> it seems to be more stable. (short sized  job right done , longest job 
> already running : 750Gb to migrate)
>
> My WAN connection seems  to be quite good,  I  achieve transferring big and 
> small  raw files by scp ssh and don't have ping latency or troubles with the 
> ipsec connection.


So it is fine when the NIC is up. Since this is Windows, the first thing to do 
is turn off power saving for the network interface device in Device Manager. 
Make sure that the NIC doesn't ever power down its PHY. 
If any switch, router, or VPN doesn't handle energy-efficient internet in the 
same way, then it can look like a dropped connection to the other side.

Also, you don't say what type of WAN connection this is. Many wireless 
services, 5G, etc. can and will drop open sockets due to inactivity (or 
perceived inactivity) to free up channels.


>
> I tried too with NAT,  by not using IPSEC and setting  Bacula SD & DIR  
> directly in front of the WAN.
> And the same occurs  (wrote X byte  but only Y accepted)
>
> I Tried too to make a migration job to migrate from  SD  to SD through WAN  
> instead of SD-> FD through WAN and the result was the same. (to see if win32 
> FD could be involved)
>   - DIR and SD in the same LAN.
>   - Backup remote  FD through  remote SD, the two are in the same LAN to fast 
> backup : step OK .
> - Then  migration from remote SD to the SD that is in the DIR area 
> through WAN to outsource volumes physical support : step nok The final goal: 
> outsourcing volumes.
> I then discard the gzip compression (just in case)
>
> The errors are quite disturbing :
> *   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
> but only 65536 accepted
>  Fatal error: filed/backup.c:1008 Network send error to SD. 
> ERR=Input/output error Or  (when increasing MaximumNetworkBuffer)
> *   Error: lib/bsock.c:397 Wrote 130277 bytes to 
> client:192.168.0.17:9102, but only 98304 accepted.
>  Fatal error: filed/backup.c:1008 Network send error to SD. 
> ERR=Input/output error Or (Migration job)
> *   Fatal error: append.c:319 Network error reading from FD. ERR=Erreur 
> d'entrée/sortie
>  Error: bsock.c:571 Read expected 131118 got 114684 from 
> Storage daemon:192.168.10.54:9103
>
> It's look like there is a gap between send and receive buffer and looking at 
> the source code, encryption could affect buffer size due to encryption.
> So I  think Bacula-SD could be in cause (maybe).
> Could it be a bug?
> What could I do to determine the problem (activating debug in sd 
> daemon ? )
>
> I use Bacula 13.0.3 on Debian 12 , with ssl 1.1
>
> Thank for helping. Backup scenarios must have an a step of relocating the 
> backup media to be reliable.
>
> ___
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users


___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users

___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-07 Thread Josh Fisher via Bacula-users


On 11/7/23 04:34, Lionel PLASSE wrote:

Hello ,

Could  Encryption have any impact in my problem.

I am testing without any encryption between SD/DIR/BConsole or FD and it seems 
to be more stable. (short sized  job right done , longest job already running : 
750Gb to migrate)

My WAN connection seems  to be quite good,  I  achieve transferring big and 
small  raw files by scp ssh and don't have ping latency or troubles with the 
ipsec connection.



So it is fine when the NIC is up. Since this is Windows, the first thing 
to do is turn off power saving for the network interface device in 
Device Manager. Make sure that the NIC doesn't ever power down its PHY. 
If any switch, router, or VPN doesn't handle energy-efficient internet 
in the same way, then it can look like a dropped connection to the other 
side.


Also, you don't say what type of WAN connection this is. Many wireless 
services, 5G, etc. can and will drop open sockets due to inactivity (or 
perceived inactivity) to free up channels.





I tried too with NAT,  by not using IPSEC and setting  Bacula SD & DIR  
directly in front of the WAN.
And the same occurs  (wrote X byte  but only Y accepted)

I Tried too to make a migration job to migrate from  SD  to SD through WAN  
instead of SD-> FD through WAN and the result was the same. (to see if win32 FD 
could be involved)
  - DIR and SD in the same LAN.
  - Backup remote  FD through  remote SD, the two are in the same LAN to fast 
backup : step OK .
- Then  migration from remote SD to the SD that is in the DIR area through WAN 
to outsource volumes physical support : step nok
The final goal: outsourcing volumes.
I then discard the gzip compression (just in case)

The errors are quite disturbing :
*   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
but only 65536 accepted
 Fatal error: filed/backup.c:1008 Network send error to SD. 
ERR=Input/output error
Or  (when increasing MaximumNetworkBuffer)
*   Error: lib/bsock.c:397 Wrote 130277 bytes to client:192.168.0.17:9102, 
but only 98304 accepted.
 Fatal error: filed/backup.c:1008 Network send error to SD. 
ERR=Input/output error
Or (Migration job)
*   Fatal error: append.c:319 Network error reading from FD. ERR=Erreur 
d'entrée/sortie
 Error: bsock.c:571 Read expected 131118 got 114684 from Storage 
daemon:192.168.10.54:9103

It's look like there is a gap between send and receive buffer and looking at 
the source code, encryption could affect buffer size due to encryption.
So I  think Bacula-SD could be in cause (maybe).
Could it be a bug?
What could I do to determine the problem (activating debug in sd daemon ? )

I use Bacula 13.0.3 on Debian 12 , with ssl 1.1

Thank for helping. Backup scenarios must have an a step of relocating the 
backup media to be reliable.

___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users



___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-07 Thread Lionel PLASSE
Hello ,

Could  Encryption have any impact in my problem.

I am testing without any encryption between SD/DIR/BConsole or FD and it seems 
to be more stable. (short sized  job right done , longest job already running : 
750Gb to migrate)

My WAN connection seems  to be quite good,  I  achieve transferring big and 
small  raw files by scp ssh and don't have ping latency or troubles with the 
ipsec connection.

I tried too with NAT,  by not using IPSEC and setting  Bacula SD & DIR  
directly in front of the WAN.
And the same occurs  (wrote X byte  but only Y accepted)

I Tried too to make a migration job to migrate from  SD  to SD through WAN  
instead of SD-> FD through WAN and the result was the same. (to see if win32 FD 
could be involved)
 - DIR and SD in the same LAN.
 - Backup remote  FD through  remote SD, the two are in the same LAN to fast 
backup : step OK .
- Then  migration from remote SD to the SD that is in the DIR area through WAN 
to outsource volumes physical support : step nok
The final goal: outsourcing volumes.
I then discard the gzip compression (just in case)

The errors are quite disturbing :
*   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
but only 65536 accepted
Fatal error: filed/backup.c:1008 Network send error to SD. 
ERR=Input/output error
Or  (when increasing MaximumNetworkBuffer)
*   Error: lib/bsock.c:397 Wrote 130277 bytes to client:192.168.0.17:9102, 
but only 98304 accepted.
Fatal error: filed/backup.c:1008 Network send error to SD. 
ERR=Input/output error
Or (Migration job)
*   Fatal error: append.c:319 Network error reading from FD. ERR=Erreur 
d'entrée/sortie
Error: bsock.c:571 Read expected 131118 got 114684 from Storage 
daemon:192.168.10.54:9103

It's look like there is a gap between send and receive buffer and looking at 
the source code, encryption could affect buffer size due to encryption.
So I  think Bacula-SD could be in cause (maybe).
Could it be a bug?
What could I do to determine the problem (activating debug in sd daemon ? )

I use Bacula 13.0.3 on Debian 12 , with ssl 1.1

Thank for helping. Backup scenarios must have an a step of relocating the 
backup media to be reliable.

___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-05 Thread Lionel PLASSE
Job lost connection ...

-Message d'origine-
De : Lionel PLASSE 
Envoyé : dimanche 5 novembre 2023 08:00
À : Lionel PLASSE ; Bill Arlofski ; 
bacula-users@lists.sourceforge.net
Objet : RE: [Bacula-users] fd lose connection

I upgraded net device drivers to last version
I change some network adapters conf (Broadcom netxtrem2) and gain some Mb/s (up 
to 10 now)
Backup still running.

Will see if it achieve job well

-Message d'origine-
De : Lionel PLASSE 
Envoyé : vendredi 3 novembre 2023 22:02
À : Bill Arlofski ; bacula-users@lists.sourceforge.net
Objet : Re: [Bacula-users] fd lose connection

Yes , you're right

But the job work well for an other machine through the same vpn  and the job 
last 9hours .
And from the hardware vpn gateway,  I don't have much configuration choice.  It 
could be a direction.

Could it be a problem of Debian 12 which seems not to be official for Bacula 
v13 ?

"SD calls Client" directive was activated , don't know why ( I tried so many 
conf to solve it :).  I will re-test without

And for the slow rate I am trying the Network buffer size directive on FD and 
SD  but not really conclusive.
But :   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
but only 65536 accepted
Tells me to look after the send and receive buffer on network drivers 

Bacula doc :
Note, on certain Windows machines, there are reports that the transfer rates 
are very slow and this seems to be related to the default 65,536 size. On 
systems where the transfer rates seem abnormally slow compared to other 
systems, you might try setting the Maximum Network Buffer Size to 32,768 in 
both the File daemon and in the Storage daemon ...
The default size was chosen to be relatively large but not too big in the case 
that you are transmitting data over Internet. It is clear that on a high speed 
local network, you can increase this number and improve performance. For 
example, some users have found that if you use a value of 65,536 bytes they get 
five to ten times the throughput. Larger values for most users don’t seem to 
improve performance. If you are interested in improving your backup speeds, 
this is definitely a place to experiment. You will probably also want to make 
the corresponding change in each of your File daemons conf files.

I'm disappointed, and made so many inoperant conf changes , that I will try to 
with v9.6.6 ( I used before on my lan)  to see if the problem is the same .

___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-05 Thread Lionel PLASSE
I upgraded net device drivers to last version
I change some network adapters conf (Broadcom netxtrem2) and gain some Mb/s (up 
to 10 now)
Backup still running.

Will see if it achieve job well

-Message d'origine-
De : Lionel PLASSE  
Envoyé : vendredi 3 novembre 2023 22:02
À : Bill Arlofski ; bacula-users@lists.sourceforge.net
Objet : Re: [Bacula-users] fd lose connection

Yes , you're right 

But the job work well for an other machine through the same vpn  and the job 
last 9hours .
And from the hardware vpn gateway,  I don't have much configuration choice.  It 
could be a direction.

Could it be a problem of Debian 12 which seems not to be official for Bacula 
v13 ?

"SD calls Client" directive was activated , don't know why ( I tried so many 
conf to solve it :).  I will re-test without  

And for the slow rate I am trying the Network buffer size directive on FD and 
SD  but not really conclusive.  
But :   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
but only 65536 accepted   
Tells me to look after the send and receive buffer on network drivers 

Bacula doc : 
Note, on certain Windows machines, there are reports that the transfer rates 
are very slow and this seems to be related to the default 65,536 size. On 
systems where the transfer rates seem abnormally slow compared to other 
systems, you might try setting the Maximum Network Buffer Size to 32,768 in 
both the File daemon and in the Storage daemon ...
The default size was chosen to be relatively large but not too big in the case 
that you are transmitting data over Internet. It is clear that on a high speed 
local network, you can increase this number and improve performance. For 
example, some users have found that if you use a value of 65,536 bytes they get 
five to ten times the throughput. Larger values for most users don’t seem to 
improve performance. If you are interested in improving your backup speeds, 
this is definitely a place to experiment. You will probably also want to make 
the corresponding change in each of your File daemons conf files.

I'm disappointed, and made so many inoperant conf changes , that I will try to 
with v9.6.6 ( I used before on my lan)  to see if the problem is the same .


smime.p7s
Description: S/MIME cryptographic signature
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-03 Thread Lionel PLASSE
Yes , you're right 

But the job work well for an other machine through the same vpn  and the job 
last 9hours .
And from the hardware vpn gateway,  I don't have much configuration choice.  It 
could be a direction.

Could it be a problem of Debian 12 which seems not to be official for Bacula 
v13 ?

"SD calls Client" directive was activated , don't know why ( I tried so many 
conf to solve it :).  I will re-test without  

And for the slow rate I am trying the Network buffer size directive on FD and 
SD  but not really conclusive.  
But :   Error: lib/bsock.c:397 Wrote 65566 bytes to client:192.168.0.4:9102, 
but only 65536 accepted   
Tells me to look after the send and receive buffer on network drivers 

Bacula doc : 
Note, on certain Windows machines, there are reports that the transfer rates 
are very slow
and this seems to be related to the default 65,536 size. On systems where the 
transfer
rates seem abnormally slow compared to other systems, you might try setting the 
Maximum
Network Buffer Size to 32,768 in both the File daemon and in the Storage daemon
...
The default size was chosen to be relatively large but not too big in the case 
that you are
transmitting data over Internet. It is clear that on a high speed local 
network, you can
increase this number and improve performance. For example, some users have 
found that
if you use a value of 65,536 bytes they get five to ten times the throughput. 
Larger values
for most users don’t seem to improve performance. If you are interested in 
improving your
backup speeds, this is definitely a place to experiment. You will probably also 
want to
make the corresponding change in each of your File daemons conf files.

I'm disappointed, and made so many inoperant conf changes , that I will try to 
with v9.6.6 ( I used before on my lan)  to see if the problem is the same .


smime.p7s
Description: S/MIME cryptographic signature
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-03 Thread Rob Gerber
Lionel,

I found this post from 10 years ago. That user had a similar problem. Kern
suggested add "Heartbeat Interval = 300" to the appropriate resources on
both sides (dir and FD in your case). I see you mentioned setting
heartbeat, did you set it on both sides?

Source: old post to this list
https://bacula-users.narkive.com/i6Y7gQ1J/lib-bsock-c-397-write-error-sending-1-bytes-to-storage-daemon


Robert Gerber
402-237-8692
r...@craeon.net

On Fri, Nov 3, 2023, 1:51 PM Lionel PLASSE  wrote:

> Hello ,
>
> I am losing connection between sd and windows fd  and  I don't understand
> why.
> The  machine lose every time the connection, sometimes after an hour  and
> often after 2 hours of job.
> And the transfer rate is so slow around 3 to 5 Mb/s (instead of up to 50 in
> other configurations)
>
> I tried many Network Maximum Rate values (32768 and higher and smaller) but
> every time I have more byte written to client (65566 > 65536) and
> connection
> fall down.
>
> The server is in win2019 version  with bacula-fd  win32 v13 and the Bacula
> Dir is in v13.0.3 on  a Debian 12 (with libssl1.1 and 6.1.0-13-amd kernel)
>
> I before was using bacula v9.6 and all wad good through my Lan network
> when
> backing up this same windows server.
>
> I am now using a Debian server  in a different place, through a site to
> site
> permanent ipsec VPN ( general upload wan bandwith  80Mb/s)
> The jobs are good for the other machines (win2012-r2 & Debian 10) which
> reach a rate of 10 up to 30 Mb/s depending the time and size of files
> backed
> up.
>
> But for this one (and another one in win2008-r2) :
> serv-dc-01-fd JobId 122: Fatal error: filed/backup.c:1008 Network send
> error
> to SD. ERR=Input/output error
> serv-dc-01-fd JobId 122: Error: lib/bsock.c:397 Wrote 65566 bytes to
> client:192.168.0.4:9102, but only 65536 accepted
>
> I check full duplex, opened firewall port, tcp_keep_alive & heartbeat,
> changed net buffer size,installed last driver .
>
> Any suggestion ?
> ___
> Bacula-users mailing list
> Bacula-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/bacula-users
>
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-03 Thread Bill Arlofski via Bacula-users

On 11/3/23 13:17, Rob Gerber wrote:

Oh, reading the error message again, did you set "heartbeat interval = 300" 
between Dir and SD? Your error message is from
the SD.

I don't know if FD will communicate directly with SD or if data from Fd goes 
through dir to SD.

Robert Gerber


Hello Robert,

Just an FYI, a HeartbeatInterval = 300 seconds has been hard-coded as the 
default - even when not explicitly set - for
several years now. :)

He may want to set a lower one on each Daemon (DIR, SD, FD), but I know from 
testing that anything less than 60 will be
ignored and 60 seconds will be used. ;)

But it really sounds to me that Lionel's problem is possibly an overly 
aggressive firewall closing sockets or it could also
have something to do with the IPsec VPN - since that seems to be the only real 
netowrking change - so that is where I would
look first.


Best regards,
Bill

--
Bill Arlofski
w...@protonmail.com



signature.asc
Description: OpenPGP digital signature
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


Re: [Bacula-users] fd lose connection

2023-11-03 Thread Rob Gerber
Oh, reading the error message again, did you set "heartbeat interval = 300"
between Dir and SD? Your error message is from the SD.

I don't know if FD will communicate directly with SD or if data from Fd
goes through dir to SD.

Robert Gerber
402-237-8692
r...@craeon.net

On Fri, Nov 3, 2023, 2:15 PM Rob Gerber  wrote:

> Lionel,
>
> I found this post from 10 years ago. That user had a similar problem. Kern
> suggested add "Heartbeat Interval = 300" to the appropriate resources on
> both sides (dir and FD in your case). I see you mentioned setting
> heartbeat, did you set it on both sides?
>
> Source: old post to this list
>
> https://bacula-users.narkive.com/i6Y7gQ1J/lib-bsock-c-397-write-error-sending-1-bytes-to-storage-daemon
>
>
> Robert Gerber
> 402-237-8692
> r...@craeon.net
>
> On Fri, Nov 3, 2023, 1:51 PM Lionel PLASSE  wrote:
>
>> Hello ,
>>
>> I am losing connection between sd and windows fd  and  I don't understand
>> why.
>> The  machine lose every time the connection, sometimes after an hour  and
>> often after 2 hours of job.
>> And the transfer rate is so slow around 3 to 5 Mb/s (instead of up to 50
>> in
>> other configurations)
>>
>> I tried many Network Maximum Rate values (32768 and higher and smaller)
>> but
>> every time I have more byte written to client (65566 > 65536) and
>> connection
>> fall down.
>>
>> The server is in win2019 version  with bacula-fd  win32 v13 and the Bacula
>> Dir is in v13.0.3 on  a Debian 12 (with libssl1.1 and 6.1.0-13-amd kernel)
>>
>> I before was using bacula v9.6 and all wad good through my Lan network
>> when
>> backing up this same windows server.
>>
>> I am now using a Debian server  in a different place, through a site to
>> site
>> permanent ipsec VPN ( general upload wan bandwith  80Mb/s)
>> The jobs are good for the other machines (win2012-r2 & Debian 10) which
>> reach a rate of 10 up to 30 Mb/s depending the time and size of files
>> backed
>> up.
>>
>> But for this one (and another one in win2008-r2) :
>> serv-dc-01-fd JobId 122: Fatal error: filed/backup.c:1008 Network send
>> error
>> to SD. ERR=Input/output error
>> serv-dc-01-fd JobId 122: Error: lib/bsock.c:397 Wrote 65566 bytes to
>> client:192.168.0.4:9102, but only 65536 accepted
>>
>> I check full duplex, opened firewall port, tcp_keep_alive & heartbeat,
>> changed net buffer size,installed last driver .
>>
>> Any suggestion ?
>> ___
>> Bacula-users mailing list
>> Bacula-users@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/bacula-users
>>
>
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users


[Bacula-users] fd lose connection

2023-11-03 Thread Lionel PLASSE
Hello ,

I am losing connection between sd and windows fd  and  I don't understand
why.
The  machine lose every time the connection, sometimes after an hour  and
often after 2 hours of job.
And the transfer rate is so slow around 3 to 5 Mb/s (instead of up to 50 in
other configurations)

I tried many Network Maximum Rate values (32768 and higher and smaller) but
every time I have more byte written to client (65566 > 65536) and connection
fall down.

The server is in win2019 version  with bacula-fd  win32 v13 and the Bacula
Dir is in v13.0.3 on  a Debian 12 (with libssl1.1 and 6.1.0-13-amd kernel)

I before was using bacula v9.6 and all wad good through my Lan network  when
backing up this same windows server.

I am now using a Debian server  in a different place, through a site to site
permanent ipsec VPN ( general upload wan bandwith  80Mb/s)
The jobs are good for the other machines (win2012-r2 & Debian 10) which
reach a rate of 10 up to 30 Mb/s depending the time and size of files backed
up.

But for this one (and another one in win2008-r2) :
serv-dc-01-fd JobId 122: Fatal error: filed/backup.c:1008 Network send error
to SD. ERR=Input/output error
serv-dc-01-fd JobId 122: Error: lib/bsock.c:397 Wrote 65566 bytes to
client:192.168.0.4:9102, but only 65536 accepted

I check full duplex, opened firewall port, tcp_keep_alive & heartbeat,
changed net buffer size,installed last driver . 

Any suggestion ? 


smime.p7s
Description: S/MIME cryptographic signature
___
Bacula-users mailing list
Bacula-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-users