Thank you!

> Hi,
>
> if you want to work around the job terminates after 6 days issue, 
> there are two places in the code that you need to change.  In the 
> 5.2.13 code base, the first one is in bnet.c line 784 and then line 79 
> in bsock.c - both of these source files are in the lib directory.
>
> hope this helps,
>
>
> --tom
>
>
>> Hi. Thank you, I'll try.
>> But why my job terminates exactly after 6 days?
>>
>>> Hello,
>>>
>>> Your problem is a comm line drop not a watch dog problem.
>>>
>>> Put HeartBeatInterval = 300 in your Dir, SD, and FDs.
>>>
>>> Best regards,
>>> Kern
>>>
>>> On 01/15/2014 09:28 AM, Andrey Chebotarev wrote:
>>>> I asked because in the latest version(5.2.13) modifying sources 
>>>> doesn't
>>>> work anymore.
>>>> I've changed this part:
>>>>        /*
>>>>         * ****FIXME**** reduce this to a few hours once
>>>>         *   heartbeats are implemented
>>>>         */
>>>>        bsock->timeout = 60 * 60 * 30 * 24;
>>>>
>>>> but job still terminates after 6 days :(
>>>>
>>>> In 5.2.11 I didn't have such problem.
>>>> What has been changed in 5.2.13 ? In which part of code I can fix it?
>>>>
>>>>> Hi.
>>>>> I'm using bacula to backup huge stuff, about 100TB. Usually it takes
>>>>> about 15-16 days.
>>>>> I've faced with a problem. As I understood, in bacula there is 
>>>>> mechanism
>>>>> which cares about jobs(watchdog timer). And with this mechanism I 
>>>>> have
>>>>> trouble. My job terminates after 6 days with error message:
>>>>>
>>>>> 2013-12-29 16:42:56baculasrv-dir JobId 8013: Error: Watchdog sending
>>>>> kill after 518427 secs to thread stalled reading File daemon.
>>>>> 2013-12-29 16:42:56baculasrv-dir JobId 8013: Fatal error: Network 
>>>>> error
>>>>> with FD during Backup: ERR=Interrupted system call
>>>>> 2013-12-29 16:42:57baculasrv-sd JobId 8013: Elapsed time=143:47:09,
>>>>> Transfer rate=58.09 M Bytes/second
>>>>> 2013-12-29 16:42:57baculasrv-dir JobId 8013: Error: Director's 
>>>>> comm line
>>>>> to SD dropped.
>>>>> 2013-12-29 16:42:57baculasrv-dir JobId 8013: Fatal error: No Job 
>>>>> status
>>>>> returned from FD.
>>>>> 2013-12-29 16:42:57baculasrv-dir JobId 8013: Error: Bacula 
>>>>> baculasrv-dir
>>>>> 5.2.13 (19Jan13):
>>>>>
>>>>> But my job is still active. Where is the problem? FD isn't sending
>>>>> "keep-alive" packets or 6 days is hardcoded interval of maximum 
>>>>> running
>>>>> time?
>>>>>
>>>>> In sources I see this(src/lib/bnet.c):
>>>>>
>>>>>        /*
>>>>>         * ****FIXME**** reduce this to a few hours once
>>>>>         *   heartbeats are implemented
>>>>>         */
>>>>>        bsock->timeout = 60 * 60 * 6 * 24;   /* 6 days timeout */
>>>>>
>>>>> Is it mean that  heartbeat isn't implemented yet?
>>>>>
>>>>> Now I'm changing that interval to 30 days.
>>>>> Is there any more beautiful way?
>>>>>
>>>>> ------------------------------------------------------------------------------
>>>>>  
>>>>>
>>>>> Rapidly troubleshoot problems before they affect your business. 
>>>>> Most IT
>>>>> organizations don't have a clear picture of how application 
>>>>> performance
>>>>> affects their revenue. With AppDynamics, you get 100% visibility 
>>>>> into your
>>>>> Java,.NET, & PHP application. Start your 15-day FREE TRIAL of 
>>>>> AppDynamics Pro!
>>>>> http://pubads.g.doubleclick.net/gampad/clk?id=84349831&iu=/4140/ostg.clktrk
>>>>>  
>>>>>
>>>>> _______________________________________________
>>>>> Bacula-devel mailing list
>>>>> Bacula-devel@lists.sourceforge.net
>>>>> https://lists.sourceforge.net/lists/listinfo/bacula-devel
>>>> ------------------------------------------------------------------------------
>>>>  
>>>>
>>>> CenturyLink Cloud: The Leader in Enterprise Cloud Services.
>>>> Learn Why More Businesses Are Choosing CenturyLink Cloud For
>>>> Critical Workloads, Development Environments & Everything In Between.
>>>> Get a Quote or Start a Free Trial Today.
>>>> http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
>>>>  
>>>>
>>>> _______________________________________________
>>>> Bacula-devel mailing list
>>>> Bacula-devel@lists.sourceforge.net
>>>> https://lists.sourceforge.net/lists/listinfo/bacula-devel
>>>>
>>
>>
>> ------------------------------------------------------------------------------
>>  
>>
>> CenturyLink Cloud: The Leader in Enterprise Cloud Services.
>> Learn Why More Businesses Are Choosing CenturyLink Cloud For
>> Critical Workloads, Development Environments & Everything In Between.
>> Get a Quote or Start a Free Trial Today.
>> http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk 
>>
>> _______________________________________________
>> Bacula-devel mailing list
>> Bacula-devel@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/bacula-devel
>>
>


------------------------------------------------------------------------------
CenturyLink Cloud: The Leader in Enterprise Cloud Services.
Learn Why More Businesses Are Choosing CenturyLink Cloud For
Critical Workloads, Development Environments & Everything In Between.
Get a Quote or Start a Free Trial Today. 
http://pubads.g.doubleclick.net/gampad/clk?id=119420431&iu=/4140/ostg.clktrk
_______________________________________________
Bacula-devel mailing list
Bacula-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/bacula-devel

Reply via email to