Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Martin Simmons wrote: If that is OK, then I suggest running the SD with debug level 200, which might give us a clue where the error occurs. So far I have been unable to get it to fail using -d200, while it does fail if I don't specify a debug level. Maybe there is a timing issue. I'll keep trying. Steve Strangely, it has also not failed for me at all since I have had debugging turned up. I will continue to keep debugging high over the next few days and see if it fails at all. --Marc Okay, finally got the problem to happen again: ---snip--- 2-Dec 03:05 escabot-dir JobId 8558: Start Backup JobId 8558, Job=sun_boot_kim_topaz.2007-12-12_02.00.42 12-Dec 03:05 escabot-dir JobId 8558: Using Device T50-Drive-1 12-Dec 03:05 escabot-sd JobId 8558: Error: Autochanger Volume not found in slot 28. Setting InChanger to zero in catalog. 12-Dec 03:05 escabot-sd JobId 8558: Fatal error: askdir.c:332 NULL Volume name. This shouldn't happen!!! 12-Dec 03:05 escabot-sd JobId 8558: Warning: Director wanted Volume 67LX. Current Volume not acceptable because: 1998 Volume status is , not in Pool. 12-Dec 03:05 escabot-sd JobId 8558: Fatal error: Job 8558 canceled. 12-Dec 03:05 escabot-fd JobId 8558: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data ---snip--- Then here is what I got in the log with increased sd debugging: ---snip--- escabot-sd: dircmd.c:214-0 dird: JobId=8558 job=sun_boot_kim_topaz.2007-12-12_02.00.42 job_name=sun_boot_kim_topaz client_name=escabot-fd type=66 level=68 FileSet=sun_snap-boot_kim_topaz NoAttr=0 SpoolAttr=0 FileSetMD5=wRNfu9+S35kWu9/dG8+UqA SpoolData=0 WritePartAfterJob=1 PreferMountedVols=0 escabot-sd: dircmd.c:228-0 Do command: JobId= escabot-sd: job.c:87-0 dird: JobId=8558 job=sun_boot_kim_topaz.2007-12-12_02.00.42 job_name=sun_boot_kim_topaz client_name=escabot-fd type=66 level=68 FileSet=sun_snap-boot_kim_topaz NoAttr=0 SpoolAttr=0 FileSetMD5=wRNfu9+S35kWu9/dG8+UqA SpoolData=0 WritePartAfterJob=1 PreferMountedVols=0 escabot-sd: job.c:141-0 dird jid=8558: 3000 OK Job SDid=316 SDtime=1196884941 Authorization=GDBD-OLJK-FFFI-GOKH-ALGP-GMGF-JHMN-DENF escabot-sd: pythonlib.c:237-0 No startup module. escabot-sd: dircmd.c:214-0 dird: use storage=SpectraLogicT50 media_type=LTO-3 pool_name=Daily pool_type=Backup append=1 copy=0 stripe=0 escabot-sd: dircmd.c:228-0 Do command: use storage= escabot-sd: reserve.c:586-0 jid=8558 dird: use storage=SpectraLogicT50 media_type=LTO-3 pool_name=Daily pool_type=Backup append=1 copy=0 stripe=0 escabot-sd: reserve.c:615-0 jid=8558 dird device: use device=SpectraLogicT50 escabot-sd: reserve.c:632-0 jid=8558 Storage=SpectraLogicT50 media_type=LTO-3 pool=Daily pool_type=Backup append=1 escabot-sd: reserve.c:634-0 jid=8558 Device=SpectraLogicT50 escabot-sd: reserve.c:683-0 jid=8558 PrefMnt=0 exact=0 suitable=0 chgronly=1 any=0 escabot-sd: reserve.c:828-0 jid=8558 PrefMnt=0 exact=0 suitable=0 chgronly=1 escabot-sd: reserve.c:986-0 jid=8558 search res for SpectraLogicT50 escabot-sd: reserve.c:989-0 jid=8558 Try match changer res=SpectraLogicT50 escabot-sd: reserve.c:995-0 jid=8558 Try changer device T50-Drive-1 escabot-sd: reserve.c:1058-0 jid=8558 chk MediaType device=LTO-3 request=LTO-3 escabot-sd: reserve.c:1081-0 jid=8558 try reserve T50-Drive-1 escabot-sd: reserve.c:1095-0 jid=8558 have_vol=0 vol= escabot-sd: reserve.c:1254-0 jid=8558 reserve_append device is T50-Drive-1 (/dev/nst0) escabot-sd: reserve.c:1286-0 jid=8558 PrefMnt=0 exact=0 suitable=1 chgronly=1 any=0 escabot-sd: reserve.c:1365-0 jid=8558 OK Res Unused autochanger T50-Drive-1 (/dev/nst0). escabot-sd: reserve.c:1264-0 jid=8558 Inc reserve=1 dev=T50-Drive-1 (/dev/nst0) 81b0828 escabot-sd: reserve.c:1105-0 jid=8558 Reserved=1 dev_name=SpectraLogicT50 mediatype=LTO-3 pool=Daily ok=1 escabot-sd: askdir.c:256-0 dir_find_next_appendable_volume escabot-sd: askdir.c:271-0 dird: CatReq Job=sun_boot_kim_topaz.2007-12-12_02.00.42 FindMedia=1 pool_name=Daily media_type=LTO-3 escabot-sd: askdir.c:182-0 dird 1000 OK VolName=67LX VolJobs=12 VolFiles=19 VolBlocks=139901 VolBytes=9025357824 VolMounts=11 VolErrors=0 VolWrites=11155326 MaxVolBytes=0 VolCapacityBytes=0 VolStatus=Append Slot=28 MaxVolJobs=0 MaxVolFiles=0 InChanger=1 VolReadTime=0 VolWriteTime=3017012185 EndFile=18 EndBlock=8360 VolParts=0 LabelType=0 MediaId=67 escabot-sd: askdir.c:204-0 do_reqest_vol_info return true slot=28 Volume=67LX escabot-sd: reserve.c:406-0 jid=8558 find_vol=67LX found=1 escabot-sd: reserve.c:181-0 jid=8558 List from find_volume: 67LX at 81b1eb0 on device T50-Drive-1 (/dev/nst0) escabot-sd: reserve.c:538-0 jid=8558 Vol=67LX on same dev. escabot-sd: reserve.c:313-0 jid=8558 reserve_volume 67LX escabot-sd: reserve.c:181-0 jid=8558 List from begin reserve_volume: 67LX at 81b1eb0 on device T50-Drive-1 (/dev/nst0) escabot-sd: reserve.c:181-0 jid=8558 List from end new volume: 67LX at
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Martin Simmons wrote: If that is OK, then I suggest running the SD with debug level 200, which might give us a clue where the error occurs. So far I have been unable to get it to fail using -d200, while it does fail if I don't specify a debug level. Maybe there is a timing issue. I'll keep trying. Steve - SF.Net email is sponsored by: Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Martin Simmons wrote: If that is OK, then I suggest running the SD with debug level 200, which might give us a clue where the error occurs. So far I have been unable to get it to fail using -d200, while it does fail if I don't specify a debug level. Maybe there is a timing issue. I'll keep trying. Steve Strangely, it has also not failed for me at all since I have had debugging turned up. I will continue to keep debugging high over the next few days and see if it fails at all. --Marc - SF.Net email is sponsored by: Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users !DSPAM:1,475ac815206871589165409! - SF.Net email is sponsored by: Check out the new SourceForge.net Marketplace. It's the best place to buy or sell services for just about anything Open Source. http://sourceforge.net/services/buy/index.php ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007 11:36:33 -0500 (EST), Steve Thompson said: On Wed, 5 Dec 2007, [EMAIL PROTECTED] wrote: I am still experiencing this problem on a regular basis; not every job does this, but it seems a good 40% do each night. [...] 05-Dec 03:33 escabot-fd JobId 8219: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data I see this very often as well, and I am using disk exclusively. It also happens about 40% of the time, and has done since I started with bacula at 1.38 (now on 2.2.4). I'd like to see a proper explanation of what this message really means. It's certainly annoying. It is a generic error message meaning the SD did like something so doesn't tell you much. Sometimes the text after the word got is useful, but more often you have to look at the previous messages to find out what it didn't like. __Martin - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007 13:46:26 -0500 (EST), Steve Thompson said: On Wed, 5 Dec 2007, Dan Langille wrote: My first idea: different versions of SD and FD, with one trying to use a command the other does not recognize. What version is each of: bacula-dir, bacula-fd, bacula-sd They are all the same at 2.2.4. It happens even in the case where bacula-dir, bacula-fd and bacula-sd are running on the same machine. Everything was rebuilt from source by myself, and installed on a clean O/S, but I have had this problem with every single version of bacula since 1.38. The SD must be failing to pass the error message back to the Director. Do you see messages any from the SD in your logs (e.g. about recycling)? If not, check that the SD's Messages resource is configured correctly, e.g. if your bacula-sd.conf contains Director { Name = foo-bar ... } then use Messages { Name = Standard director = foo-bar = all } If that is OK, then I suggest running the SD with debug level 200, which might give us a clue where the error occurs. __Martin - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Steve Thompson wrote: On Wed, 5 Dec 2007, Martin Simmons wrote: On Wed, 5 Dec 2007 11:36:33 -0500 (EST), Steve Thompson said: I see this very often as well, and I am using disk exclusively. It also happens about 40% of the time, and has done since I started with bacula at 1.38 (now on 2.2.4). I'd like to see a proper explanation of what this message really means. It's certainly annoying. It is a generic error message meaning the SD did like something so doesn't tell you much. Sometimes the text after the word got is useful, but more often you have to look at the previous messages to find out what it didn't like. This is all I get, consistently: 05-Dec 13:02 vger-dir: No prior Full backup Job record found. 05-Dec 13:02 vger-dir: No prior or suitable Full backup found in catalog. Doing FULL backup. 05-Dec 13:02 vger-dir: Start Backup JobId 1093, Job=vger_data1.2007-12-05_13.02.03 05-Dec 13:02 vger-dir: There are no more Jobs associated with Volume Backup-0073. Marking it purged. 05-Dec 13:02 vger-dir: All records pruned from Volume Backup-0073; marking it Purged 05-Dec 13:02 vger-dir: Recycled volume Backup-0073 05-Dec 13:02 vger-dir: Using Device Backup 05-Dec 13:02 vger-fd: vger_data1.2007-12-05_13.02.03 Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data My first idea: different versions of SD and FD, with one trying to use a command the other does not recognize. What version is each of: bacula-dir, bacula-fd, bacula-sd Please don't go by memory, but check it, via the status command in bconsole. Something may have changed. Sorry if you already mentioned it... Mine is actually a bit different then Steve's message, I consistently get these: 05-Dec 03:34 escabot-dir JobId 8220: Using Device T50-Drive-1 05-Dec 03:34 escabot-sd JobId 8220: Error: Autochanger Volume not found in slot 2. Setting InChanger to zero in catalog. 05-Dec 03:34 escabot-fd JobId 8220: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 05-Dec 03:34 escabot-sd JobId 8220: Fatal error: askdir.c:332 NULL Volume name. This shouldn't happen!!! 05-Dec 03:34 escabot-sd JobId 8220: Warning: Director wanted Volume 97LX. Current Volume not acceptable because: 1998 Volume status is , not in Pool. 05-Dec 03:34 escabot-sd JobId 8220: Fatal error: Job 8220 canceled. How does it get a NULL volume name? Both seem to be the same version: escabot-dir Version: 2.2.6 (10 November 2007) i686-pc-linux-gnu gentoo escabot-sd Version: 2.2.6 (10 November 2007) i686-pc-linux-gnu gentoo --Marc -- Dan Langille - http://www.langille.org/ BSDCan - The Technical BSD Conference: http://www.bsdcan.org/ - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users !DSPAM:1,4756ebd3206872772910599! - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Steve Thompson wrote: On Wed, 5 Dec 2007, Martin Simmons wrote: On Wed, 5 Dec 2007 11:36:33 -0500 (EST), Steve Thompson said: I see this very often as well, and I am using disk exclusively. It also happens about 40% of the time, and has done since I started with bacula at 1.38 (now on 2.2.4). I'd like to see a proper explanation of what this message really means. It's certainly annoying. It is a generic error message meaning the SD did like something so doesn't tell you much. Sometimes the text after the word got is useful, but more often you have to look at the previous messages to find out what it didn't like. This is all I get, consistently: 05-Dec 13:02 vger-dir: No prior Full backup Job record found. 05-Dec 13:02 vger-dir: No prior or suitable Full backup found in catalog. Doing FULL backup. 05-Dec 13:02 vger-dir: Start Backup JobId 1093, Job=vger_data1.2007-12-05_13.02.03 05-Dec 13:02 vger-dir: There are no more Jobs associated with Volume Backup-0073. Marking it purged. 05-Dec 13:02 vger-dir: All records pruned from Volume Backup-0073; marking it Purged 05-Dec 13:02 vger-dir: Recycled volume Backup-0073 05-Dec 13:02 vger-dir: Using Device Backup 05-Dec 13:02 vger-fd: vger_data1.2007-12-05_13.02.03 Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data My first idea: different versions of SD and FD, with one trying to use a command the other does not recognize. What version is each of: bacula-dir, bacula-fd, bacula-sd Please don't go by memory, but check it, via the status command in bconsole. Something may have changed. Sorry if you already mentioned it... -- Dan Langille - http://www.langille.org/ BSDCan - The Technical BSD Conference: http://www.bsdcan.org/ - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Martin Simmons wrote: On Wed, 5 Dec 2007 13:46:26 -0500 (EST), Steve Thompson said: They are all the same at 2.2.4. It happens even in the case where bacula-dir, bacula-fd and bacula-sd are running on the same machine. Everything was rebuilt from source by myself, and installed on a clean O/S, but I have had this problem with every single version of bacula since 1.38. The SD must be failing to pass the error message back to the Director. Do you see messages any from the SD in your logs (e.g. about recycling)? If not, check that the SD's Messages resource is configured correctly, e.g. if your bacula-sd.conf contains [...] Yes, this is all correct. If that is OK, then I suggest running the SD with debug level 200, which might give us a clue where the error occurs. Will do, and will report back. Thanks, Steve - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, Dan Langille wrote: My first idea: different versions of SD and FD, with one trying to use a command the other does not recognize. What version is each of: bacula-dir, bacula-fd, bacula-sd They are all the same at 2.2.4. It happens even in the case where bacula-dir, bacula-fd and bacula-sd are running on the same machine. Everything was rebuilt from source by myself, and installed on a clean O/S, but I have had this problem with every single version of bacula since 1.38. Steve - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, [EMAIL PROTECTED] wrote: I am still experiencing this problem on a regular basis; not every job does this, but it seems a good 40% do each night. [...] 05-Dec 03:33 escabot-fd JobId 8219: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data I see this very often as well, and I am using disk exclusively. It also happens about 40% of the time, and has done since I started with bacula at 1.38 (now on 2.2.4). I'd like to see a proper explanation of what this message really means. It's certainly annoying. Steve - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Wed, 5 Dec 2007, [EMAIL PROTECTED] wrote: I am still experiencing this problem on a regular basis; not every job does this, but it seems a good 40% do each night. [...] 05-Dec 03:33 escabot-fd JobId 8219: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data I see this very often as well, and I am using disk exclusively. It also happens about 40% of the time, and has done since I started with bacula at 1.38 (now on 2.2.4). I'd like to see a proper explanation of what this message really means. It's certainly annoying. Steve Yeah, I've seen one other post on this list with the same problem, but he never got an answer (at least not on the list). I e-mailed him directly to see if he has come up with a solution, but I haven't gotten a response yet. This is especially frustrating as sometimes only half of my backup jobs work each night. I haven't been sleeping well lately knowing my backups are weak, if you know what I mean. =) --Marc - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users !DSPAM:1,4756d3a0206871384784741! - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
I am still experiencing this problem on a regular basis; not every job does this, but it seems a good 40% do each night. I cleaned both of my tape drives last night, but I still get the fatal errors. Here is another example: 05-Dec 03:32 escabot-dir JobId 8219: BeforeJob: run command /usr/local/sbin/sun_snap_ctl.sh start mssql_data 05-Dec 03:32 escabot-dir JobId 8219: BeforeJob: start: mssql_data 05-Dec 03:32 escabot-dir JobId 8219: BeforeJob: try # 1 05-Dec 03:32 escabot-dir JobId 8219: BeforeJob: sleeping... 05-Dec 03:33 escabot-dir JobId 8219: BeforeJob: rescanning fibre... 05-Dec 03:33 escabot-dir JobId 8219: BeforeJob: sleeping... 05-Dec 03:33 escabot-dir JobId 8219: BeforeJob: found the node: /dev/sdw 05-Dec 03:33 escabot-dir JobId 8219: BeforeJob: rereading the partition table for /dev/sdw... 05-Dec 03:33 escabot-dir JobId 8219: BeforeJob: sleeping... 05-Dec 03:33 escabot-dir JobId 8219: BeforeJob: mounting /dev/sdw1 @ /snapshots/mssql_data/1... 05-Dec 03:33 escabot-dir JobId 8219: Start Backup JobId 8219, Job=sun_mssql_data.2007-12-05_02.00.39 05-Dec 03:33 escabot-dir JobId 8219: Using Device T50-Drive-1 05-Dec 03:33 escabot-sd JobId 8219: Error: Autochanger Volume not found in slot 2. Setting InChanger to zero in catalog. 05-Dec 03:33 escabot-fd JobId 8219: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 05-Dec 03:33 escabot-sd JobId 8219: Fatal error: askdir.c:332 NULL Volume name. This shouldn't happen!!! 05-Dec 03:33 escabot-sd JobId 8219: Warning: Director wanted Volume 97LX. Current Volume not acceptable because: 1998 Volume status is , not in Pool. 05-Dec 03:33 escabot-sd JobId 8219: Fatal error: Job 8219 canceled. 05-Dec 03:33 escabot-dir JobId 8219: Error: Bacula escabot-dir 2.2.6 (10Nov07): 05-Dec-2007 03:33:27 Build OS: i686-pc-linux-gnu gentoo JobId: 8219 Job:sun_mssql_data.2007-12-05_02.00.39 Backup Level: Differential, since=2007-12-03 15:42:02 Client: escabot-fd 2.2.6 (10Nov07) i686-pc-linux-gnu,gentoo, FileSet:sun_snap-mssql_data 2007-07-26 09:36:24 Pool: Daily (From Run pool override) Storage:SpectraLogicT50 (From Job resource) Scheduled time: 05-Dec-2007 02:00:00 Start time: 05-Dec-2007 03:33:27 End time: 05-Dec-2007 03:33:27 Elapsed time: 0 secs Priority: 22 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None VSS:no Encryption: no Volume name(s): Volume Session Id: 87 Volume Session Time:1196800690 Last Volume Bytes: 85,736,125,440 (85.73 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** Whats strange is a job or two later, it will use the same volume just fine. I'm just not sure if why it gets NULL volume names -- is that coming from the tape library, or maybe something is corrupt in the database? Anyone else having/seen this problem? Thanks, Marc Hi, I am using Bacula 2.2.6 with a Spectra Logic T50 tape library (with 2 LTO3 drives). I have about 20 jobs that are run nightly, allowing Bacula to run multiple jobs at once (one on each drive). Here are my configuration files log: http://esweb.mcc.edu/~marc.smith/bacula/ This seems to happen somewhat randomly, usually happens to a few of my jobs each night. Other nights it doesn't happen to any. I can't really seem to reproduce the problem reliably, I happened to get lucky this time when I started 2 jobs. Here are the errors: 27-Nov 17:20 escabot-fd JobId 7784: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3 000 OK data , got 3903 Error append data 27-Nov 17:20 escabot-sd JobId 7784: Fatal error: 3992 Bad autochanger load slot 10, drive 1: ERR=Child died from signal 15: Termination. Results=source Element Address 4105 is Empty Program killed by Bacula watchdog (timeout) 27-Nov 17:20 escabot-sd JobId 7783: 3301 Issuing autochanger loaded? drive 0 command. 27-Nov 17:20 escabot-dir JobId 7784: Error: Bacula escabot-dir 2.2.6 (10Nov07): 27-Nov-2007 17:20:07 Build OS: i686-pc-linux-gnu gentoo JobId: 7784 Job:sun_squid_data.2007-11-27_17.12.04 Backup Level: Differential, since=2007-11-25 16:38:02 Client: escabot-fd 2.2.6 (10Nov07) i686-pc-linux-gnu,gentoo, FileSet:sun_snap-squid_data 2007-07-26 09:36:19 Pool: Daily (From Job resource) Storage:SpectraLogicT50 (From Job resource) Scheduled time: 27-Nov-2007 17:12:20 Start time:
[Bacula-users] Bad response to Append Data command.
Hi, I am using Bacula 2.2.6 with a Spectra Logic T50 tape library (with 2 LTO3 drives). I have about 20 jobs that are run nightly, allowing Bacula to run multiple jobs at once (one on each drive). Here are my configuration files log: http://esweb.mcc.edu/~marc.smith/bacula/ This seems to happen somewhat randomly, usually happens to a few of my jobs each night. Other nights it doesn't happen to any. I can't really seem to reproduce the problem reliably, I happened to get lucky this time when I started 2 jobs. Here are the errors: 27-Nov 17:20 escabot-fd JobId 7784: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3 000 OK data , got 3903 Error append data 27-Nov 17:20 escabot-sd JobId 7784: Fatal error: 3992 Bad autochanger load slot 10, drive 1: ERR=Child died from signal 15: Termination. Results=source Element Address 4105 is Empty Program killed by Bacula watchdog (timeout) 27-Nov 17:20 escabot-sd JobId 7783: 3301 Issuing autochanger loaded? drive 0 command. 27-Nov 17:20 escabot-dir JobId 7784: Error: Bacula escabot-dir 2.2.6 (10Nov07): 27-Nov-2007 17:20:07 Build OS: i686-pc-linux-gnu gentoo JobId: 7784 Job:sun_squid_data.2007-11-27_17.12.04 Backup Level: Differential, since=2007-11-25 16:38:02 Client: escabot-fd 2.2.6 (10Nov07) i686-pc-linux-gnu,gentoo, FileSet:sun_snap-squid_data 2007-07-26 09:36:19 Pool: Daily (From Job resource) Storage:SpectraLogicT50 (From Job resource) Scheduled time: 27-Nov-2007 17:12:20 Start time: 27-Nov-2007 17:14:47 End time: 27-Nov-2007 17:20:07 Elapsed time: 5 mins 20 secs Priority: 22 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None VSS:no Encryption: no Volume name(s): Volume Session Id: 2 Volume Session Time:1196201503 Last Volume Bytes: 692,129,636,352 (692.1 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** 27-Nov 17:20 escabot-sd JobId 7783: 3302 Autochanger loaded? drive 0, result is Slot 10. 27-Nov 17:20 escabot-sd JobId 7783: Error: Autochanger Volume not found in slot 10. Setting InChanger to zero in catalog. 27-Nov 17:20 escabot-fd JobId 7783: Fatal error: job.c:1811 Bad response to Append Data command. Wanted 3 000 OK data , got 3903 Error append data 27-Nov 17:20 escabot-sd JobId 7783: Fatal error: askdir.c:332 NULL Volume name. This shouldn't happen!!! 27-Nov 17:20 escabot-sd JobId 7783: Warning: Director wanted Volume 34LX. Current Volume not acceptable because: 1998 Volume status is , not in Pool. 27-Nov 17:20 escabot-sd JobId 7783: Fatal error: Job 7783 canceled. 27-Nov 17:20 escabot-dir JobId 7783: Error: Bacula escabot-dir 2.2.6 (10Nov07): 27-Nov-2007 17:20:14 Build OS: i686-pc-linux-gnu gentoo JobId: 7783 Job:sun_home_fac_staff.2007-11-27_17.15.03 Backup Level: Differential, since=2007-11-25 12:40:08 Client: escabot-fd 2.2.6 (10Nov07) i686-pc-linux-gnu,gentoo, FileSet:sun_snap-home_fac_staff 2007-05-30 15:59:34 Pool: Daily (From Job resource) Storage:SpectraLogicT50 (From Job resource) Scheduled time: 27-Nov-2007 17:15:02 Start time: 27-Nov-2007 17:12:36 End time: 27-Nov-2007 17:20:14 Elapsed time: 7 mins 38 secs Priority: 22 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None VSS:no Encryption: no Volume name(s): Volume Session Id: 1 Volume Session Time:1196201503 Last Volume Bytes: 692,129,636,352 (692.1 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** Hopefully this is just something simple I am doing wrong. Thanks in advance for any help. --Marc - SF.Net email is sponsored by: The Future of Linux Business White Paper from Novell. From the desktop to the data center, Linux is going mainstream. Let it simplify your IT future. http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command. Wanted 3000 OK data, , got 3903 Error append data
Hello, Note for Arno: 1.39.26+ does a better job of helping one diagnose these problems by printing the full output from mtx when a failure occurs. On Friday 10 November 2006 00:46, Arno Lehmann wrote: Hi, On 11/9/2006 7:25 AM, Ryan Novosielski wrote: I've gotten this one... I'd love to know what it is too. :) Dat's an error ;-) Jake Goerzen wrote: What does it mean when this happens? and is there a way to fix it? 07-Nov 08:40 adam-dir: Start Backup JobId 369, Job=BackupACSRV.2006-11-07_08.40.54 07-Nov 08:40 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. Ok, more to the point, this is a failure from the mtx-changer script. It's not a timeout, that would be indicated by a TERM or KILL exit code, but seems to be problem mt or mtx, inside mtx-changer, report. I'd try to unmount the drive using bconsole, then use mtx-changer from the shell with the loaded command and then follow it's operation until you can determine where the problem comes from. Most probably, the mtx command will not get a proper response to a the status command. Most probably you have a stuck autochanger, which doesn't properly respond to mtx commands, which in turn ends with an error. Power-cycling the autochanger sometimes helps, emergency ejecting any tapes and doing an inventory might also help - that depends on the actual problem and, of course, the autochanger you have. Arno -- IT-Service Lehmann[EMAIL PROTECTED] Arno Lehmann http://www.its-lehmann.de - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command. Wanted 3000 OK data, , got 3903 Error append data
Hi, On 11/9/2006 7:25 AM, Ryan Novosielski wrote: I've gotten this one... I'd love to know what it is too. :) Dat's an error ;-) Jake Goerzen wrote: What does it mean when this happens? and is there a way to fix it? 07-Nov 08:40 adam-dir: Start Backup JobId 369, Job=BackupACSRV.2006-11-07_08.40.54 07-Nov 08:40 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. Ok, more to the point, this is a failure from the mtx-changer script. It's not a timeout, that would be indicated by a TERM or KILL exit code, but seems to be problem mt or mtx, inside mtx-changer, report. I'd try to unmount the drive using bconsole, then use mtx-changer from the shell with the loaded command and then follow it's operation until you can determine where the problem comes from. Most probably, the mtx command will not get a proper response to a the status command. Most probably you have a stuck autochanger, which doesn't properly respond to mtx commands, which in turn ends with an error. Power-cycling the autochanger sometimes helps, emergency ejecting any tapes and doing an inventory might also help - that depends on the actual problem and, of course, the autochanger you have. Arno -- IT-Service Lehmann[EMAIL PROTECTED] Arno Lehmann http://www.its-lehmann.de - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad response to Append Data command. Wanted 3000 OK data, , got 3903 Error append data
What does it mean when this happens? 07-Nov 08:40 adam-dir: Start Backup JobId 369, Job=BackupACSRV.2006-11-07_08.40.54 07-Nov 08:40 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3304 Issuing autochanger load slot 9, drive 0 command. 07-Nov 08:46 adam-sd: BackupACSRV.2006-11-07_08.40.54 Fatal error: 3992 Bad autochanger load slot 9, drive 0: ERR=Child exited with code 1. 07-Nov 08:46 acsrv-fd: BackupACSRV.2006-11-07_08.40.54 Fatal error: c:\cygwin\home\kern\bacula\k\src\win32\filed\../../filed/job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 07-Nov 08:46 adam-dir: BackupACSRV.2006-11-07_08.40.54 Error: Bacula 1.38.11 (28Jun06): 07-Nov-2006 08:46:17 JobId: 369 Job:BackupACSRV.2006-11-07_08.40.54 Backup Level: Full Client: acsrv-fd Windows Server 2003,MVS,NT 5.2.3790 FileSet:ACSRVfileset 2006-09-24 10:20:32 Pool: Default Storage:Autochanger Scheduled time: 07-Nov-2006 08:40:53 Start time: 07-Nov-2006 08:40:59 End time: 07-Nov-2006 08:46:17 Elapsed time: 5 mins 18 secs Priority: 10 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 1 Volume Session Time:1162893909 Last Volume Bytes: 35,091,719,514 (35.09 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad response to Append Data command. Wanted 3000 OK data, , got 3903 Error append data
What does it mean when this happens? and is there a way to fix it? 07-Nov 08:40 adam-dir: Start Backup JobId 369, Job=BackupACSRV.2006-11-07_08.40.54 07-Nov 08:40 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3304 Issuing autochanger load slot 9, drive 0 command. 07-Nov 08:46 adam-sd: BackupACSRV.2006-11-07_08.40.54 Fatal error: 3992 Bad autochanger load slot 9, drive 0: ERR=Child exited with code 1. 07-Nov 08:46 acsrv-fd: BackupACSRV.2006-11-07_08.40.54 Fatal error: c:\cygwin\home\kern\bacula\k\src\win32\filed\../../filed/job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 07-Nov 08:46 adam-dir: BackupACSRV.2006-11-07_08.40.54 Error: Bacula 1.38.11 (28Jun06): 07-Nov-2006 08:46:17 JobId: 369 Job:BackupACSRV.2006-11-07_08.40.54 Backup Level: Full Client: acsrv-fd Windows Server 2003,MVS,NT 5.2.3790 FileSet:ACSRVfileset 2006-09-24 10:20:32 Pool: Default Storage:Autochanger Scheduled time: 07-Nov-2006 08:40:53 Start time: 07-Nov-2006 08:40:59 End time: 07-Nov-2006 08:46:17 Elapsed time: 5 mins 18 secs Priority: 10 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 1 Volume Session Time:1162893909 Last Volume Bytes: 35,091,719,514 (35.09 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command. Wanted 3000 OK data, , got 3903 Error append data
I've gotten this one... I'd love to know what it is too. :) Jake Goerzen wrote: What does it mean when this happens? and is there a way to fix it? 07-Nov 08:40 adam-dir: Start Backup JobId 369, Job=BackupACSRV.2006-11-07_08.40.54 07-Nov 08:40 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3301 Issuing autochanger loaded drive 0 command. 07-Nov 08:41 adam-sd: 3991 Bad autochanger loaded drive 0 command: ERR=Child exited with code 1. 07-Nov 08:41 adam-sd: 3304 Issuing autochanger load slot 9, drive 0 command. 07-Nov 08:46 adam-sd: BackupACSRV.2006-11-07_08.40.54 Fatal error: 3992 Bad autochanger load slot 9, drive 0: ERR=Child exited with code 1. 07-Nov 08:46 acsrv-fd: BackupACSRV.2006-11-07_08.40.54 Fatal error: c:\cygwin\home\kern\bacula\k\src\win32\filed\../../filed/job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 07-Nov 08:46 adam-dir: BackupACSRV.2006-11-07_08.40.54 Error: Bacula 1.38.11 (28Jun06): 07-Nov-2006 08:46:17 JobId: 369 Job:BackupACSRV.2006-11-07_08.40.54 Backup Level: Full Client: acsrv-fd Windows Server 2003,MVS,NT 5.2.3790 FileSet:ACSRVfileset 2006-09-24 10:20:32 Pool: Default Storage:Autochanger Scheduled time: 07-Nov-2006 08:40:53 Start time: 07-Nov-2006 08:40:59 End time: 07-Nov-2006 08:46:17 Elapsed time: 5 mins 18 secs Priority: 10 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 1 Volume Session Time:1162893909 Last Volume Bytes: 35,091,719,514 (35.09 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad response to Append Data command
Hello, I try to get bacula running on a Centos 4 running. I have VXA2 autochanger. I went through the manuals, and everything seems to work fine. I started backup job and it looked very good. After the test run, I changed my bacula-dir.conf to add another client and change what has to be backed up. After i made those changes as soon I will run a backup I get the following message. The same message appears if I use my old files. I hope someone can help me. If you need more information let me know. Robert Job started. JobId=42 09-Oct 10:26 server1-dir: Start Backup JobId 42, Job=Client1.2006-10-09_10.26.55 09-Oct 10:26 server1-sd: 3301 Issuing autochanger loaded drive 0 command. 09-Oct 10:26 server1-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 09-Oct 10:26 server1-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 09-Oct 10:27 server1-sd: Client1.2006-10-09_10.26.55 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. 09-Oct 10:27 server1-fd: Client1.2006-10-09_10.26.55 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 09-Oct 10:27 server1-dir: Client1.2006-10-09_10.26.55 Error: Bacula 1.38.11 (28Jun06): 09-Oct-2006 10:27:14 JobId: 42 Job:Client1.2006-10-09_10.26.55 Backup Level: Incremental, since=2006-10-06 08:35:44 Client: server1-fd i686-redhat-linux-gnu,redhat, FileSet:Full Set 2006-10-05 09:49:48 Pool: Default Storage:VXA2 Scheduled time: 09-Oct-2006 10:26:54 Start time: 09-Oct-2006 10:26:58 End time: 09-Oct-2006 10:27:14 Elapsed time: 16 secs Priority: 100 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 1 Volume Session Time:1160414803 Last Volume Bytes: 14,357,017,975 (14.35 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad response to Append Data command
Hello, I try to get bacula running on a Centos 4 running. I have VXA2 autochanger. I went through the manuals, and everything seems to work fine. I started backup job and it looked very good. After the test run, I changed my bacula-dir.conf to add another client and change what has to be backed up. After i made those changes as soon I will run a backup I get the following message. The same message appears if I use my old files. I hope someone can help me. If you need more information let me know. Robert Job started. JobId=44 09-Oct 11:48 server1-dir: Start Backup JobId 44, Job=Client1.2006-10-09_11.48.56 09-Oct 11:49 server1-sd: 3301 Issuing autochanger loaded drive 0 command. 09-Oct 11:49 server1-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 09-Oct 11:49 server1-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 09-Oct 11:49 server1-fd: Client1.2006-10-09_11.48.56 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 09-Oct 11:49 server1-sd: Client1.2006-10-09_11.48.56 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. 09-Oct 11:49 server1-dir: Client1.2006-10-09_11.48.56 Error: Bacula 1.38.11 (28Jun06): 09-Oct-2006 11:49:00 JobId: 44 Job:Client1.2006-10-09_11.48.56 Backup Level: Incremental, since=2006-10-06 08:35:44 Client: server1-fd i686-redhat-linux-gnu,redhat, FileSet:Full Set 2006-10-05 09:49:48 Pool: Default Storage:VXA2 Scheduled time: 09-Oct-2006 11:48:55 Start time: 09-Oct-2006 11:49:00 End time: 09-Oct-2006 11:49:00 Elapsed time: 0 secs Priority: 100 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 3 Volume Session Time:1160414803 Last Volume Bytes: 14,357,017,975 (14.35 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 1 Volume Session Time:1160414803 Last Volume Bytes: 14,357,017,975 (14.35 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
On Monday 09 October 2006 13:53, Robert Keidel wrote: I try to get bacula running on a Centos 4 running. I have VXA2 autochanger. I went through the manuals, and everything seems to work fine. I started backup job and it looked very good. After the test run, I changed my bacula-dir.conf to add another client and change what has to be backed up. After i made those changes as soon I will run a backup I get the following message. The same message appears if I use my old files. I hope someone can help me. If you need more information let me know. Robert Job started. JobId=44 09-Oct 11:48 server1-dir: Start Backup JobId 44, Job=Client1.2006-10-09_11.48.56 09-Oct 11:49 server1-sd: 3301 Issuing autochanger loaded drive 0 command. 09-Oct 11:49 server1-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 09-Oct 11:49 server1-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 09-Oct 11:49 server1-fd: Client1.2006-10-09_11.48.56 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 09-Oct 11:49 server1-sd: Client1.2006-10-09_11.48.56 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. 09-Oct 11:49 server1-dir: Client1.2006-10-09_11.48.56 Error: Bacula 1.38.11 (28Jun06): 09-Oct-2006 11:49:00 Welcome to the club. I am having the same trouble, perhaps coincidentally, on a Centos 4.4 system. Is yours a multi or single tape drive? Mine is a 4 drive system; I thought the problem might be related to the multiple drives, but perhaps not. I am working some with Kern on this to try to reproduce and fix it. You might look at the bug reports 687 and 689 on bugs.bacula.org; you can at least see if what you are seeing tracks my errors. -- -- Michael - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
On Monday 09 October 2006 20:53, Robert Keidel wrote: Hello, I try to get bacula running on a Centos 4 running. I have VXA2 autochanger. I went through the manuals, and everything seems to work fine. I started backup job and it looked very good. After the test run, I changed my bacula-dir.conf to add another client and change what has to be backed up. After i made those changes as soon I will run a backup I get the following message. The same message appears if I use my old files. I hope someone can help me. If you need more information let me know. Robert Job started. JobId=44 09-Oct 11:48 server1-dir: Start Backup JobId 44, Job=Client1.2006-10-09_11.48.56 09-Oct 11:49 server1-sd: 3301 Issuing autochanger loaded drive 0 command. 09-Oct 11:49 server1-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 09-Oct 11:49 server1-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 09-Oct 11:49 server1-fd: Client1.2006-10-09_11.48.56 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data The above error can be ignored since it is the FD complaining that the SD went away. 09-Oct 11:49 server1-sd: Client1.2006-10-09_11.48.56 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. The problem is indicated above. Your autochanger script did not work correctly and terminated with a non-zero status. It is most likely a permissions error. 09-Oct 11:49 server1-dir: Client1.2006-10-09_11.48.56 Error: Bacula 1.38.11 (28Jun06): 09-Oct-2006 11:49:00 JobId: 44 Job:Client1.2006-10-09_11.48.56 Backup Level: Incremental, since=2006-10-06 08:35:44 Client: server1-fd i686-redhat-linux-gnu,redhat, FileSet:Full Set 2006-10-05 09:49:48 Pool: Default Storage:VXA2 Scheduled time: 09-Oct-2006 11:48:55 Start time: 09-Oct-2006 11:49:00 End time: 09-Oct-2006 11:49:00 Elapsed time: 0 secs Priority: 100 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 3 Volume Session Time:1160414803 Last Volume Bytes: 14,357,017,975 (14.35 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 (0 B) SD Bytes Written: 0 (0 B) Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 1 Volume Session Time:1160414803 Last Volume Bytes: 14,357,017,975 (14.35 GB) Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad response to Append Data command.
I seem to be past the Appendable media issue I've been having - now I'm getting something totally different when I try and run this backup: You have messages. * * * *messages 02-Oct 13:59 bacula-atl2-sd: 3302 Autochanger loaded drive 0, result is Slot 8. 02-Oct 13:59 bacula-atl2-sd: taurus.2006-10-02_13.59.13 Warning: Director wanted Volume 2006-09-16-s8. Current Volume 2006-09-16-S8 not acceptable because: 1998 Volume 2006-09-16-S8 status is Append, not in Pool. 02-Oct 13:59 bacula-atl2-sd: 3301 Issuing autochanger loaded drive 0 command. 02-Oct 13:59 bacula-atl2-sd: 3302 Autochanger loaded drive 0, result is Slot 8. 02-Oct 13:59 bacula-atl2-sd: taurus.2006-10-02_13.59.13 Warning: Director wanted Volume 2006-09-16-s8. Current Volume 2006-09-16-S8 not acceptable because: 1998 Volume 2006-09-16-S8 status is Append, not in Pool. 02-Oct 13:59 bacula-atl2-sd: 3301 Issuing autochanger loaded drive 0 command. 02-Oct 13:59 bacula-atl2-sd: 3302 Autochanger loaded drive 0, result is Slot 8. 02-Oct 13:59 bacula-atl2-sd: taurus.2006-10-02_13.59.13 Warning: Director wanted Volume 2006-09-16-s8. Current Volume 2006-09-16-S8 not acceptable because: 1998 Volume 2006-09-16-S8 status is Append, not in Pool. 02-Oct 13:59 bacula-atl2-sd: 3301 Issuing autochanger loaded drive 0 command. 02-Oct 13:59 bacula-atl2-sd: 3302 Autochanger loaded drive 0, result is Slot 8. 02-Oct 13:59 bacula-atl2-sd: taurus.2006-10-02_13.59.13 Warning: Director wanted Volume 2006-09-16-s8. Current Volume 2006-09-16-S8 not acceptable because: 1998 Volume 2006-09-16-S8 status is Append, not in Pool. 02-Oct 13:59 bacula-atl2-sd: taurus.2006-10-02_13.59.13 Fatal error: Too many errors trying to mount device /dev/nrsa0. 02-Oct 13:59 taurus-fd: taurus.2006-10-02_13.59.13 Fatal error: job.c:1665 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 02-Oct 13:59 bacula-atl2-dir: taurus.2006-10-02_13.59.13 Error: Bacula 1.36.3 (22Apr05): 02-Oct-2006 13:59:26 JobId: 6152 Job:taurus.2006-10-02_13.59.13 Backup Level: Full Client: taurus-fd FileSet:taurus disk 2005-07-28 17:09:19 Pool: Full_A Storage:ATL2-TAPE1 Start time: 02-Oct-2006 13:59:15 End time: 02-Oct-2006 13:59:26 FD Files Written: 0 SD Files Written: 0 FD Bytes Written: 0 SD Bytes Written: 0 Rate: 0.0 KB/s Software Compression: None Volume name(s): Volume Session Id: 5 Volume Session Time:1159802982 Last Volume Bytes: 0 Non-fatal FD errors:0 SD Errors: 0 FD termination status: Error SD termination status: Error Termination:*** Backup Error *** This is an older version of Bacula (1.36.3) running on FreeBSD(4.9 Release) I'm getting ready to build a RHES solution box and install a newer version on there, but right now, I'd like to get this box working as it's attached to the tape changer. The other won't be for some time. -Keith - Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT business topics through brief surveys -- and earn cash http://www.techsay.com/default.php?page=join.phpp=sourceforgeCID=DEVDEV ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
On Tue, 5 Sep 2006, Arno Lehmann wrote: I get several messages like these on dmesg: (scsi2:A:15:0): data overrun detected in Data-in phase. Tag == 0x3. (scsi2:A:15:0): Have seen Data Phase. Length = 50. NumSGs = 1. sg[0] - Addr 0x11c3b8000 : Length 50 These are SCSI errors and it's unlikely that anything you do to / with Bacula can fix it. More specifically those are errors associated with cabling problems. Check cable lengths, that everything is plugged together properly and that the terminators are OK/ AB - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
Thank you Alan and Arno, for your help. Although I made a kernel update, i checked the the data cable. Is was slightly disconnected. Probably that was the problem. Now those dmesg messages are gone. Now I have a tape with volStatus =Error. What now? What you recommend? delete the jobs associated with that tape and the volume? By the way, I sent a message to the list about adding one single tape to the pool. I would like to add one so I can replace the one with the error. How can I do it? Ive search/read the documentation but I cat find the answer. When I run the label barcodes, bacula only ask me if I want to label all of then (including the ones that are already labeled). Once again, thanks Jaime Ventura [Infra-estruturas e Comunicações] Rua Dr. António Bernardino de Almeida, 431 4200 - 072 Porto Telef: +351 22 834 05 00 (04) - ext. 1641 Fax: +351 22 832 11 59 e-mail: [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] url:www.isep.ipp.pt http://www.isep.ipp.pt Alan Brown wrote: On Tue, 5 Sep 2006, Arno Lehmann wrote: I get several messages like these on dmesg: (scsi2:A:15:0): data overrun detected in Data-in phase. Tag == 0x3. (scsi2:A:15:0): Have seen Data Phase. Length = 50. NumSGs = 1. sg[0] - Addr 0x11c3b8000 : Length 50 These are SCSI errors and it's unlikely that anything you do to / with Bacula can fix it. More specifically those are errors associated with cabling problems. Check cable lengths, that everything is plugged together properly and that the terminators are OK/ AB - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
On Wed, 6 Sep 2006, Jaime Ventura wrote: Although I made a kernel update, i checked the the data cable. Is was slightly disconnected. Probably that was the problem. very likely. Now I have a tape with volStatus =Error. What now? What you recommend? delete the jobs associated with that tape and the volume? Update the status of the tape to used and wait for the jobs to expire. If you want to reuse the tape immediately, then update it as above, then purge it. By the way, I sent a message to the list about adding one single tape to the pool. I would like to add one so I can replace the one with the error. How can I do it? Create a new barcode, and use the add command to create an entry in the appropriate pool. Bacula will label the tape automatically when it is loaded Do not reuse the identity of the error status tape. When the tape is purged and eligible for recyling it will be reused automatically. AB - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad response to Append Data command
Hello, Im backing up data to tape using a tapeloader, and im experiencing the following problem: 05-Sep 09:38 isep-dir: Start Backup JobId 498, Job=jtgv-gsi.2006-09-05_09.38.17 05-Sep 09:38 localhost-sd: 3301 Issuing autochanger loaded drive 0 command. 05-Sep 09:38 localhost-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 05-Sep 09:38 localhost-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 05-Sep 09:38 localhost-sd: jtgv-gsi.2006-09-05_09.38.17 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. 05-Sep 09:36 jtgv-fd: jtgv-gsi.2006-09-05_09.38.17 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data The list volumes say there is an appendable volume(01L3): *list volumes Pool: Default +-++---+-+--+--+-+--+---+---+-+ | MediaId | VolumeName | VolStatus | VolBytes| VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | +-++---+-+--+--+-+--+---+---+-+ | 8 | testeVol1 | Append| 171,591,154 |0 | 31,536,000 | 1 |0 | 1 | File | 2006-09-04 23:10:04 | +-++---+-+--+--+-+--+---+---+-+ Pool: TapePoolForSERVERS +-++---+-+--+--+-+--+---+---+-+ | MediaId | VolumeName | VolStatus | VolBytes| VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | +-++---+-+--+--+-+--+---+---+-+ | 11 | 01L3 | Append| 7,651,509,655 |8 | 31,536,000 | 1 |1 | 1 | Ultrium-3 | 2006-08-15 07:30:00 | | 12 | 02L3 | Full | 434,014,408,572 | 437 | 31,536,000 | 1 |2 | 1 | Ultrium-3 | 2006-08-14 22:24:58 | +-++---+-+--+--+-+--+---+---+-+ * Is my data on the tape corrupted? At this point I cant really tell if the last job that used that tape finished correctly. How do I overcome the problem? Thank you very much. -- - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
Hello, On 9/5/2006 10:44 AM, Jaime Ventura wrote: Hello, Im backing up data to tape using a tapeloader, and im experiencing the following problem: 05-Sep 09:38 isep-dir: Start Backup JobId 498, Job=jtgv-gsi.2006-09-05_09.38.17 05-Sep 09:38 localhost-sd: 3301 Issuing autochanger loaded drive 0 command. 05-Sep 09:38 localhost-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 05-Sep 09:38 localhost-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 05-Sep 09:38 localhost-sd: jtgv-gsi.2006-09-05_09.38.17 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. The changer script you use failed. Depending on the script you use and the kind of hardware you have, and the OS you run, and lots of other things probably, you could fix that. 05-Sep 09:36 jtgv-fd: jtgv-gsi.2006-09-05_09.38.17 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data The list volumes say there is an appendable volume(01L3): *list volumes Pool: Default +-++---+-+--+--+-+--+---+---+-+ | MediaId | VolumeName | VolStatus | VolBytes| VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | +-++---+-+--+--+-+--+---+---+-+ | 8 | testeVol1 | Append| 171,591,154 |0 | 31,536,000 | 1 |0 | 1 | File | 2006-09-04 23:10:04 | +-++---+-+--+--+-+--+---+---+-+ Pool: TapePoolForSERVERS +-++---+-+--+--+-+--+---+---+-+ | MediaId | VolumeName | VolStatus | VolBytes| VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | +-++---+-+--+--+-+--+---+---+-+ | 11 | 01L3 | Append| 7,651,509,655 |8 | 31,536,000 | 1 |1 | 1 | Ultrium-3 | 2006-08-15 07:30:00 | | 12 | 02L3 | Full | 434,014,408,572 | 437 | 31,536,000 | 1 |2 | 1 | Ultrium-3 | 2006-08-14 22:24:58 | +-++---+-+--+--+-+--+---+---+-+ * Is my data on the tape corrupted? Probably not. At this point I cant really tell if the last job that used that tape finished correctly. What does the 'status sd=xxx' command tell you? 'sta dir' might also help, the messages in the console should have some information, and of course, when the job finishes, there should be a job report. How do I overcome the problem? Have you run btape on that device? Arno Thank you very much. -- IT-Service Lehmann[EMAIL PROTECTED] Arno Lehmann http://www.its-lehmann.de - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
Arno Lehmann, Thanks for you reply. Im using RHEL 4 x86_64 with a dell powervault 124T. Im using bacula for some time backing up to disk. Now i bought a a dell powervault 124T so i can make backups to tape. So, im newbe to tape issues. Im on a test environment... making tests :P. When you say «Have you run btape on that device?», what is the purpose? test the tape? write a EOF? Cant get much info on status: *mount The defined Storage resources are: 1: File 2: IBM-Ultrium-3 3: TapeLoader124T Select Storage resource (1-3): 3 3301 Issuing autochanger loaded drive 0 command. 3302 Autochanger loaded drive 0, result: nothing loaded. 3001 Mounted Volume: 01L3 3001 Device IBM-Ultrium-3 (/dev/nst0) is already mounted with Volume 01L3 * *status sd=TapeLoader124T Connecting to Storage daemon TapeLoader124T at 10.0.23.17:9103 localhost-sd Version: 1.38.11 (28 June 2006) x86_64-redhat-linux-gnu redhat Enterprise release Daemon started 28-Aug-06 14:48, 30 Jobs run since started. Running Jobs: No Jobs running. Jobs waiting to reserve a drive: Terminated Jobs: JobId Level Files Bytes Status FinishedName == 498 Full 0 0 Error05-Sep-06 09:40 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:43 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:45 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:48 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:50 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:52 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:55 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:57 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:59 jtgv-gsi 498 Full 0 0 Error05-Sep-06 10:02 jtgv-gsi Device status: Autochanger TapeLoader124T with devices: IBM-Ultrium-3 (/dev/nst0) Device FileStorage (/backupspace/bacula) is not open or does not exist. Device IBM-Ultrium-3 (/dev/nst0) is mounted with Volume=01L3 Pool=TapePoolForSERVERS Drive 0 is not loaded. Total Bytes Read=64,512 Blocks Read=1 Bytes/block=64,512 Positioned at File=0 Block=0 In Use Volume status: 01L3 on device IBM-Ultrium-3 (/dev/nst0) How do I make that tape appendable again? Again, thanks for you help Arno Lehmann wrote: Hello, On 9/5/2006 10:44 AM, Jaime Ventura wrote: Hello, Im backing up data to tape using a tapeloader, and im experiencing the following problem: 05-Sep 09:38 isep-dir: Start Backup JobId 498, Job=jtgv-gsi.2006-09-05_09.38.17 05-Sep 09:38 localhost-sd: 3301 Issuing autochanger loaded drive 0 command. 05-Sep 09:38 localhost-sd: 3302 Autochanger loaded drive 0, result: nothing loaded. 05-Sep 09:38 localhost-sd: 3304 Issuing autochanger load slot 1, drive 0 command. 05-Sep 09:38 localhost-sd: jtgv-gsi.2006-09-05_09.38.17 Fatal error: 3992 Bad autochanger load slot 1, drive 0: ERR=Child exited with code 1. The changer script you use failed. Depending on the script you use and the kind of hardware you have, and the OS you run, and lots of other things probably, you could fix that. 05-Sep 09:36 jtgv-fd: jtgv-gsi.2006-09-05_09.38.17 Fatal error: job.c:1617 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data The list volumes say there is an appendable volume(01L3): *list volumes Pool: Default +-++---+-+--+--+-+--+---+---+-+ | MediaId | VolumeName | VolStatus | VolBytes| VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | +-++---+-+--+--+-+--+---+---+-+ | 8 | testeVol1 | Append| 171,591,154 |0 | 31,536,000 | 1 |0 | 1 | File | 2006-09-04 23:10:04 | +-++---+-+--+--+-+--+---+---+-+ Pool: TapePoolForSERVERS +-++---+-+--+--+-+--+---+---+-+ | MediaId | VolumeName | VolStatus | VolBytes| VolFiles | VolRetention | Recycle | Slot | InChanger | MediaType | LastWritten | +-++---+-+--+--+-+--+---+---+-+ | 11 | 01L3 | Append| 7,651,509,655 |8 | 31,536,000 | 1 |1 | 1 | Ultrium-3 | 2006-08-15 07:30:00 | | 12 | 02L3 |
Re: [Bacula-users] Bad response to Append Data command
Thanks for your reply. I've already installer and heavily tested the tapeloader with bacula. I've done lots of backups to tape. All worked ok. Then I went on vacation, and when I came I was getting this error. I cant figure out what happened. Maybe the tape went on vacations too :P Jaime Uwe Schuerkamp wrote: On Tue, Sep 05, 2006 at 10:54:43AM +0100, Jaime Ventura wrote: From: Jaime Ventura [EMAIL PROTECTED] To: Arno Lehmann [EMAIL PROTECTED] Cc: Bacula Users bacula-users@lists.sourceforge.net Date: Tue, 05 Sep 2006 10:54:43 +0100 Subject: Re: [Bacula-users] Bad response to Append Data command Arno Lehmann, Thanks for you reply. Im using RHEL 4 x86_64 with a dell powervault 124T. Im using bacula for some time backing up to disk. Now i bought a a dell powervault 124T so i can make backups to tape. So, im newbe to tape issues. Im on a test environment... making tests :P. When you say «Have you run btape on that device?», what is the purpose? test the tape? write a EOF? Cant get much info on status: *mount The defined Storage resources are: 1: File 2: IBM-Ultrium-3 3: TapeLoader124T Select Storage resource (1-3): 3 3301 Issuing autochanger loaded drive 0 command. 3302 Autochanger loaded drive 0, result: nothing loaded. 3001 Mounted Volume: 01L3 3001 Device IBM-Ultrium-3 (/dev/nst0) is already mounted with Volume 01L3 * *status sd=TapeLoader124T Connecting to Storage daemon TapeLoader124T at 10.0.23.17:9103 localhost-sd Version: 1.38.11 (28 June 2006) x86_64-redhat-linux-gnu redhat Enterprise release Daemon started 28-Aug-06 14:48, 30 Jobs run since started. Running Jobs: No Jobs running. Jobs waiting to reserve a drive: Terminated Jobs: JobId Level Files Bytes Status FinishedName == 498 Full 0 0 Error05-Sep-06 09:40 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:43 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:45 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:48 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:50 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:52 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:55 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:57 jtgv-gsi 498 Full 0 0 Error05-Sep-06 09:59 jtgv-gsi 498 Full 0 0 Error05-Sep-06 10:02 jtgv-gsi Device status: Autochanger TapeLoader124T with devices: IBM-Ultrium-3 (/dev/nst0) Device FileStorage (/backupspace/bacula) is not open or does not exist. Device IBM-Ultrium-3 (/dev/nst0) is mounted with Volume=01L3 Pool=TapePoolForSERVERS Drive 0 is not loaded. Total Bytes Read=64,512 Blocks Read=1 Bytes/block=64,512 Positioned at File=0 Block=0 In Use Volume status: 01L3 on device IBM-Ultrium-3 (/dev/nst0) How do I make that tape appendable again? Again, thanks for you help Hi, you need to run btape to ensure compatibility with your tape library / tape drive. First, you'll need to find out which scsi generic device represents your changer device (probably /dev/sg1, dependent on the scsi id you've configured for the changer). Unless you tell Bacula where your tape changer is, it won't be able to load other tapes using mtx. Here's an example from our setup: Autochanger { Name = Overland Device = DLT8000 Changer Command = /server/bacula/etc/mtx-changer %c %o %S %a %d Changer Device = /dev/changer } Where /dev/changer is a symlink to the generic scsi device: /dev/changer - /dev/sg6 and /proc/scsi/scsi has the following entry: Host: scsi1 Channel: 00 Id: 05 Lun: 01 Vendor: HP Model: 1x8 autoloader Rev: 1.46 Type: Medium Changer ANSI SCSI revision: 03 HTH, uwe - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
On Tue, Sep 05, 2006 at 12:59:57PM +0100, Jaime Ventura wrote: From: Jaime Ventura [EMAIL PROTECTED] To: Uwe Schuerkamp [EMAIL PROTECTED] Cc: Arno Lehmann [EMAIL PROTECTED], Bacula Users bacula-users@lists.sourceforge.net Date: Tue, 05 Sep 2006 12:59:57 +0100 Subject: Re: [Bacula-users] Bad response to Append Data command Thanks for your reply. I've already installer and heavily tested the tapeloader with bacula. I've done lots of backups to tape. All worked ok. Then I went on vacation, and when I came I was getting this error. I cant figure out what happened. Maybe the tape went on vacations too :P Jaime Hello Jaime, to get the simple stuff out of the way first: have you tried restarting the bacula daemons? What's the output of mtx status and do you see any strange messages (scsi-errors and such) in the output of dmesg? Cheers, uwe -- Uwe Schuerkamp, NIONEX GmbH (http://www.nionex.com/) [EMAIL PROTECTED] Tel: +49 (0)5241 / 80 10 66 FAX: / 806 23 38 Avenwedder Str. 55, D-33311 Guetersloh, Germany GnuPG KeyID: 5887047D, Fingerprint: 2E1320229A3F63 7F676FE9B1A836A461 - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
After rebooting the all system, the problem remains. The result of mtx status is: [EMAIL PROTECTED] bacula]# mtx -f /dev/sg5 status Storage Changer /dev/sg5:1 Drives, 8 Slots ( 0 Import/Export ) Data Transfer Element 0:Full (Unknown Storage Element Loaded):VolumeTag = 01L3 Storage Element 1:Empty Storage Element 2:Full :VolumeTag=02L3 Storage Element 3:Full :VolumeTag=03L3 Storage Element 4:Empty Storage Element 5:Empty Storage Element 6:Empty Storage Element 7:Empty Storage Element 8:Empty I get several messages like these on dmesg: (scsi2:A:15:0): data overrun detected in Data-in phase. Tag == 0x3. (scsi2:A:15:0): Have seen Data Phase. Length = 50. NumSGs = 1. sg[0] - Addr 0x11c3b8000 : Length 50 Meanwhile, I'll try to make the bacula use another tape. This way I can tel if the problem is related to the tape or the tapeloader/drive. Thanks, Jaime Ventura [Infra-estruturas e Comunicações] Rua Dr. António Bernardino de Almeida, 431 4200 - 072 Porto Telef: +351 22 834 05 00 (04) - ext. 1641 Fax: +351 22 832 11 59 e-mail: [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] url:www.isep.ipp.pt http://www.isep.ipp.pt Uwe Schuerkamp wrote: On Tue, Sep 05, 2006 at 12:59:57PM +0100, Jaime Ventura wrote: From: Jaime Ventura [EMAIL PROTECTED] To: Uwe Schuerkamp [EMAIL PROTECTED] Cc: Arno Lehmann [EMAIL PROTECTED], Bacula Users bacula-users@lists.sourceforge.net Date: Tue, 05 Sep 2006 12:59:57 +0100 Subject: Re: [Bacula-users] Bad response to Append Data command Thanks for your reply. I've already installer and heavily tested the tapeloader with bacula. I've done lots of backups to tape. All worked ok. Then I went on vacation, and when I came I was getting this error. I cant figure out what happened. Maybe the tape went on vacations too :P Jaime Hello Jaime, to get the simple stuff out of the way first: have you tried restarting the bacula daemons? What's the output of mtx status and do you see any strange messages (scsi-errors and such) in the output of dmesg? Cheers, uwe - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command
Hi, On 9/5/2006 3:36 PM, Jaime Ventura wrote: After rebooting the all system, the problem remains. The result of mtx status is: [EMAIL PROTECTED] bacula]# mtx -f /dev/sg5 status Storage Changer /dev/sg5:1 Drives, 8 Slots ( 0 Import/Export ) Data Transfer Element 0:Full (Unknown Storage Element Loaded):VolumeTag = 01L3 Storage Element 1:Empty Storage Element 2:Full :VolumeTag=02L3 Storage Element 3:Full :VolumeTag=03L3 Storage Element 4:Empty Storage Element 5:Empty Storage Element 6:Empty Storage Element 7:Empty Storage Element 8:Empty I get several messages like these on dmesg: (scsi2:A:15:0): data overrun detected in Data-in phase. Tag == 0x3. (scsi2:A:15:0): Have seen Data Phase. Length = 50. NumSGs = 1. sg[0] - Addr 0x11c3b8000 : Length 50 These are SCSI errors and it's unlikely that anything you do to / with Bacula can fix it. Meanwhile, I'll try to make the bacula use another tape. This way I can tel if the problem is related to the tape or the tapeloader/drive. I suppose you'll find that the errors persist. Has a kernel update happened between the time everything worked and now? If it's not a SCSI driver related problem I can only suggest to try different SCSI hardware - start with cables and terminator, try another tape drive and HBA, and so on... Arno Thanks, Jaime Ventura [Infra-estruturas e Comunicações] Rua Dr. António Bernardino de Almeida, 431 4200 - 072 Porto Telef: +351 22 834 05 00 (04) - ext. 1641 Fax: +351 22 832 11 59 e-mail: [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] url: www.isep.ipp.pt http://www.isep.ipp.pt Uwe Schuerkamp wrote: On Tue, Sep 05, 2006 at 12:59:57PM +0100, Jaime Ventura wrote: From: Jaime Ventura [EMAIL PROTECTED] To: Uwe Schuerkamp [EMAIL PROTECTED] Cc: Arno Lehmann [EMAIL PROTECTED], Bacula Users bacula-users@lists.sourceforge.net Date: Tue, 05 Sep 2006 12:59:57 +0100 Subject: Re: [Bacula-users] Bad response to Append Data command Thanks for your reply. I've already installer and heavily tested the tapeloader with bacula. I've done lots of backups to tape. All worked ok. Then I went on vacation, and when I came I was getting this error. I cant figure out what happened. Maybe the tape went on vacations too :P Jaime Hello Jaime, to get the simple stuff out of the way first: have you tried restarting the bacula daemons? What's the output of mtx status and do you see any strange messages (scsi-errors and such) in the output of dmesg? Cheers, uwe - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users -- IT-Service Lehmann[EMAIL PROTECTED] Arno Lehmann http://www.its-lehmann.de - Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnkkid=120709bid=263057dat=121642 ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
Kern, thanks for your quick reply. On Thursday 03 November 2005 19:33, Kern Sibbald wrote: It looks like you stumbled into a change they made in the 2.6 kernel, which will undoubtely cause a lot of people a *lot* of pain. The change prohibits a program from opening a drive in read/write mode if there is no volume in the drive. You are most likely right. This behavior seems to have started since I told my streamer to eject the tapes automatically when a new tape was requested. I will tell the Debian package maintainer. Perhaps there is a patch even for the 1.36.2 version which can be included in Debian/Sarge. (The policy usually forbids shipping new upstream releases in the same Debian stable release. So a source code patch is the common way to deal with such issues. Perhaps that part of the 1.38 version can be included in the 1.36 version?) If this is what is causing your problem, you can solve it by: 1. Ensuring that there is some tape in the drive befort starting the SD, and before issuing a mount command in the console. Strangely the tape gets ejected after a backup job is done. It would be less bad if the tape would just stay in and Bacula complained that it needed a new tape when the next backup job starts. 2. Upgrade to version 1.38.0 (you might wait for a Win32 fix if you have Win32 clients). Fortunately there are no Windows clients in my network. :) So you say the 1.38 includes a fix for the kernel problem? Good to know. Regards Christoph -- ~ ~ ~ .signature [Modified] 3 lines --100%--3,41 All --- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42 plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
On Friday 04 November 2005 09:15, Christoph Haas wrote: Kern, thanks for your quick reply. On Thursday 03 November 2005 19:33, Kern Sibbald wrote: It looks like you stumbled into a change they made in the 2.6 kernel, which will undoubtely cause a lot of people a *lot* of pain. The change prohibits a program from opening a drive in read/write mode if there is no volume in the drive. You are most likely right. This behavior seems to have started since I told my streamer to eject the tapes automatically when a new tape was requested. Yes, that would create the problem. I will tell the Debian package maintainer. Perhaps there is a patch even for the 1.36.2 version which can be included in Debian/Sarge. (The policy usually forbids shipping new upstream releases in the same Debian stable release. So a source code patch is the common way to deal with such issues. Perhaps that part of the 1.38 version can be included in the 1.36 version?) If someone is very cleaver, perhaps one could patch it, otherwise, there is a *really* big difference between the 1.36.x code and the 1.38.0 that deals with this problem. If this is what is causing your problem, you can solve it by: 1. Ensuring that there is some tape in the drive befort starting the SD, and before issuing a mount command in the console. Strangely the tape gets ejected after a backup job is done. It would be less bad if the tape would just stay in and Bacula complained that it needed a new tape when the next backup job starts. 2. Upgrade to version 1.38.0 (you might wait for a Win32 fix if you have Win32 clients). Fortunately there are no Windows clients in my network. :) So you say the 1.38 includes a fix for the kernel problem? Good to know. Yes, but there is one mutex bug with autochangers for which I will release new code today or tomorrow. Regards Christoph -- Best regards, Kern ( /\ V_V --- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42 plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
Re: [Bacula-users] Bad response to Append Data command.
Hello, It looks like you stumbled into a change they made in the 2.6 kernel, which will undoubtely cause a lot of people a *lot* of pain. The change prohibits a program from opening a drive in read/write mode if there is no volume in the drive. If this is what is causing your problem, you can solve it by: 1. Ensuring that there is some tape in the drive befort starting the SD, and before issuing a mount command in the console. 2. Upgrade to version 1.38.0 (you might wait for a Win32 fix if you have Win32 clients). On Thursday 03 November 2005 18:03, Christoph Haas wrote: Evening... I'm repeatedly seeing the following messages cancelling my backup jobs: == 03-Nov 08:11 torf-dir: Start Backup JobId 206, Job=BackupCatalog.2005-11-03_08.10.00 03-Nov 08:13 torf-sd: BackupCatalog.2005-11-03_08.10.00 Fatal error: device.c:317 Unable to open device /dev/nst0. ERR=dev.c:289 stored: unable to open device /dev/nst0: ERR=Input/output error 03-Nov 08:13 torf-fd: BackupCatalog.2005-11-03_08.10.00 Fatal error: job.c:1665 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data 03-Nov 08:13 torf-dir: BackupCatalog.2005-11-03_08.10.00 Error: Bacula 1.36.2 (28Feb05): 03-Nov-2005 08:13:26 == The storage daemon should be able to open /dev/nst0. lsof shows no other processes accessing it. So I don't see a reason why Bacula is unable to open that device. Once I saw that error message (from a status email telling me the backup has failed) all subsequent backups will fail with the same error. But when I restart both the director and the storage daemon I can suddenly run a successful backup. This is my Device configuration in the storage daemons config file: Device { Name = Streamer Media Type = DDS-3 Archive Device = /dev/nst0 AutomaticMount = yes; AlwaysOpen = no; RemovableMedia = yes; RandomAccess = no; VolumePollInterval = 60 OfflineOnUnmount = yes CloseOnPoll = no MaximumOpenWait = 2 days } Is there anything I can try to debug in this special case? It's a DDS-3 streamer (SCSI) driven by a 2.6.10 Linux kernel. Currently half of my jobs need to be re-done manually which is a bit annoying. I'm running Bacula version 1.36.2 on Debian/Sarge. Christoph -- Best regards, Kern ( /\ V_V --- SF.Net email is sponsored by: Tame your development challenges with Apache's Geronimo App Server. Download it for free - -and be entered to win a 42 plasma tv or your very own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users
[Bacula-users] Bad Response to Append Data Command
This error is appearing more and more. Sometimes the backups run, sometimes I get the following error: We are obviously writing data to a file, other backups are running on the machine. I am perplexed since sometimes it works, other times it fails. 23-Aug 20:36 vortech-dir: Start Backup JobId 2204, Job=unix10.2005-08-23_20.36.06 23-Aug 20:31 backup0-sd: unix10.2005-08-23_20.36.06 Fatal error: Device /backup/bacula is busy writing on another Volume. 23-Aug 20:28 unix10-fd: unix10.2005-08-23_20.36.06 Fatal error: job.c:1665 Bad response to Append Data command. Wanted 3000 OK data , got 3903 Error append data Thanks for your input, Aaron Summers [EMAIL PROTECTED] --- SF.Net email is Sponsored by the Better Software Conference EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile Plan-Driven Development * Managing Projects Teams * Testing QA Security * Process Improvement Measurement * http://www.sqe.com/bsce5sf ___ Bacula-users mailing list Bacula-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/bacula-users