Hi again.

As expected, the warning occurred again when job 94511 was copied, this time 
along with two new warnings for this specific job:
Warning: dird/catreq.cc:608 MD5 digest not same File=3 as attributes=2
Warning: dird/catreq.cc:608 MD5 digest not same File=18 as attributes=17
Warning: dird/catreq.cc:608 MD5 digest not same File=23 as attributes=22

The restore of the copy also errored in the same manner as before:
 2024-02-26 10:43:50 bareos-fd JobId 95784: Error: findlib/attribs.cc:381 
File size of restored file 
/tmp/bareos-restores/var/backup/old/2024-02-07_23:30/backup_www_1100CC.tar.gz 
not correct. Original 738212473, restored 442105856.
 2024-02-26 10:44:10 bareos-fd JobId 95784: Error: findlib/attribs.cc:381 
File size of restored file 
/tmp/bareos-restores/var/backup/old/2024-02-12_23:30/backup_www_1100CC.tar.gz 
not correct. Original 856739265, restored 783548416.
 2024-02-26 10:44:17 bareos-fd JobId 95784: Error: findlib/attribs.cc:381 
File size of restored file 
/tmp/bareos-restores/var/backup/old/2024-02-14_23:30/backup_www_1100CC.tar.gz 
not correct. Original 856739984, restored 31064064.

For now I'll try disabling autoxflate and see if that solves the issue.

Best regards,
Samuel


On Friday, February 23, 2024 at 2:02:28 PM UTC+1 Samuel wrote:

> Thanks for looking into it!
>
> Hm, there seems to be no file with fileindex=3 in the database for the 
> copy:
> Enter SQL query: select * from file where jobid=94146 and fileindex=3;
> No results to list.
>
> All files that are being backed up are just regular files, nothing special.
>
> The copy runs locally on the director's host from its SD to the same SD.
>
> Here are files 2 and 3 of today's consolidated incremental backup, which will 
> be copied tomorrow and will most likely lead to the same warning again, as it 
> has for the last few days (for this specific fileset it always warns about 
> `File=3 attributes=2`):
>
> Enter SQL query: select *, decode_lstat(lstat) from file where jobid=94511 
> and fileindex=2;
>
> fileid:       484,575,603
> fileindex:    2
> jobid:        94511
> pathid:       910,901
> deltaseq:     0
> markid:       0
> fhinfo:       0
> fhnode:       0
> lstat:        P0D CgA4 IGk B A i A sADp5 BAA FgAo BlxAt1 BlxASF BlxVYH A A d
> md5:          D/VouduY5TF4KFacjRE7Hw
> name:         backup_www_1100CC.tar.gz
> decode_lstat: (64771,655416,33188,1,0,34,0,738212473,4096,1441832,1707346805,1707345029,1707431431,0,0,29)
>
> Enter SQL query: select *, decode_lstat(lstat) from file where jobid=94511 
> and fileindex=3;
>
> fileid:       484,575,604
> fileindex:    3
> jobid:        94511
> pathid:       910,901
> deltaseq:     0
> markid:       0
> fhinfo:       0
> fhnode:       0
> lstat:        P0D CgA5 IGk B A i A BA9gA BAA IHw BlxAt8 BlxASH BlxVYH A A d
> md5:          J9h+OdR/6XCqhqpJdJvYtw
> name:         backup_databases_1100CC.tar
> decode_lstat: (64771,655417,33188,1,0,34,0,17029120,4096,33264,1707346812,1707345031,1707431431,0,0,29)
>
>
> Restore of this job (jobid=94511) works just fine:
>
> 23-Feb 13:44 bareos-sd JobId 94559: Releasing device 
> "FileDevice-ReadOnly-0005" (/backup_1/bareos).
> 23-Feb 13:44 bareos-dir JobId 94559: Max configured use duration=82,800 
> sec. exceeded. Marking Volume "AI-Consolidated-1771" as Used.
> 23-Feb 13:44 bareos-dir JobId 94559: Bareos bareos-dir 
> 23.0.2~pre32.0a0e55739 (31Jan24):
>   Build OS:               Ubuntu 20.04.5 LTS
>   JobId:                  94559
>   Job:                    Restore-Files.2024-02-23_13.43.47_38
>   Restore Client:         "bareos-fd" 23.0.2~pre32.0a0e55739 (31Jan24) 
> Ubuntu 20.04.5 LTS,ubuntu
>   Start time:             23-Feb-2024 13:43:49
>   End time:               23-Feb-2024 13:44:20
>   Elapsed time:           31 secs
>   Files Expected:         33
>   Files Restored:         33
>   Bytes Restored:         9,143,514,060
>   Rate:                   294952.1 KB/s
>   FD Errors:              0
>   FD termination status:  OK
>   SD termination status:  OK
>   Bareos binary info:     Bareos community build (UNSUPPORTED): Get 
> professional support from https://www.bareos.com
>   Job triggered by:       User
>   Termination:            Restore OK
>
> I'll also try restoring its copy from tape once it has been copied.
>
> Best regards,
> Samuel
> On Friday, February 23, 2024 at 1:16:32 PM UTC+1 Sebastian Sura wrote:
>
>> I meant to say File 3 has no stream 1, sorry for the confusion!
>> On 23.02.24 at 13:15, Sebastian Sura wrote:
>>
>> Hi Samuel,
>>
>> Thanks for gathering this info. I'll look into the bscan issue as 
>> well, though I think this might be a known issue. 
>> Regardless, regarding your actual issue: The bscan output shows that File 
>> 3 is missing its attributes (it has no stream 3).
>> This confuses the director, as it was never told that file 3 had started 
>> being backed up, and this is why you get the warning
>> message from the director.
>>
>> The same happens during the restore: the filedaemon was never told 
>> that a new file had started (because it never got the attribute stream),
>> so it basically merged File 2 and File 3 into one file. Even with 
>> this in mind, I think the restore should have caught that and issued a 
>> warning.
>> I'll look into why this did not happen.
>>
>> If you add up all the data records (stream=2) with FileIndex=2 and 
>> FileIndex=3, you will get 738212473 bytes, which is exactly what the 
>> filedaemon reported as the size for file 2. Since the log contains the size 
>> it expected, I imagine that you could manually restore file 3 by splitting 
>> file 2 in two.
>>
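
The addition described above can be checked against the saved bscan output. The following is a minimal sketch, not a Bareos tool: it assumes the record lines look like the `--list-records` excerpt quoted in this thread (`FileIndex=… Stream=… len=…`), that stream 1 carries the attributes and stream 2 the data (as stated in this thread), and the name `summarize` is made up for illustration. It sums the data record lengths per FileIndex and lists every FileIndex that has data records but no attribute record.

```python
import re
from collections import defaultdict

# Matches the fields of a record line as printed in the bscan excerpt
# quoted in this thread; other bscan versions may format lines differently.
RECORD_RE = re.compile(r"FileIndex=(-?\d+)\s+Stream=(\d+)\s+len=(\d+)")

ATTRIBUTES_STREAM = 1  # assumption taken from the thread ("File 3 has no stream 1")
DATA_STREAM = 2        # assumption taken from the thread ("data records (stream=2)")


def summarize(text):
    """Return (data bytes per FileIndex, FileIndexes with data but no attributes)."""
    data_bytes = defaultdict(int)
    has_attributes = set()
    for match in RECORD_RE.finditer(text):
        file_index, stream, length = (int(group) for group in match.groups())
        if file_index <= 0:
            # Skip non-file records such as the FileIndex=-5 line in the excerpt.
            continue
        if stream == ATTRIBUTES_STREAM:
            has_attributes.add(file_index)
        elif stream == DATA_STREAM:
            data_bytes[file_index] += length
    missing = sorted(index for index in data_bytes if index not in has_attributes)
    return dict(data_bytes), missing
```

For example, `totals, missing = summarize(open("records.txt").read())` reads the file written by the `tee records.txt` command quoted in this thread; `totals[2] + totals[3]` is then the sum to compare against the size reported in the restore error, and `missing` shows which files lack an attribute record.
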
>> Can you check which file FileIndex 3 corresponds to? Is it a special kind 
>> of file or just a normal one?
>>
>> I'll try to see if I can reproduce your issue in the copy system test. Do 
>> you do a local copy (copying to the same SD) or a remote one?
>>
>> Kind Regards
>>
>> Sebastian Sura
>> On 23.02.24 at 12:56, 'Samuel' via bareos-users wrote:
>>
>> Hi Sebastian,
>>
>> After looking at the logs some more I noticed that all jobs with this 
>> warning have in common that they're using compression (LZ4) and autoxflate 
>> on replication.
>> Perhaps this combination is still not entirely fixed yet. I'll try 
>> disabling autoxflate in the coming days.
>>
>> The entire result of bscan is in the attachment. However here's the last 
>> part of it where bscan seems to abort (?):
>> $ sudo -u bareos bscan -b md5_digest_error_copy_only.bsr --list-records 
>> TapeDevice2 2>&1 | tee records.txt
>> ...
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=23 Stream=2 len=65536
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=23 Stream=2 len=65536
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=23 Stream=2 len=65536
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=23 Stream=2 len=22528
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=23 Stream=3 len=16
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=24 Stream=1 len=100
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=25 Stream=1 len=83
>> bscan: stored/bscan.cc:494-0 Record: SessId=3480 SessTim=1707222936 
>> FileIndex=-5 Stream=94145 len=193
>> bscan: stored/bscan.cc:681-0 Could not find SessId=3480 
>> SessTime=1707222936 for EOS record.
>> Records would have been added or updated in the catalog:
>>       0 Media
>>       1 Pool
>>       0 Job
>>       0 File
>>       0 RestoreObject
>> 23-Feb 12:05 bscan JobId 0: Releasing device "TapeDevice2" 
>> (/dev/tape/by-id/scsi-35000e111c71ac0bf-nst).
>>
>> I also tried restoring the copy which ends in error.
>>
>> Most files are restored successfully except for one file which happens to 
>> have FileIndex=2 (corresponding to the `attributes=2` in the warning?):
>> Enter SQL query: select path,name from file,path where jobid=94146 and 
>> fileindex=2 and file.pathid=path.pathid;
>> +-----------------------------------+--------------------------+
>> | path                              | name                     |
>> +-----------------------------------+--------------------------+
>> | /var/backup/old/2024-02-07_23:30/ | backup_www_1100CC.tar.gz |
>> +-----------------------------------+--------------------------+
>>
>> A full restore of the original non-copy backup jobs works fine, as do 
>> restores of other copy jobs on tape of filesets that don't use compression.
>> The original job that was copied (93682) doesn't exist anymore; I think 
>> it was an always-incremental consolidated incremental backup which got 
>> consolidated again the following day.
>>
>> Best regards,
>> Samuel
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "bareos-users" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected].
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/bareos-users/e766e8c6-5b58-424f-816f-f7c5d7252180n%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/bareos-users/e766e8c6-5b58-424f-816f-f7c5d7252180n%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>> -- 
>>  Sebastian Sura                  [email protected]
>>  Bareos GmbH & Co. KG            Phone: +49 221 630693-0
>>  https://www.bareos.com
>>  Registered office: Köln | Local court (Amtsgericht) Köln: HRA 29646
>>  General partner: Bareos Verwaltungs-GmbH
>>  Managing directors: Stephan Dühr, Jörg Steffens, Philipp Storz
>>
>>

