Re: cross-platform backup tool incremental rdiff-backup takes long

2022-10-19 Thread Jonas Schöpf

Thank you for your fast reply!

Deleting a file is not a simple operation. Since rdiff-backup always 
works backward from the current mirror, restoring a file as of some 
previous date requires storing a snapshot of the last known state of 
that file in the increments. By default, those snapshots are compressed 
except for file names that match the --no-compression-regexp (see 
Globals.py for the default expression), and that compression can take a 
lot of time.

This makes totally sense!



You might want to take a look at the "--no-compression" option in the 
manpage.




I will have a look at this option.

Cheers,
Jonas



Re: cross-platform backup tool incremental rdiff-backup takes long

2022-10-18 Thread Robert Nichols

On 10/18/22 1:37 AM, Jonas Schöpf wrote:

I deleted a lot of stuff on my machine (around 200GB) and if I do a new backup 
I have to abort it after 9 hours as it does not finish.

iostat -x -m 3 shows that %util is always between 70 and 95.
So I assume that IO is again a bottleneck, but shouldn't it be easy for 
rdiff-backup to just delete the files from the backup?
Is there a way how I can speed up this process?


Deleting a file is not a simple operation. Since rdiff-backup always works 
backward from the current mirror, restoring a file as of some previous date 
requires storing a snapshot of the last known state of that file in the 
increments. By default, those snapshots are compressed except for file names 
that match the --no-compression-regexp (see Globals.py for the default 
expression), and that compression can take a lot of time.

You might want to take a look at the "--no-compression" option in the manpage.

--
Bob Nichols "NOSPAM" is really part of my email address.
Do NOT delete it.




Re: cross-platform backup tool incremental rdiff-backup takes long

2022-10-18 Thread Jonas Schöpf

Hi all!

I am using rdiff-backup without flaws since this conversation here last 
year. Although I always need to add the "--no-fsync" argument in order 
to get fast (20-30min) backups.


Unfortunately, I have a problem with the current backup.
I deleted a lot of stuff on my machine (around 200GB) and if I do a new 
backup I have to abort it after 9 hours as it does not finish.


iostat -x -m 3 shows that %util is always between 70 and 95.
So I assume that IO is again a bottleneck, but shouldn't it be easy for 
rdiff-backup to just delete the files from the backup?

Is there a way how I can speed up this process?

Thank you very much and cheers,
Jonas

On 6/2/21 13:18, Patrik Dufresne wrote:

Hi Jonas,

Great news! Fsync is useful but not in you scenario. I'm glad it worked 
for you.


Don't hesitate to come back of you have other questions.




Re: incremental rdiff-backup takes long

2021-06-02 Thread Patrik Dufresne
Hi Jonas,

Great news! Fsync is useful but not in you scenario. I'm glad it worked for
you.

Don't hesitate to come back of you have other questions.

On Wed., Jun. 2, 2021, 4:56 a.m. Jonas Schöpf, 
wrote:

> Hi!
>
> Thank you again for your help!
>
> With the "--no-fsync" argument, the backup succeeded after 13 minutes.
>
> Cheers,
> Jonas
>
> On 6/1/21 9:53 PM, Jonas Schöpf wrote:
> > Hi Patrik!
> >
> > On 6/1/21 4:48 PM, Patrik Dufresne wrote:
> >> Hello Jonas,
> >>
> >> Welcome to rdiff-backup ! Let me try to help you a bit more with you
> >> problem.
> > Thanks :)
> >>
> >> First, the first backup always take alot of time because rdiff-backup is
> >> not like a simple copy and alot of overhead are done in the background
> >> to put in place the first version of the data.
> >>
> >> Second, with your use case of 400GB, I would not expect a very long
> >> elapsed time for subsequent backup. I have a couple of backups with
> >> similar size and it roughly takes ~10-20 minutes over the Internet with
> >> 400Mbps. But you might not have the same setup as I do.
> >>
> >> If $HOME and $DRIVE are both local, It's quite simple to narrow the
> problem.
> >>
> >> While running the backup with you command line, you might want to check
> >> how the IO of your computer behaves. In the command line, try executring
> >> `iostat -x -m 3` and take a look at the '%util' column. It displays the
> >> percentage utilisation of the disk. If the value is near 90%, you have
> >> found the culprit. The IO is your bottleneck.
> >>
> >> To help the situation, you might want to run the backup without fsync.
> >> When Fsync is enabled, it forces the system to wait until the data is
> >> written to the disk and persisted. For slow drives on USB, it may have a
> >> huge impact on the performance. So try running rdiff-backup with
> >> `--no-fsync`.
> >
> > Thank you for your help.
> > Sounds like this is the problem!
> >
> > I will try your this tomorrow and report back if it works :)
> >
> > Thank you all for your help!
> >
> > Cheers,
> > Jonas
> >
> >
>
> --
> GPG fingerprint: 0BF3 B30B F1D5 6556 795E  F68F 8626 F794 FE62 BE1F
>
>


Re: incremental rdiff-backup takes long

2021-06-02 Thread Jonas Schöpf
Hi!

Thank you again for your help!

With the "--no-fsync" argument, the backup succeeded after 13 minutes.

Cheers,
Jonas

On 6/1/21 9:53 PM, Jonas Schöpf wrote:
> Hi Patrik!
> 
> On 6/1/21 4:48 PM, Patrik Dufresne wrote:
>> Hello Jonas,
>>
>> Welcome to rdiff-backup ! Let me try to help you a bit more with you
>> problem.
> Thanks :)
>>
>> First, the first backup always take alot of time because rdiff-backup is
>> not like a simple copy and alot of overhead are done in the background
>> to put in place the first version of the data.
>>
>> Second, with your use case of 400GB, I would not expect a very long
>> elapsed time for subsequent backup. I have a couple of backups with
>> similar size and it roughly takes ~10-20 minutes over the Internet with
>> 400Mbps. But you might not have the same setup as I do.
>>
>> If $HOME and $DRIVE are both local, It's quite simple to narrow the problem.
>>
>> While running the backup with you command line, you might want to check
>> how the IO of your computer behaves. In the command line, try executring
>> `iostat -x -m 3` and take a look at the '%util' column. It displays the
>> percentage utilisation of the disk. If the value is near 90%, you have
>> found the culprit. The IO is your bottleneck.
>>
>> To help the situation, you might want to run the backup without fsync.
>> When Fsync is enabled, it forces the system to wait until the data is
>> written to the disk and persisted. For slow drives on USB, it may have a
>> huge impact on the performance. So try running rdiff-backup with
>> `--no-fsync`.
> 
> Thank you for your help.
> Sounds like this is the problem!
> 
> I will try your this tomorrow and report back if it works :)
> 
> Thank you all for your help!
> 
> Cheers,
> Jonas
> 
> 

-- 
GPG fingerprint: 0BF3 B30B F1D5 6556 795E  F68F 8626 F794 FE62 BE1F



Re: incremental rdiff-backup takes long

2021-06-01 Thread Jonas Schöpf
Hi Patrik!

On 6/1/21 4:48 PM, Patrik Dufresne wrote:
> Hello Jonas,
> 
> Welcome to rdiff-backup ! Let me try to help you a bit more with you
> problem.
Thanks :)
> 
> First, the first backup always take alot of time because rdiff-backup is
> not like a simple copy and alot of overhead are done in the background
> to put in place the first version of the data.
> 
> Second, with your use case of 400GB, I would not expect a very long
> elapsed time for subsequent backup. I have a couple of backups with
> similar size and it roughly takes ~10-20 minutes over the Internet with
> 400Mbps. But you might not have the same setup as I do.
> 
> If $HOME and $DRIVE are both local, It's quite simple to narrow the problem.
> 
> While running the backup with you command line, you might want to check
> how the IO of your computer behaves. In the command line, try executring
> `iostat -x -m 3` and take a look at the '%util' column. It displays the
> percentage utilisation of the disk. If the value is near 90%, you have
> found the culprit. The IO is your bottleneck.
> 
> To help the situation, you might want to run the backup without fsync.
> When Fsync is enabled, it forces the system to wait until the data is
> written to the disk and persisted. For slow drives on USB, it may have a
> huge impact on the performance. So try running rdiff-backup with
> `--no-fsync`.

Thank you for your help.
Sounds like this is the problem!

I will try your this tomorrow and report back if it works :)

Thank you all for your help!

Cheers,
Jonas


-- 
GPG fingerprint: 0BF3 B30B F1D5 6556 795E  F68F 8626 F794 FE62 BE1F



Re: incremental rdiff-backup takes long

2021-06-01 Thread Jonas Schöpf



On 6/1/21 4:20 PM, Yves Bellefeuille wrote:
> "Jonas Schoepf" 
> 
>>  My drive is connected via USB.
> 
> I think we're on the right track. What version of USB?
I have a USB 3.2 Gen 1.
> 
>>  Before using rdiff-backup I used just rsync, where the initial backup
>>  took also quite long, but the following backups took ~20 minutes.
> 
> rdiff-backup will always be slower than rsync, since rdiff-backup
> keeps previous versions, unlike rsync.

Yes, I understand that. But the difference can't be 20 minutes vs. more
than 8 hours...

- Jonas


-- 
GPG fingerprint: 0BF3 B30B F1D5 6556 795E  F68F 8626 F794 FE62 BE1F



Re: incremental rdiff-backup takes long

2021-06-01 Thread Alvin Starr via Any discussion of rdiff-backup

On 2021-06-01 10:32 a.m., Yves Bellefeuille wrote:

Alvin Starr  wrote:


I thought rdiff-backup like rsync keeps track of the file metadata
like size and creation date to skip reading the data again if the
file has not been updated?

I assume rdiff-backup does the same thing to decide if a file has
changed, but if there's a change, rsync simply throws away the old
version.

rdiff-backup notes the changes and keeps older versions (depending on
the options) so you're not limited to only one previous version.
That's the whole point.


That may be the issue then.
If the first run takes as long as subsequent runs then something may be 
interfering with the file checks used to avoid reading the data for 
comparison.


--
Alvin Starr   ||   land:  (647)478-6285
Netvel Inc.   ||   Cell:  (416)806-0133
al...@netvel.net  ||




Re: incremental rdiff-backup takes long

2021-06-01 Thread Patrik Dufresne
Hello Jonas,

Welcome to rdiff-backup ! Let me try to help you a bit more with you
problem.

First, the first backup always take alot of time because rdiff-backup is
not like a simple copy and alot of overhead are done in the background to
put in place the first version of the data.

Second, with your use case of 400GB, I would not expect a very long elapsed
time for subsequent backup. I have a couple of backups with similar size
and it roughly takes ~10-20 minutes over the Internet with 400Mbps. But you
might not have the same setup as I do.

If $HOME and $DRIVE are both local, It's quite simple to narrow the problem.

While running the backup with you command line, you might want to check how
the IO of your computer behaves. In the command line, try executring
`iostat -x -m 3` and take a look at the '%util' column. It displays the
percentage utilisation of the disk. If the value is near 90%, you have
found the culprit. The IO is your bottleneck.

To help the situation, you might want to run the backup without fsync. When
Fsync is enabled, it forces the system to wait until the data is written to
the disk and persisted. For slow drives on USB, it may have a huge impact
on the performance. So try running rdiff-backup with `--no-fsync`.



On Tue, Jun 1, 2021 at 10:33 AM Yves Bellefeuille  wrote:

> Alvin Starr  wrote:
>
> > I thought rdiff-backup like rsync keeps track of the file metadata
> > like size and creation date to skip reading the data again if the
> > file has not been updated?
>
> I assume rdiff-backup does the same thing to decide if a file has
> changed, but if there's a change, rsync simply throws away the old
> version.
>
> rdiff-backup notes the changes and keeps older versions (depending on
> the options) so you're not limited to only one previous version.
> That's the whole point.
>
> --
> Yves Bellefeuille
> 
>
>
>
>

-- 
IKUS Software inc.
https://www.ikus-soft.com/
514-971-6442
130 rue Doris
St-Colomban, QC J5K 1T9


Re: incremental rdiff-backup takes long

2021-06-01 Thread Yves Bellefeuille
Alvin Starr  wrote:

> I thought rdiff-backup like rsync keeps track of the file metadata
> like size and creation date to skip reading the data again if the
> file has not been updated?

I assume rdiff-backup does the same thing to decide if a file has
changed, but if there's a change, rsync simply throws away the old
version.

rdiff-backup notes the changes and keeps older versions (depending on
the options) so you're not limited to only one previous version.
That's the whole point.

-- 
Yves Bellefeuille






Re: incremental rdiff-backup takes long

2021-06-01 Thread Alvin Starr via Any discussion of rdiff-backup

On 2021-06-01 10:20 a.m., Yves Bellefeuille wrote:

"Jonas Schoepf" 


  My drive is connected via USB.

I think we're on the right track. What version of USB?


  Before using rdiff-backup I used just rsync, where the initial backup
  took also quite long, but the following backups took ~20 minutes.

rdiff-backup will always be slower than rsync, since rdiff-backup
keeps previous versions, unlike rsync.

I thought rdiff-backup like rsync keeps track of the file metadata like 
size and creation date to skip reading the data again if the file has 
not been updated?
That should cause the subsequent runs to be much faster even with slow 
disks.


--
Alvin Starr   ||   land:  (647)478-6285
Netvel Inc.   ||   Cell:  (416)806-0133
al...@netvel.net  ||




Re: incremental rdiff-backup takes long

2021-06-01 Thread Yves Bellefeuille
"Jonas Schoepf" 

>  My drive is connected via USB.

I think we're on the right track. What version of USB?

>  Before using rdiff-backup I used just rsync, where the initial backup
>  took also quite long, but the following backups took ~20 minutes.

rdiff-backup will always be slower than rsync, since rdiff-backup
keeps previous versions, unlike rsync.

-- 
Yves Bellefeuille






Re: incremental rdiff-backup takes long

2021-06-01 Thread Jonas Schöpf
My drive is connected via USB.

Before using rdiff-backup I used just rsync, where the initial backup
took also quite long, but the following backups took ~20 minutes.

- Jonas

On 6/1/21 4:03 PM, Yves Bellefeuille wrote:
> Jonas Schoepf  wrote:
> 
>>  I understand that the initial backup is really slow and took more
>>  than ~10h for my 400 GB.
> 
> 
> I'd suspect slow reading or writing speeds. How are the drives
> connected (SATA, USB, network, etc.)?
> 

-- 
GPG fingerprint: 0BF3 B30B F1D5 6556 795E  F68F 8626 F794 FE62 BE1F



Re: incremental rdiff-backup takes long

2021-06-01 Thread Yves Bellefeuille
Jonas Schoepf  wrote:

>  I understand that the initial backup is really slow and took more
>  than ~10h for my 400 GB.


I'd suspect slow reading or writing speeds. How are the drives
connected (SATA, USB, network, etc.)?

-- 
Yves Bellefeuille






Re: incremental rdiff-backup takes long

2021-06-01 Thread Jonas Schöpf
Thank you for the quick response!

The current backup runs since 6h and it only uses around 1-2% of the CPU.
There is also no gzip process running.
So it might be something else...

- Jonas

On 6/1/21 3:15 PM, Reio Remma wrote:
> On 01.06.2021 16:12, Jonas Schöpf wrote:
>> Hi all!
>>
>> I found out about rdiff-backup two weeks ago after I searched for a
>> method to get incremental backups of my system.
>> I experience some problems while doing my backup and I hope that
>> somebody can tell me what I am doing wrong.
>>
>> I understand that the initial backup is really slow and took more than
>> ~10h for my 400 GB.
>> Now if I do another backup, I would expect that it is quite fast, but
>> after it exceeded 8h and I just aborted the whole backup process.
>>
>> I use the following command:
>>> rdiff-backup --terminal-verbosity 8 --print-statistics \
>>>     --exclude-globbing-filelist $HOME/.excludes \
>>>     $HOME $DRIVE
>> Can anybody tell me what I am doing wrong?
>>
>> Thank you for your help!
>>
>> Cheers,
>> Jonas
> 
> Hello! A while ago I had a similar problem and I narrowed it down to
> rdiff-backup compressing files that were already compressed. I would
> suggest having a look at CPU usage etc when the backup is running. iirc
> it uses gzip which is fairly processor hungry.
> 
> Good luck!
> Reio
> 

-- 
GPG fingerprint: 0BF3 B30B F1D5 6556 795E  F68F 8626 F794 FE62 BE1F



Re: incremental rdiff-backup takes long

2021-06-01 Thread Reio Remma

On 01.06.2021 16:12, Jonas Schöpf wrote:

Hi all!

I found out about rdiff-backup two weeks ago after I searched for a
method to get incremental backups of my system.
I experience some problems while doing my backup and I hope that
somebody can tell me what I am doing wrong.

I understand that the initial backup is really slow and took more than
~10h for my 400 GB.
Now if I do another backup, I would expect that it is quite fast, but
after it exceeded 8h and I just aborted the whole backup process.

I use the following command:

rdiff-backup --terminal-verbosity 8 --print-statistics \
--exclude-globbing-filelist $HOME/.excludes \
$HOME $DRIVE

Can anybody tell me what I am doing wrong?

Thank you for your help!

Cheers,
Jonas


Hello! A while ago I had a similar problem and I narrowed it down to 
rdiff-backup compressing files that were already compressed. I would 
suggest having a look at CPU usage etc when the backup is running. iirc 
it uses gzip which is fairly processor hungry.


Good luck!
Reio



incremental rdiff-backup takes long

2021-06-01 Thread Jonas Schöpf
Hi all!

I found out about rdiff-backup two weeks ago after I searched for a
method to get incremental backups of my system.
I experience some problems while doing my backup and I hope that
somebody can tell me what I am doing wrong.

I understand that the initial backup is really slow and took more than
~10h for my 400 GB.
Now if I do another backup, I would expect that it is quite fast, but
after it exceeded 8h and I just aborted the whole backup process.

I use the following command:
> rdiff-backup --terminal-verbosity 8 --print-statistics \
>--exclude-globbing-filelist $HOME/.excludes \
>$HOME $DRIVE

Can anybody tell me what I am doing wrong?

Thank you for your help!

Cheers,
Jonas