Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Mick
On Saturday 17 May 2014 04:33:57 meino.cra...@gmx.de wrote:
 Hi,
 
 is there any tool in the Gentoo portage which may speed up (make it
 mopre efficient) the following task:
 
 On my HD there are data I want to copy to two identical external HDs.
 These HDs are of the same type/model and each is separately
 connectable via USB to my PC (...these two of the typical mobile external
 USB-HDs).
 
 Instead of copying the data twice from my PC to eah of the HDs I want
 to do it once...like I would be able to give the cp-command two
 instead of one target where to copy two.
 
 The result should be two identically populated external HDs, each of
 them useable/readable without the need to the other one.
 
 What tool of the portage tree I able to accomplish this task?
 
 Best regards,
 mcc


1. You could set up the two ext drives as a mirrored RAID and use your 
copying/tar-ing/dd tool of choice.


2. Or you could use pipe and tee to split the feed into any devices you want, 
e.g.:

  pv /dev/sda1 | tee (dd of=/dev/sdb1) (dd of=/dev/sdc1) 


3. Or you could use a sequential copy:

  cp -a /home /dev/sdb1/  cp -a /home /dev/sdc1


NOTES: 

a) Unlike other commands, pv will give you a progress bar so that you know how 
long your back up is taking.

b) The 3rd example above will copy sequentially, but depending on the size of 
the file(s) the second copy may be read from cache.

c) If you're doing this over the network then you can use nc in listening mode 
at the receiving end, or ssh if the network is untrusted.


I'm interested to see what other ways will be suggested for this task.

-- 
Regards,
Mick


signature.asc
Description: This is a digitally signed message part.


Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread meino . cramer
Mick michaelkintz...@gmail.com [14-05-17 09:48]:
 On Saturday 17 May 2014 04:33:57 meino.cra...@gmx.de wrote:
  Hi,
  
  is there any tool in the Gentoo portage which may speed up (make it
  mopre efficient) the following task:
  
  On my HD there are data I want to copy to two identical external HDs.
  These HDs are of the same type/model and each is separately
  connectable via USB to my PC (...these two of the typical mobile external
  USB-HDs).
  
  Instead of copying the data twice from my PC to eah of the HDs I want
  to do it once...like I would be able to give the cp-command two
  instead of one target where to copy two.
  
  The result should be two identically populated external HDs, each of
  them useable/readable without the need to the other one.
  
  What tool of the portage tree I able to accomplish this task?
  
  Best regards,
  mcc
 
 
 1. You could set up the two ext drives as a mirrored RAID and use your 
 copying/tar-ing/dd tool of choice.
 
 
 2. Or you could use pipe and tee to split the feed into any devices you want, 
 e.g.:
 
   pv /dev/sda1 | tee (dd of=/dev/sdb1) (dd of=/dev/sdc1) 
 
 
 3. Or you could use a sequential copy:
 
   cp -a /home /dev/sdb1/  cp -a /home /dev/sdc1
 
 
 NOTES: 
 
 a) Unlike other commands, pv will give you a progress bar so that you know 
 how 
 long your back up is taking.
 
 b) The 3rd example above will copy sequentially, but depending on the size of 
 the file(s) the second copy may be read from cache.
 
 c) If you're doing this over the network then you can use nc in listening 
 mode 
 at the receiving end, or ssh if the network is untrusted.
 
 
 I'm interested to see what other ways will be suggested for this task.
 
 -- 
 Regards,
 Mick

Hi Mick,

thank your reply! :)

From your numbering of the possibilities...
1.)  ...I am no RAID guru and would try this later with data, which
are not valuable...
2.) That looks interesting! Unfortunately it seems to copy device
contents on low level instead of files. The source are directory
structure -- not whole devices...
Or did I overlook an option mentioned in the manpage...?
3.) The files I want to copy are in the size of some GB each. So the
cache isnt big enough to hold ALL files for the second part.


Best regards,
mcc






Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Neil Bothwick
On Sat, 17 May 2014 09:59:08 +0200, meino.cra...@gmx.de wrote:

  3. Or you could use a sequential copy:
  
cp -a /home /dev/sdb1/  cp -a /home /dev/sdc1

 3.) The files I want to copy are in the size of some GB each. So the
 cache isnt big enough to hold ALL files for the second part.

Run the two copies simultaneously, start the first, switch to another
tab, start the second. That way the data for the second copy is always
the most recently cached.

However, I expect the speed limit here may be the USB bus unless you are
using USB 3.0 drives on different buses.


-- 
Neil Bothwick

I'll never forget the 1st time I ran Windows, but I'm trying...


signature.asc
Description: PGP signature


Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Stroller

On Sat, 17 May 2014, at 8:46 am, Mick michaelkintz...@gmail.com wrote:
 ...
 a) Unlike other commands, pv will give you a progress bar so that you know 
 how 
 long your back up is taking.

I've been using ddrescue recently - by default it seems to be faster than plain 
old regular dd.

I assume it must be possible to adjust the parameters (blocksize?) to speed dd 
up, but I just want to run the command and not have to bother tinkering.

On dumb blind copies, I'm finding ddrescue about 3x or 4x faster than dd, I 
would estimate.

Stroller.




Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Daniel Troeder
Am 17.05.2014 11:58, schrieb Neil Bothwick:
 On Sat, 17 May 2014 09:59:08 +0200, meino.cra...@gmx.de wrote:
 
 3. Or you could use a sequential copy:

   cp -a /home /dev/sdb1/  cp -a /home /dev/sdc1
 
 3.) The files I want to copy are in the size of some GB each. So the
 cache isnt big enough to hold ALL files for the second part.
 
 Run the two copies simultaneously, start the first, switch to another
 tab, start the second. That way the data for the second copy is always
 the most recently cached.
 
 However, I expect the speed limit here may be the USB bus unless you are
 using USB 3.0 drives on different buses.
I was thinking about how to make sure cache is used, and that two
simultanious cp won't work, because the progress for the two cp will
quickly diverge. But then I realized: there is no need to think about
the read cache - the limiting factor is always the writing side,
especially with USB! So IMO it doesn't matter at all how you do it!

I guess two simultaneous cp will be the same as two sequential cp,
except if you have two separate USB-buses. Usually you have just one
externally connectible, use lsusb -t to check.

If you have less that 2 times the size of your files, IMO simultaneous
cp will be worse, because Linux (don't know if USB-subsystem or cp)
creates big buffers when cp'ing (check with free -m), and you'll
probably get into memory trouble.


Greetings,
Daniel

PS: Quickest way is always to open USB-case and plug SATA cable from
motherboard into drive. With 80GB it's always worth the trouble.

-- 
Get my PGP key at:
*
http://keyserver.ubuntu.com:11371/pks/lookup?op=getsearch=0x837FB8B5BB9D4887
* $ gpg --recv-keys --keyserver keyserver.ubuntu.com 0xBB9D4887



signature.asc
Description: OpenPGP digital signature


Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread meino . cramer
Daniel Troeder dan...@admin-box.com [14-05-17 16:36]:
 Am 17.05.2014 11:58, schrieb Neil Bothwick:
  On Sat, 17 May 2014 09:59:08 +0200, meino.cra...@gmx.de wrote:
  
  3. Or you could use a sequential copy:
 
cp -a /home /dev/sdb1/  cp -a /home /dev/sdc1
  
  3.) The files I want to copy are in the size of some GB each. So the
  cache isnt big enough to hold ALL files for the second part.
  
  Run the two copies simultaneously, start the first, switch to another
  tab, start the second. That way the data for the second copy is always
  the most recently cached.
  
  However, I expect the speed limit here may be the USB bus unless you are
  using USB 3.0 drives on different buses.
 I was thinking about how to make sure cache is used, and that two
 simultanious cp won't work, because the progress for the two cp will
 quickly diverge. But then I realized: there is no need to think about
 the read cache - the limiting factor is always the writing side,
 especially with USB! So IMO it doesn't matter at all how you do it!
 
 I guess two simultaneous cp will be the same as two sequential cp,
 except if you have two separate USB-buses. Usually you have just one
 externally connectible, use lsusb -t to check.
 
 If you have less that 2 times the size of your files, IMO simultaneous
 cp will be worse, because Linux (don't know if USB-subsystem or cp)
 creates big buffers when cp'ing (check with free -m), and you'll
 probably get into memory trouble.
 
 
 Greetings,
 Daniel
 
 PS: Quickest way is always to open USB-case and plug SATA cable from
 motherboard into drive. With 80GB it's always worth the trouble.
 
 -- 
 Get my PGP key at:
 *
 http://keyserver.ubuntu.com:11371/pks/lookup?op=getsearch=0x837FB8B5BB9D4887
 * $ gpg --recv-keys --keyserver keyserver.ubuntu.com 0xBB9D4887
 

Hi,

thank you /.*/  for /.*/ replies ! :)

...one aspect is missing: The load (that is the I/O on the 
source hd).
If this hd is busy spitting out the data twice, it cannot
serve outhe jobs twice as long...

H

Best regards,
mcc

PS:
On the source system
# lsusb -t 
/:  Bus 09.Port 1: Dev 1, Class=root_hub, Driver=ohci-pci/4p, 12M
/:  Bus 08.Port 1: Dev 1, Class=root_hub, Driver=ohci-pci/2p, 12M
/:  Bus 07.Port 1: Dev 1, Class=root_hub, Driver=ohci-pci/5p, 12M
|__ Port 2: Dev 2, If 0, Class=Human Interface Device, Driver=usbhid, 1.5M
|__ Port 2: Dev 2, If 1, Class=Human Interface Device, Driver=usbhid, 1.5M
/:  Bus 06.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/2p, 5000M
/:  Bus 05.Port 1: Dev 1, Class=root_hub, Driver=xhci_hcd/2p, 480M
/:  Bus 04.Port 1: Dev 1, Class=root_hub, Driver=ohci-pci/5p, 12M
|__ Port 1: Dev 2, If 0, Class=Human Interface Device, Driver=usbhid, 1.5M
|__ Port 3: Dev 3, If 0, Class=Audio, Driver=snd-usb-audio, 12M
|__ Port 3: Dev 3, If 1, Class=Audio, Driver=snd-usb-audio, 12M
/:  Bus 03.Port 1: Dev 1, Class=root_hub, Driver=ehci-pci/4p, 480M
/:  Bus 02.Port 1: Dev 1, Class=root_hub, Driver=ehci-pci/5p, 480M
/:  Bus 01.Port 1: Dev 1, Class=root_hub, Driver=ehci-pci/5p, 480M
|__ Port 4: Dev 4, If 0, Class=Hub, Driver=hub/6p, 480M






Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Neil Bothwick
On Sat, 17 May 2014 14:44:04 +0100, Stroller wrote:

  a) Unlike other commands, pv will give you a progress bar so that you
  know how long your back up is taking.  
 
 I've been using ddrescue recently - by default it seems to be faster
 than plain old regular dd.
 
 I assume it must be possible to adjust the parameters (blocksize?) to
 speed dd up, but I just want to run the command and not have to bother
 tinkering.

Use the bs option. The default is 512, which is incredibly slow, bs=4k
makes a substantial difference. However, I usually use dcfldd these days,
whch works out the best block size to use and adds progress reporting.


-- 
Neil Bothwick

Never underestimate the bandwidth of a station wagon full of tapes!


signature.asc
Description: PGP signature


Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Mick
On Saturday 17 May 2014 08:59:08 meino.cra...@gmx.de wrote:
 Mick michaelkintz...@gmail.com [14-05-17 09:48]:
  On Saturday 17 May 2014 04:33:57 meino.cra...@gmx.de wrote:
   Hi,
   
   is there any tool in the Gentoo portage which may speed up (make it
   mopre efficient) the following task:
   
   On my HD there are data I want to copy to two identical external HDs.
   These HDs are of the same type/model and each is separately
   connectable via USB to my PC (...these two of the typical mobile
   external USB-HDs).
   
   Instead of copying the data twice from my PC to eah of the HDs I want
   to do it once...like I would be able to give the cp-command two
   instead of one target where to copy two.
   
   The result should be two identically populated external HDs, each of
   them useable/readable without the need to the other one.
   
   What tool of the portage tree I able to accomplish this task?
   
   Best regards,
   mcc
  
  1. You could set up the two ext drives as a mirrored RAID and use your
  copying/tar-ing/dd tool of choice.
  
  
  2. Or you could use pipe and tee to split the feed into any devices you
  want,
  
  e.g.:
pv /dev/sda1 | tee (dd of=/dev/sdb1) (dd of=/dev/sdc1)
  
  3. Or you could use a sequential copy:
cp -a /home /dev/sdb1/  cp -a /home /dev/sdc1
  
  NOTES:
  
  a) Unlike other commands, pv will give you a progress bar so that you
  know how long your back up is taking.
  
  b) The 3rd example above will copy sequentially, but depending on the
  size of the file(s) the second copy may be read from cache.
  
  c) If you're doing this over the network then you can use nc in listening
  mode at the receiving end, or ssh if the network is untrusted.
  
  
  I'm interested to see what other ways will be suggested for this task.
 
 Hi Mick,
 
 thank your reply! :)
 
 From your numbering of the possibilities...
 1.)  ...I am no RAID guru and would try this later with data, which
 are not valuable...

If the drives are not permanently connected, then you will have to assemble 
the RAID each time you want to copy something, after you plug them in.  
Perhaps not the most efficient method.


 2.) That looks interesting! Unfortunately it seems to copy device
 contents on low level instead of files. The source are directory
 structure -- not whole devices...

Well, I just offered an example for a whole drive.  You will need to point it 
to the file(s) you want to copy over.  The pv command works like cat; 

e.g. cat myfile | tee (blah ...) 

is the same like:

pv myfile | tee (blah ...)

 Or did I overlook an option mentioned in the manpage...?
 3.) The files I want to copy are in the size of some GB each. So the
 cache isnt big enough to hold ALL files for the second part.
 
 
 Best regards,
 mcc

-- 
Regards,
Mick


signature.asc
Description: This is a digitally signed message part.


Re: [gentoo-user] Copying data efficiently

2014-05-17 Thread Neil Bothwick
On Sat, 17 May 2014 20:41:15 +0100, Mick wrote:

  From your numbering of the possibilities...
  1.)  ...I am no RAID guru and would try this later with data, which
  are not valuable...  
 
 If the drives are not permanently connected, then you will have to
 assemble the RAID each time you want to copy something, after you plug
 them in. Perhaps not the most efficient method.

Whatever method is used will likely consist of multiple commands, and
probably a good few arguments, lending itself best to scripting. In that
case, the RAID approach becomes no more difficult to use once set up.


-- 
Neil Bothwick

The Computer is the logical advancement of humankind:
intelligence without morality.


signature.asc
Description: PGP signature


[gentoo-user] Copying data efficiently

2014-05-16 Thread meino . cramer
Hi,

is there any tool in the Gentoo portage which may speed up (make it
mopre efficient) the following task:

On my HD there are data I want to copy to two identical external HDs.
These HDs are of the same type/model and each is separately
connectable via USB to my PC (...these two of the typical mobile external
USB-HDs).

Instead of copying the data twice from my PC to eah of the HDs I want
to do it once...like I would be able to give the cp-command two
instead of one target where to copy two.

The result should be two identically populated external HDs, each of
them useable/readable without the need to the other one.

What tool of the portage tree I able to accomplish this task?

Best regards,
mcc