Re: [Bug-tar] Adding non-materialized files from a stream

Ryan Jud Hughes Tue, 14 Oct 2008 07:10:59 -0700

I'm afraid I haven't studied the format very thoroughly, but couldn't itdo something like this:

Write a header block and leave spaces for file size and checksum. Leavethese as some constant value that indicates that they are placeholders.Then start writing the data, keeping track of the size of the file as wego. Once we've written all of it, seek back to the header block and fillin the size, and finally compute the checksum.

If we have a stdin file at the time that we create the tar, make sure itgoes last. If we are appending, we know it goes last. That way, if thetar crashes or is interrupted, we only lose the last file, and don't loseour place so we can't find subsequent files.

Future 'append' operations can detect that we've written the "I don't knowyet" value in the header, and can either write over the last file asthough it never happened, or can fix the size and checksum as though thedata we've written is the entirety of the file.

Listing the archive, we can either ignore files we never finished writing,or we can heal it in the same way that append does.

Does that not work? Is the format more complicated than I thought? Arewe avoiding seeks?


Okay, thanks.
--Ryan

On Tue, 14 Oct 2008, Sergey Poznyakoff wrote:

Ryan Jud Hughes <[EMAIL PROTECTED]> ha escrit:

I would rather be able to add it directly to the tar archive from the
stream:

% create_data | tar -rf existing_tar_archive.tar -


There is a serious obstacle to this: tar needs to know beforehand the
size of the file it is archiving.

If not, would this be a friendly addition to tar?


It would, certainly.  But I don't think it is possible to implement due
to above restriction.

Regards,
Sergey

Re: [Bug-tar] Adding non-materialized files from a stream

Reply via email to