Hello, I'm using this wget 1.15 compiled with MinGW:
http://sourceforge.net/projects/getgnuwin32/.
I'm noticing that when I use the --warc-file parameter in combination
with the --directory-prefix parameter, the standard wget output is saved
to the location specified by the directory prefix,
Hi Gijs,
Gijs van Tulder gvtul...@gmail.com writes:
can you please send a complete diff against the current development
tree version?
Here's the diff of the WARC additions (1.9MB zipped) to revision 2565:
http://dl.dropbox.com/u/365100/wget_warc-20110926-complete.patch.bz2
the patch is
Giuseppe Scrivano wrote:
the patch is huge and I think we don't want to add so many files into
the wget tree. Can't we assume the user will install the warc tools by
herself and let configure check if they are installed or not? This will
require some more work but the result will be much
Gijs van Tulder gvtul...@gmail.com writes:
Hi.
It's been a while since we've discussed the WARC addition to Wget. Is
there anything I can help with?
can you please send a complete diff against the current development tree
version?
I'll take a look at it ASAP.
Thanks,
Giuseppe
can you please send a complete diff against the current development
tree version?
Here's the diff of the WARC additions (1.9MB zipped) to revision 2565:
http://dl.dropbox.com/u/365100/wget_warc-20110926-complete.patch.bz2
Thanks,
Gijs
Hi.
It's been a while since we've discussed the WARC addition to Wget. Is
there anything I can help with?
Gijs
Gijs van Tulder gvtul...@gmail.com writes:
It would be cool if Wget could become one of these tools. Already the
Swiss army knife for mirroring websites, the one thing that Wget is
missing is a good way to store these mirrors. The current output of
--mirror is not sufficient for archival
Giuseppe Scrivano writes:
The implementation makes use of the open source WARC Tools library
(Apache License 2.0):
http://code.google.com/p/warc-tools/
how much code is really needed from that library? I wonder if we can
avoid this dependency altogether.
The library comes with some
Hi,
I'd like to propose a new feature that allows Wget to make WARC files.
Perhaps you're already familiar with it, but in short: WARC is a file
format for web archives. In a single WARC file, you can store every file
of the website, plus the HTTP request and response headers and other
That sounds awesome! You have my vote... :)
On Tue, Aug 9, 2011 at 4:49 AM, Gijs van Tulder gvtul...@gmail.com wrote:
Hi,
I'd like to propose a new feature that allows Wget to make WARC files.
Perhaps you're already familiar with it, but in short: WARC is a file
format for web archives.
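
For readers who have not seen the format, here is a rough sketch of what a single uncompressed WARC/1.0 "response" record looks like on disk. It is only an illustration and is not taken from the patch discussed in this thread; the function name build_warc_record and the example URL are made up for the example.

```python
import uuid
from datetime import datetime, timezone


def build_warc_record(target_uri: str, http_response: bytes) -> bytes:
    """Build one uncompressed WARC/1.0 'response' record as bytes.

    Hypothetical helper for illustration only; not Wget's implementation.
    """
    headers = [
        ("WARC-Type", "response"),
        ("WARC-Record-ID", f"<urn:uuid:{uuid.uuid4()}>"),
        ("WARC-Date", datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("WARC-Target-URI", target_uri),
        ("Content-Type", "application/http; msgtype=response"),
        # Content-Length counts the bytes of the record block only.
        ("Content-Length", str(len(http_response))),
    ]
    # Version line, named header fields, then a blank line before the block.
    head = "WARC/1.0\r\n" + "".join(f"{k}: {v}\r\n" for k, v in headers) + "\r\n"
    # Every record is terminated by two CRLF pairs.
    return head.encode("utf-8") + http_response + b"\r\n\r\n"


if __name__ == "__main__":
    body = b"HTTP/1.1 200 OK\r\nContent-Length: 2\r\n\r\nhi"
    print(build_warc_record("http://example.com/", body).decode("utf-8"))
```

The block is the raw HTTP response (status line, headers and body), which is how the request and response headers end up stored next to the file contents.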