[Bug-wget] WARC output not saved to dir specified by directory prefix

2014-01-29 Thread Nicholas Taylor
Hello, I'm using this wget 1.15 compiled with MinGW: http://sourceforge.net/projects/getgnuwin32/. I'm noticing that when I use the --warc-file parameter in combination with the --directory-prefix parameter, the standard wget output is saved to the location specified by the directory prefix,

Re: [Bug-wget] WARC output

2011-10-08 Thread Giuseppe Scrivano
Hi Gijs, Gijs van Tulder gvtul...@gmail.com writes: can you please send a complete diff against the current development tree version? Here's the diff of the WARC additions (1.9MB zipped) to revision 2565: http://dl.dropbox.com/u/365100/wget_warc-20110926-complete.patch.bz2 the patch is

Re: [Bug-wget] WARC output

2011-10-08 Thread Ángel González
Giuseppe Scrivano wrote: the patch is huge and I think we don't want to add some many files into the wget tree. Can't we assume the user will install the warc tools by herself and let configure check if they are installed or not? This will require some more work but the result will be much

Re: [Bug-wget] WARC output

2011-09-26 Thread Giuseppe Scrivano
Gijs van Tulder gvtul...@gmail.com writes: Hi. It's been a while since we've discussed the WARC addition to Wget. Is there anything I can help with? can you please send a complete diff against the current development tree version? I'll take a look at it ASAP. Thanks, Giuseppe

Re: [Bug-wget] WARC output

2011-09-26 Thread Gijs van Tulder
can you please send a complete diff against the current development tree version? Here's the diff of the WARC additions (1.9MB zipped) to revision 2565: http://dl.dropbox.com/u/365100/wget_warc-20110926-complete.patch.bz2 Thanks, Gijs

Re: [Bug-wget] WARC output

2011-09-25 Thread Gijs van Tulder
Hi. It's been a while since we've discussed the WARC addition to Wget. Is there anything I can help with? Gijs

Re: [Bug-wget] WARC output

2011-08-10 Thread Giuseppe Scrivano
Gijs van Tulder gvtul...@gmail.com writes: It would be cool if Wget could become one of these tools. Already the Swiss army knife for mirroring websites, the one thing that Wget is missing is a good way to store these mirrors. The current output of --mirror is not sufficient for archival

Re: [Bug-wget] WARC output

2011-08-10 Thread Gijs van Tulder
Giuseppe Scrivano writes: The implementation makes use of the open source WARC Tools library (Apache License 2.0): http://code.google.com/p/warc-tools/ how much code is really needed from that library? I wonder if we can avoid this dependency at all. The library comes with some

[Bug-wget] WARC output

2011-08-09 Thread Gijs van Tulder
Hi, I'd like to propose a new feature that allows Wget to make WARC files. Perhaps you're already familiar with it, but in short: WARC is a file format for web archives. In a single WARC file, you can store every file of the website, plus the HTTP request and response headers and other

Re: [Bug-wget] WARC output

2011-08-09 Thread Patrick Steil
That sounds awesome! You have my vote... :) On Tue, Aug 9, 2011 at 4:49 AM, Gijs van Tulder gvtul...@gmail.com wrote: Hi, I'd like to propose a new feature that allows Wget to make WARC files. Perhaps you're already familiar with it, but in short: WARC is a file format for web archives.