Re: [Bug-wget] WARC, new version

2011-11-05 Thread Giuseppe Scrivano
Hey Gijs, I have added a ChangeLog entry and pushed the change. Thanks! Giuseppe Gijs van Tulder gvtul...@gmail.com writes: lovely. I am going to push it soon with some small adjustments. That's good to hear. There's one other small adjustment that you may want to make, see the

Re: [Bug-wget] WARC, new version

2011-11-04 Thread Giuseppe Scrivano
Gijs van Tulder gvtul...@gmail.com writes: Hi Giuseppe, * I've changed the configure.ac and src/Makefile.am. * I've added a ChangeLog entry. lovely. I am going to push it soon with some small adjustments. Thanks for the great work. Whenever it happens to be in the same place, I'll buy you

Re: [Bug-wget] WARC, new version

2011-11-04 Thread Gijs van Tulder
lovely. I am going to push it soon with some small adjustments. That's good to hear. There's one other small adjustment that you may want to make, see the attached patch. One of the WARC functions uses the basename function, which causes problems on OS X. Including libgen.h and strdup-ing

Re: [Bug-wget] WARC, new version

2011-10-30 Thread Giuseppe Scrivano
Gijs van Tulder gvtul...@gmail.com writes: === modified file 'bootstrap.conf' --- bootstrap.conf2011-08-11 12:23:39 + +++ bootstrap.conf2011-10-21 19:24:18 + @@ -28,6 +28,7 @@ accept alloca announce-gen +base32 bind c-ctype clock-time @@ -49,6 +50,7 @@ mbtowc

Re: [Bug-wget] WARC, new version

2011-10-30 Thread David H. Lipman
From: Giuseppe Scrivano gscriv...@gnu.org I have seen WARC mentioned but have not seen a definition. What is WARC ? What is WARC used for ? Windows or 'nix ? What are its benefits, etc ? -- Dave Multi-AV Scanning Tool - http://multi-av.thespykiller.co.uk

Re: [Bug-wget] WARC, new version

2011-10-30 Thread Gijs van Tulder
Hi David, David H. Lipman wrote: I have seen WARC mentioned but have not seen a definition. WARC (Web ARChive, ISO 28500:2009) [1] is a file format for storing web resources. It is used for making archives of web sites. The Internet Archive, for example, uses it as the file format for

Re: [Bug-wget] WARC, new version

2011-10-30 Thread David H. Lipman
From: Gijs van Tulder gvtul...@gmail.com Hi David, David H. Lipman wrote: I have seen WARC mentioned but have not seen a definition. WARC (Web ARChive, ISO 28500:2009) [1] is a file format for storing web resources. It is used for making archives of web sites. The Internet Archive, for

Re: [Bug-wget] WARC, new version

2011-10-23 Thread Giuseppe Scrivano
Gijs van Tulder gvtul...@gmail.com writes: Hi all, Based on the comments by Giuseppe and Ángel I've revised the implementation of the wget WARC extenstion. I've attached a patch. 1. It's no longer based on the warctools library. Instead, I've written a couple of new WARC-writing functions,