Johannes Berg wrote:
Here's a bug-report for wget 1.8.2 (from debian/unstable):
When downloading the URL below from the server with "--page-requisites",
wget misses the images that are included, probably due to some "Internet
explorer conditional comments" that are included in the file.
The file is exported from MS Word, and is also available for testing at
http://www.upb.de/cs/ag-engels/ag_dt/Courses/Lehrveranstaltungen/WS0203/TSEI/FAQs.htm
Maybe this isn't really a bug in wget but rather in the file, but since this is standard as exported from MS Word I'd like to see wget recognize the images and download them.

Sorry, but I don't think M$ Word is any kind of "standard"! Why?
1. It is a proprietary format, without documentation.
2. It can actually be used only on a proprietary OS: M$ Windows.
3. It is inconsistent between the different versions.
Do you really expect wget to be able to chew (and not choke) on anything that slightly resembles html?
I don't think anybody will ever try to fix this.
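If the images are needed anyway, a workaround outside wget is possible. What follows is only a minimal sketch in Python, assuming the image references show up as src attributes of <img> or VML <v:imagedata> tags somewhere in the raw file, whether or not they sit inside conditional comments. The function name image_urls, the sample markup and the example.com address are made up for illustration and are not taken from the file in question.

```python
import re
from urllib.parse import urljoin

# Matches the src attribute of <img> and VML <v:imagedata> tags anywhere in
# the raw text, including inside comments. Quotes are optional because the
# attribute values may be written unquoted.
SRC_RE = re.compile(
    r'<(?:img|v:imagedata)\b[^>]*?\bsrc\s*=\s*["\']?([^"\'\s>]+)',
    re.IGNORECASE,
)

def image_urls(html, base_url):
    """Return absolute image URLs in document order, without duplicates."""
    seen = []
    for src in SRC_RE.findall(html):
        url = urljoin(base_url, src)
        if url not in seen:
            seen.append(url)
    return seen

# Hypothetical sample in the style of a Word HTML export.
sample = (
    '<!--[if gte vml 1]><v:shape><v:imagedata src="img/a.png"/></v:shape>'
    '<![endif]--><![if !vml]><img width=10 src=img/b.jpg><![endif]>'
)
for url in image_urls(sample, "http://example.com/doc/FAQs.htm"):
    print(url)
```

The printed list could be saved to a file and handed to wget with -i. A regular expression is used here instead of an HTML parser precisely because it ignores comment boundaries; the price is that it will also pick up references in markup that a browser would never render.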
1. Fix your file. Try to validate it on http://validator.w3.org/ for a start.
2. Use an HTML editor, which M$ Word is not.
3. Learn HTML and write your own source in a simple text editor (NoteTab Light is my favorite and it is free).
4. Change your OS to something better (What about Linux?)
5. Learn XML+XSLT+XSL-FO. This can completely replace Word+HTML.
6. Think before you post.
I am slightly mad...
Kalin.
--
||///_ o *****************************
||//,_/> WWW: http://ThinRope.net/
|||\ <"
|||\\ '
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
