Kalin KOZHUHAROV wrote:

> Do you really expect wget to be able to chew (and not choke) on anything
> that slightly resembles html?
> I don't think anybody will ever try to fix this.

I doubt that Johannes had any idea that MS Word emits pseudo HTML that only
Internet Explorer can read.

> 1. Fix your file. Try to validate it on http://validator.w3.org/ for a
> start.

As you know if you ran Johannes' file through the validator, it does not get
any meaningful output because it does not contain a DOCTYPE. I doubt that
most users of HTML would have a clue how to select an appropriate DOCTYPE to
try to get to the next phase of validation.

Even if they did, they wouldn't have a clue how to fix all the errors in the
HTML generated by MS Word.

> 2. Use html editor, as M$ Word is not.

Many people find themselves with formatted word processing documents that
they want to make available on the web. How are they supposed to know that
Microsoft is misleading them when it says that it will produce an HTML file
for them?

Can *you* recommend a tool that will take a Rich Text Format file (which
most modern word processors will both write and read) and convert it into an
HTML file? Such a tool might solve Johannes' problem unlike the other
"solutions" you suggested.

> 4. Change your OS to something better (What about Linux?)

In the real world, most people don't have that kind of flexibility.

> 6. Think before you post.

A very good suggestion and one that you should follow in the future.

> I am slightly mad...

Obviously. Unfortunately, you should be directing your anger toward Bill
Gates and his minions rather than someone who is forced into using their
products and innocently posts a request to fix what appeared to him to be a
bug with wget. A simple "that's a bug in M$ Word" response would have been
more appropriate.

Tony

Reply via email to