Kalin KOZHUHAROV wrote: > Do you really expect wget to be able to chew (and not choke) on anything > that slightly resembles html? > I don't think anybody will ever try to fix this.
I doubt that Johannes had any idea that MS Word emits pseudo HTML that only Internet Explorer can read. > 1. Fix your file. Try to validate it on http://validator.w3.org/ for a > start. As you know if you ran Johannes' file through the validator, it does not get any meaningful output because it does not contain a DOCTYPE. I doubt that most users of HTML would have a clue how to select an appropriate DOCTYPE to try to get to the next phase of validation. Even if they did, they wouldn't have a clue how to fix all the errors in the HTML generated by MS Word. > 2. Use html editor, as M$ Word is not. Many people find themselves with formatted word processing documents that they want to make available on the web. How are they supposed to know that Microsoft is misleading them when it says that it will produce an HTML file for them? Can *you* recommend a tool that will take a Rich Text Format file (which most modern word processors will both write and read) and convert it into an HTML file? Such a tool might solve Johannes' problem unlike the other "solutions" you suggested. > 4. Change your OS to something better (What about Linux?) In the real world, most people don't have that kind of flexibility. > 6. Think before you post. A very good suggestion and one that you should follow in the future. > I am slightly mad... Obviously. Unfortunately, you should be directing your anger toward Bill Gates and his minions rather than someone who is forced into using their products and innocently posts a request to fix what appeared to him to be a bug with wget. A simple "that's a bug in M$ Word" response would have been more appropriate. Tony
