Hello everyone. I have a number of e-texts in HTML format. Many of them are fine, but several seem to use some odd character set that renders apostrophes and quotation marks, in particular, as strings of stray characters such as asterisks and accented letters. This is particularly annoying because I like to convert the files to .txt format and transfer them to a braille display device so I can read them properly. I don't know if this is related, but I also have a few HTML documents in which a strange character shows up in the middle of words with great frequency. I'm looking at one now, and JAWS identifies it as "character 173". JAWS doesn't speak the character when reading, but it is rendered like a blank space, so words come out all wrong.
What is the quickest way to remove these characters and, in the other case, to replace ý**ý and similar nonsense with proper apostrophes and/or quotes? Can I just change the encoding in the web browser and have the characters come out correctly? If so, how do I check which encoding the documents are using? It seems like the browser itself doesn't recognise the characters, or else why would they be rendered so nonsensically? Thanks for any help; I feel like I should know this stuff already, but I'm drawing a big blank at the moment.
For answers to frequently asked questions about this list visit: http://www.jaws-users.com/help/
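One possible way to approach both problems is a small Python script; the following is only a sketch under stated assumptions, not a tested fix for these particular files. It relies on the fact that "character 173" is the soft hyphen (U+00AD), and it assumes the files are encoded as either UTF-8 or Windows-1252, which is a guess. The function names (`declared_charset`, `decode_bytes`, `clean_text`) and the list of candidate encodings are hypothetical choices made for illustration.

```python
import re

# "Character 173" is U+00AD, the soft hyphen: an invisible hint for line breaking.
SOFT_HYPHEN = "\u00ad"

# Typographic punctuation mapped to plain ASCII for .txt / braille output.
PUNCTUATION_MAP = {
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark / apostrophe
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2013": "-",    # en dash
    "\u2014": "--",   # em dash
    "\u2026": "...",  # ellipsis
}

# Assumption: the files are UTF-8 or Windows-1252. Adjust if yours differ.
CANDIDATE_ENCODINGS = ("utf-8", "cp1252")


def declared_charset(raw):
    """Return the charset named near the top of the HTML bytes, or None."""
    match = re.search(rb'charset\s*=\s*["\']?([A-Za-z0-9_.:-]+)',
                      raw[:4096], re.IGNORECASE)
    return match.group(1).decode("ascii") if match else None


def decode_bytes(raw):
    """Try each candidate encoding in turn; return (text, encoding used)."""
    for encoding in CANDIDATE_ENCODINGS:
        try:
            return raw.decode(encoding), encoding
        except UnicodeDecodeError:
            continue
    # latin-1 maps every byte to a character, so this last resort never fails.
    return raw.decode("latin-1"), "latin-1"


def clean_text(text):
    """Remove soft hyphens and replace typographic punctuation with ASCII."""
    text = text.replace(SOFT_HYPHEN, "")
    for fancy, plain in PUNCTUATION_MAP.items():
        text = text.replace(fancy, plain)
    return text
```

To use it, read a file as bytes (for example `raw = open("book.html", "rb").read()`, where `book.html` is a placeholder name), call `declared_charset(raw)` to see which encoding the document claims, then pass the result of `decode_bytes(raw)` through `clean_text` and write the text back out as UTF-8. If the declared charset and the encoding that actually decodes cleanly disagree, that mismatch is a likely reason the browser shows nonsense in place of quotes.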
