At 07:19 AM -0700 on 03/11/2011, LuKreme wrote about Re: transliterate into cyrillic:

On 11-Mar-2011, at 03:54, eleven wrote:
 >
  When I view source, the
 garbled Cyrillic is really all encoded entities like this (mixed with
 Latin accented characters): ńęÓă in a page that is
 rendering as charset=iso-8859-1.

Those entities are not Cyrillic though.

I suspect that someone wrote the page in Cyrillic 8859-5 and then uploaded it to a host that only serves 8859-1 and it all got munged to hell.

Can you try some other encodings in firefox (Windows Latin 2 maybe, or maybe even KOI8-R) and see if something renders the page correctly. If it does, then copy and paste the page contents.

FF has an ISO-8859-5 Charset Setting so that can be tried to see if it comes out. The problem is that if it WERE ISO-8859-1 all the codes would still be in the x00-xFF range. Even if it were originally ISO-8859-5, something mangled it since just serving it as ISO-8859-1 would just use those glyphs for the Cyrillic Glyphs.


What does the server claim is the encoding (this is different than what the HTML page claims is the encoding)?


--
You received this message because you are subscribed to the "BBEdit Talk" discussion group on Google Groups.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
<http://groups.google.com/group/bbedit?hl=en>
If you have a feature request or would like to report a problem, please email "[email protected]" rather than posting to the group.
Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>

Reply via email to