At 07:19 AM -0700 on 03/11/2011, LuKreme wrote about Re:
transliterate into cyrillic:
On 11-Mar-2011, at 03:54, eleven wrote:
>
When I view source, the
garbled Cyrillic is really all encoded entities like this (mixed with
Latin accented characters): ńęÓă in a page that is
rendering as charset=iso-8859-1.
Those entities are not Cyrillic though.
I suspect that someone wrote the page in Cyrillic 8859-5 and then
uploaded it to a host that only serves 8859-1 and it all got munged
to hell.
Can you try some other encodings in firefox (Windows Latin 2 maybe,
or maybe even KOI8-R) and see if something renders the page
correctly. If it does, then copy and paste the page contents.
FF has an ISO-8859-5 Charset Setting so that can be tried to see if
it comes out. The problem is that if it WERE ISO-8859-1 all the codes
would still be in the x00-xFF range. Even if it were originally
ISO-8859-5, something mangled it since just serving it as ISO-8859-1
would just use those glyphs for the Cyrillic Glyphs.
What does the server claim is the encoding (this is different than
what the HTML page claims is the encoding)?
--
You received this message because you are subscribed to the
"BBEdit Talk" discussion group on Google Groups.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
<http://groups.google.com/group/bbedit?hl=en>
If you have a feature request or would like to report a problem,
please email "[email protected]" rather than posting to the group.
Follow @bbedit on Twitter: <http://www.twitter.com/bbedit>