I came across characters with incorrect encodings in a web-page I was trying to validate. [Incorrect, that is, in context. I'm sure they were fine in MS Word or wherever they originated.]

One such character is a smart single quote:

  ’

and when I identify it using BBEdit's ASCII table, it says its "code" is 8217 and its "escape" is %2019.

What would be the representation of this character in Perl's \-syntax? I tried \x8217 and \x2019, but they didn't match in regexes. I'm obviously a bit out of my depth in terms of what that character is, to Perl. Do I need to "use utf8"? That didn't seem to work either.
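
For reference, the two numbers are the same value: decimal 8217 is hexadecimal 2019, i.e. U+2019 RIGHT SINGLE QUOTATION MARK. What follows is only a minimal sketch of the two escape forms side by side, not a confirmed fix. It assumes the text has already been decoded into Perl characters rather than raw bytes, and the `$text` sample is hypothetical.

```perl
use strict;
use warnings;

# The character in question, built from its decimal code (8217 == 0x2019).
my $quote = chr(8217);

# Hypothetical sample text standing in for the decoded page content.
my $text = "it" . $quote . "s";

# Without braces, \x takes at most two hex digits,
# so \x2019 is read as "\x20" (a space) followed by the literal "19".
my $no_braces = ($text =~ /\x2019/) ? 1 : 0;

# With braces, \x{2019} names the single code point U+2019.
my $braces = ($text =~ /\x{2019}/) ? 1 : 0;

# Print the code point in hex rather than the raw character,
# which avoids "Wide character" warnings on STDOUT.
printf "decimal 8217 = hex %X\n", ord($quote);
print "no braces matched: $no_braces\n";
print "braces matched:    $braces\n";
```

If the data is still undecoded UTF-8 bytes, the brace form would presumably not match either. As far as I understand it, "use utf8" only declares that the script's own source is UTF-8 and does not decode input.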

TIA,

jh
