--- In [email protected], "entropyreduction" 
<alancampbelllists+ya...@...> wrote:
>
> 
> In the meantime, if you feel like experimenting, construct a utf8
> string from several higher-plane characters, then do
> unicode.from_utf8(xxxx).to_utf8 and see if the same thing comes
> out as went in.
>

Yes, that works. Also, taking higher-plane characters that render properly in
the table in Firefox, I can put them into my own HTML doc as UTF-8 and see them
equally well. They show up as box characters in unicode.messagebox or IE,
though. I suppose this is a font or font-script issue.

e.g. (with a recent PowerPro test version):

local str="2F862" ;;a CJK Compatibility Ideograph
local instr="\xf0\xaf\xa1\xa2"
;teststr(str, instr)
str ++=" and 10401" ;;DESERET CAPITAL LETTER LONG E
instr   ++="\xf0\x90\x90\x81"
teststr(str, instr)
quit

Function TestStr(str, instr)
local outstr=unicode.from_utf8(instr)
if (outstr.to_utf8 !== instr) do
  unicode.messagebox("OK", str++" versions not equal")
else
local var=???xend
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="generator" content="PowerPro 4.9k">
<title>*** Powerpro UTF8 String ***</title>
</head>
<body>
<h1>&(outstr.to_utf8)</h1>
</body>
</html>
xend
  file.writeall(pprofolder++?"testutf8.htm", var)
  do(pprofolder++?"testutf8.htm") ;; browser shows good chars from different planes
  unicode.messagebox("OK", str++": "++outstr) ;; shows the handle, not the string
  unicode.messagebox("OK", outstr) ;; shows box chars
endif
outstr.destroy
quit
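
For anyone who wants to cross-check the two byte sequences outside PowerPro,
here is a minimal Python sketch of the same round trip. Python is only used for
illustration and is not part of the PowerPro test above; the names samples,
round_trip and utf16_units are hypothetical helpers. The UTF-16 part is only
relevant on the assumption that the unicode plugin holds strings as UTF-16
internally, which is not confirmed here.

```python
import unicodedata

# The two UTF-8 byte sequences used in the PowerPro test above.
samples = {
    "2F862": b"\xf0\xaf\xa1\xa2",  # a CJK Compatibility Ideograph
    "10401": b"\xf0\x90\x90\x81",  # DESERET CAPITAL LETTER LONG E
}


def round_trip(raw: bytes) -> bool:
    """Decode UTF-8 and re-encode; True if the bytes come back unchanged."""
    return raw.decode("utf-8").encode("utf-8") == raw


def utf16_units(raw: bytes) -> list:
    """Return the UTF-16 code units for the text held in the UTF-8 bytes.

    Assumption: a Windows-side string would be UTF-16, so a character
    above U+FFFF is stored as two code units (a surrogate pair).
    """
    data = raw.decode("utf-8").encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


for label, raw in samples.items():
    ch = raw.decode("utf-8")
    print(
        label,
        f"U+{ord(ch):04X}",
        unicodedata.name(ch, "(no name found)"),
        "round trip ok:", round_trip(raw),
        "UTF-16 units:", [hex(u) for u in utf16_units(raw)],
    )
```

Both sequences should report a clean round trip, and each character takes two
UTF-16 code units. If the strings really are UTF-16 underneath, a control that
treats one code unit as one character, or a font without those glyphs, could
plausibly produce the boxes seen in unicode.messagebox and IE.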
