So the magic formula appears to be this:

1) Page containing the input form is set to UTF-8
2) Form itself is set to accept-charset=utf-8, or does not specify
3) Display page is set to UTF-8

So if you input text 3 times, in 3 languages, and the input page is set for UTF-8, then the display page will display all of them properly on a single page. Any text input _prior_ to the input page being in UTF-8 will still not display correctly...it would appear that the text would have to be either converted before display (some mb_xyz() function?), or would have to be re-entered on a UTF-8 page. Otherwise not backwards compatible.

I don't have the Multi-byte extension compiled in, and have no experience with it, but can any of its functions determine the encoding type of input text? I'm wondering what will happen when I need to support Japanese or something that doesn't fall within UTF-8...what then?

Cheers,
spud.

On Monday, December 2, 2002, at 10:43 AM, Steve Vernon wrote:

My header includes <meta http-equiv='Content-Type' content='text/html;
charset=utf-8'>. And includes lines such as this <meta
http-equiv='Content-Language' content='en-uk'>, for each language (not sure
if needed though.


-------------------------------------------------------------------
a.h.s. boy
spud(at)nothingness.org            "as yes is to if,love is to yes"
http://www.nothingness.org/
-------------------------------------------------------------------


--
PHP Internationalization Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to