Believe me, I'm listening to what y'all are telling me.

It is a tricky business because for a French typist I can use entities and change an � into é but with Chinese everything comes up unreadable (as you've mentioned). I think this is going to end up being a case by case scenario - is this how it's done?

Unfortunately, even though I worked in a translation office for some time, I don't have much experience at this end of things (I only did design back then, no programming). While I can smoothly transition the insertion of xml:lang and lang attributes into my form elements as needed, it just doesn't feel like it's the right thing to do (of course, this feeling has no basis in reality).

There will be a situation where one page will have the header encoding in ZH and an input/text field as EN-US. I'm pretty sure that the field itself won't establish the language parameters that go into the field - the operating system will. I'm confused, aside from hoping that the user will understand what needs to go into the field, how this will work. Or perhaps this is purely a design/usability issue.

One thing I don't understand though, is at what point does the computer actually use the xml:lang attribute? At the input (client-side)? When it gets to the server/table (server-side)? I can type any language I want into the textarea, but what comes out can vary...

And one more thing, my language declaration (in the header)...I've seen so many different kinds and read a few articles on the subject but I don't know exactly where to go on this:

en
en-us
en-gb
zh
zh-hans
etc.

What, where, which formats do I use and stick with if the idea is to support just about any lanugage that's out there (theoretically)?

Thanks for the help...v



On Jun 2, 2005, at 10:46 PM, Juergen Auer wrote:

On 2 Jun 2005 at 16:49, Vaska.WSG wrote:

It's for a multilanguage site and base language will be English.
Everything on the form will be English except the actual input
(textarea).

Hello Vaska,

I think you are mixing two things which should be separated.

The first problem is the language of the page (defined in the header)
or the language of a block (defined like <div lang='en-us' in a
german document). Your page is english - so mark it as english.

The second problem is how to create a non-ascii character. There you
can use a codepage, ISO-8859-6 or something - and then you will have
problems showing characters which are not in this special codepage.
Additionally, you may use the hex-code (&#x0041; = A), but this is
also hard: Read a chinese text coded with entities - its not
possible.

There it's better to use UTF-8 - you will see the characters
directly.

Additionally, you may use some of the mathematical symbols, dingbats
(x2701 - x27BE) or the IPA (international phonetic alphabet). These
with entities - terrible. But you can't define a language - you can
use all these symbols in any language you can write.


Best Regards
Juergen Auer
http://www.sql-und-xml.de/

******************************************************
The discussion list for  http://webstandardsgroup.org/

 See http://webstandardsgroup.org/mail/guidelines.cfm
 for some hints on posting to the list & getting help
******************************************************




******************************************************
The discussion list for  http://webstandardsgroup.org/

See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list & getting help
******************************************************

Reply via email to