On 12/11/2010 5:38 PM, [email protected] wrote:
Date: Sun, 12 Dec 2010 00:09:22 -0000
From: Kornel Lesi?ski<[email protected]>
To:[email protected]
Subject: Re: [whatwg] Tag Proposal: spelling
Message-ID:<[email protected]>
Content-Type: text/plain; charset=utf-8; format=flowed; delsp=yes

On Sat, 11 Dec 2010 22:23:55 -0000, Charles Pritchard<[email protected]>
wrote:

>  For lack of a better solution, perhaps you can provide an extended
>  language tag:
>
>  <div contenteditable lang="en-GB">
>  <span aria-invalid="false" lang="en-GB-x-John-Grey">John Grey</span>  saw
>  a...
>  </div>
>
>  The aria attribute could let the spelling software know the string is
>  not misspelled, and the lang attribute marks it as an English phrase,
>  helpful with transliteration.
>
>  Does that work?
Transliteration in language code seems like a hack.

Instead of made-up language code, perhaps one of the special ISO 639 codes
would be more appropriate?

http://en.wikipedia.org/wiki/ISO_639-2#Special_situations

The en-GB prefix defines the script, and for phonetics, has a little bit to say about the region. The "x-*" prefix is just intended to mark that it's a special dialect of en-GB.
That "x-*" could signal special spelling, phonetic or transliteration rules.

Many of the whatwg docs have a language tag of "en-US-x-Hixie", telling UAs that the specs are written in an undefined American English dialect.

Consider a multilingual case:

<div contenteditable lang="en-US">
Then he said to me:
<q lang="mul"><span aria-invalid="false" lang="en-GB-x-John-Grey">John Grey</span>  <span 
lang="es-US">es mi hombre.</span></q>
</div>

That gives enough information to let a translation/transliteration service work with the user to make firm decisions. A user who can not read Roman script, English or Spanish would have the "x-*" namespace transliterated, based on British phonetics, and would translate the "es" and "en" portions based on its understanding of US usage of those two languages.

And should entries to the "en-GB-x-John-Grey" space be defined by the user, it would translate based on those definitions.


Reply via email to