On Wed, Jul 27, 2011 at 02:00:13AM +0200, Tony Mechelynck wrote:
> On 27/07/11 00:42, Benjamin R. Haskell wrote:
> >On Tue, 26 Jul 2011, Benjamin R. Haskell wrote:
> >
> >>On Tue, 26 Jul 2011, Benjamin R. Haskell wrote:
> >>
> >>>>From my reading of http://unicode.org/reports/tr11/, none of these
> >>>>characters should be considered to be of ambiguous width (they
> >>>>should all be single-width).
> >>>
> >>>In whatever font rxvt-unicode is currently using, ₁₂₃₄ indeed shows
> >>>up incorrectly. (even outside of Vim). Is this something that changed?
> >>
> >>[Referent was unclear... I meant:]
> >>
> >>Did the ambiguous width status of ₁₂₃₄ change in a recent Unicode
> >>version?
> >>
> >>Maybe Vim just needs its tables updated.
> >
> >Scratch that. It appears the OP is misreading something. The current
> >EastAsianWidth.txt¹ shows that U+2081 through U+2084 are indeed
> >categorized as A (= East Asian Ambiguous), whereas U+2080 and U+2085
> >through U+2089 are N (= Neutral).
> >
> >Vim's correct here. The technical report points out that:
> >
> >"""
> >Examples [of Ambiguous characters] are the basic Greek and Cyrillic
> >alphabet found in East Asian character sets, but also some of the
> >mathematical symbols.
> >"""
> >
> >Just guessing, but perhaps subscripted 1-4 appeared in some legacy CJK
> >character set. (Otherwise I don't see why they'd be marked as anything.)
> >
> 
> Weird. I would have thought that those ten digits would be either
> all N=narrow (as opposed to W=wide) or all A=ambiguous, but not four
> of the one and six of the other. Or... Are subscripts 1-4 sometimes
> used to indicate the four tones of the Mandarin (North Chinese)
> language? (Just a guess.) I've seen superscripts used in some
> romanizations for that purpose (Wade-Giles, I think; hanyu pinyin
> prefers macron, acute, caron and grave, and IIRC EFEO used macron,
> grave, circumflex and acute), but not subscripts.

Really weird. As a native Chinese, I never see those digits used as part
of the Chinese language. And those digits all are only present in the
Chinese official encoding GB18030, but not in previous ones (GB2312 and
GBK). I have no idea about why subscripts 1-4 is special.

-- 
Best regards,
lilydjwg

Linux Vim Python 我的博客
http://lilydjwg.is-programmer.com/

-- 
You received this message from the "vim_dev" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php

Raspunde prin e-mail lui