Hi Tony, Tony Mechelynck <[email protected]> wrote: > U2000.pdf from the Unicode site lists them as U+200C ZERO-WIDTH > NON-JOINER and U+200D ZERO-WIDTH JOINER (the NO-BREAK SPACE is U+00A0).
Correct. > Since they are zero-width characters but not combining characters, some > artifact is necessary so that in Vim (a text editor, not a word > processor) you could see that they're there and add or remove them if > necessary. Vim shows all zero-width Unicode codepoints as <xxxx>, that's > documented somewhere. I don't think you can hide them with plain-vanilla > Vim, but with Vince-Negri's conceal/ownsyntax page (available as an > unofficial patch on the vim_dev Google site) perhaps you could. Thanks for the info. > When I need to break a word in the middle after an initial or medial > shape (which is rare, but there are two examples on my front page, and I > don't know all the fine points of Arabic Unicode), I may use a tatweel > just before the break. There are other cases where that wouldn't be > practical though; the Arabic dictionary on my table says whether a verb > accepts a "person" or a "thing" as an object by means of the isolated > and/or final shapes of the letter heh standing alone after the verb. Although ZWJ is used in Arabic (for instance for writing hijri-qamari initials), I'm not sure about ZWNJ (but I might be wrong). ZWNJ is used a lot in Farsi (see the wikipedia page for an example). Regards, Ali --~--~---------~--~----~------------~-------~--~----~ You received this message from the "vim_dev" maillist. For more information, visit http://www.vim.org/maillist.php -~----------~----~----~----~------~----~------~--~---
