Hi Tony,

Tony Mechelynck <[email protected]> wrote:
> U2000.pdf from the Unicode site lists them as U+200C ZERO-WIDTH  
> NON-JOINER and U+200D ZERO-WIDTH JOINER (the NO-BREAK SPACE is U+00A0).  

Correct.

> Since they are zero-width characters but not combining characters, some  
> artifact is necessary so that in Vim (a text editor, not a word  
> processor) you could see that they're there and add or remove them if  
> necessary. Vim shows all zero-width Unicode codepoints as <xxxx>, that's  
> documented somewhere. I don't think you can hide them with plain-vanilla  
> Vim, but with Vince-Negri's conceal/ownsyntax page (available as an  
> unofficial patch on the vim_dev Google site) perhaps you could.

Thanks for the info.

> When I need to break a word in the middle after an initial or medial  
> shape (which is rare, but there are two examples on my front page, and I  
> don't know all the fine points of Arabic Unicode), I may use a tatweel  
> just before the break. There are other cases where that wouldn't be  
> practical though; the Arabic dictionary on my table says whether a verb  
> accepts a "person" or a "thing" as an object by means of the isolated  
> and/or final shapes of the letter heh standing alone after the verb.

Although ZWJ is used in Arabic (for instance for writing hijri-qamari
initials), I'm not sure about ZWNJ (but I might be wrong).  ZWNJ is used
a lot in Farsi (see the wikipedia page for an example).

Regards,
Ali

--~--~---------~--~----~------------~-------~--~----~
You received this message from the "vim_dev" maillist.
For more information, visit http://www.vim.org/maillist.php
-~----------~----~----~----~------~----~------~--~---

Raspunde prin e-mail lui