On Friday, May 31, 2013 4:39:14 PM UTC+3, Mike Williams wrote:
> Alas I have zero knowledge of Hebrew so will have to bow to your
> superior knowledge. You will know better the use case of finding base
> characters with and without combining marks.
Thanks, I guess...
There are no ligatures in Hebrew. Wait, that's not true -- there's a weird
"aleph+lamed" ligature used in some texts, but it is semantically the same as
"aleph" + "lamed".
> As such you could argue that ligatures should always be treated as the
> sequence of their characters, and so doing s/fi/it/ on ffi should always result
> in the 3 character sequence "fit" replacing the original single
> representative character. Effectively ligatures should always be treated as
> their expanded equivalents.
I agree. Using TeX for instance, one often produces ligatures *on output*.
When screen-scraping e.g. a PDF produced from TeX, I will often see ligatures
("ffi" among others).
If ligatures are always expanded this way, then there is no need for extra
regex options, since the s/// expression you mentioned would necessarily
produce "fit".
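
To make the "expand first, then substitute" idea concrete, here is a minimal
sketch in Python (not Vim code, and not how Vim implements anything). It
assumes that Unicode NFKC normalization is an acceptable way to expand a
ligature; the helper name substitute_expanding_ligatures is made up for this
example.

```python
import unicodedata

def substitute_expanding_ligatures(text, old, new):
    """Expand compatibility ligatures, then do a plain substitution.

    Assumption: NFKC normalization is used as the expansion step. It also
    folds other compatibility characters, so it is broader than "ligatures
    only".
    """
    expanded = unicodedata.normalize("NFKC", text)
    return expanded.replace(old, new)

# U+FB03 is the single-character "ffi" ligature.
result = substitute_expanding_ligatures("\ufb03", "fi", "it")

# U+FB4F is the Hebrew aleph+lamed ligature; it expands to aleph + lamed.
alef_lamed = unicodedata.normalize("NFKC", "\ufb4f")
```

Here result is the 3 character sequence "fit", and alef_lamed is the two
characters U+05D0 U+05DC, i.e. the ligature is treated as its expanded
equivalent.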
It becomes a bit more difficult in the case of combining characters (I can
speak about Hebrew, but Arabic, Thai and others have similar issues).
When *searching* for a base character (as in my example of an "aleph"), I would
expect to always match that character even if it carries combining diacritic
marks. But when substituting, it's not as clear to me that this is what I
would want. So for combining characters, at least, there is a need for the
ability to enable this matching selectively (which is what the \Z flag
currently does).
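
As an illustration of what I mean by selectively matching a base character, a
rough Python sketch follows. It is only a model of the behaviour I would
expect, not a description of Vim's regex engine; the names clusters and
find_base and the ignore_marks switch are invented for the example.

```python
import unicodedata

def clusters(text):
    """Split text into (base, marks) pairs: each base character together
    with the combining marks that follow it."""
    out = []
    for ch in unicodedata.normalize("NFD", text):
        if unicodedata.combining(ch) and out:
            base, marks = out[-1]
            out[-1] = (base, marks + ch)
        else:
            out.append((ch, ""))
    return out

def find_base(text, base, ignore_marks):
    """Return the cluster indexes at which `base` occurs.

    ignore_marks=True  : match the base with or without combining marks.
    ignore_marks=False : match only a bare base character.
    """
    return [i for i, (b, marks) in enumerate(clusters(text))
            if b == base and (ignore_marks or not marks)]

ALEPH = "\u05d0"
QAMATS = "\u05b8"   # a combining vowel point
BET = "\u05d1"

# aleph+qamats, bet, space, bare aleph, bet
sample = ALEPH + QAMATS + BET + " " + ALEPH + BET

loose = find_base(sample, ALEPH, ignore_marks=True)
strict = find_base(sample, ALEPH, ignore_marks=False)
```

With the switch on, both alephs are found (loose is [0, 3]); with it off, only
the bare one is (strict is [3]). For a search I would want the first
behaviour; for a substitution I am not sure, which is why it should stay
selectable.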
> My 2ps worth anyway.
That would be about 10 agorot (0.10 ILS). Coins are still in circulation, but
nobody knows why...
--
You received this message from the "vim_dev" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php