On Saturday, November 14, 2015 at 3:04:44 PM UTC-8, Random832 wrote: > Bram Moolenaar writes: > > Can we see the start and/or end of a word by recognizing characters? > > Or do we need to recognize words? > > Everything I can find online indicates that word boundary detection (and > line breaking, which requires _syllable_ boundary detection). ICU > provides algorithms for this, which use dictionaries for Thai, Khmer, > Chinese, and Japanese, though I don't know if this is what is used by > the platforms that provide this capability in standard editing controls. > > Cynically, I suppose that users of these scripts are probably used to > minor inconsistencies between different software packages, and that > matching platform behavior exactly is less important than having > reasonable behavior 99% of the time. > > http://userguide.icu-project.org/boundaryanalysis > > One thing I wonder about is, will \< and \> be in scope for such a > feature? I don't think they can be on the same column right now.
Yes! Thank you. It is the ICU algorithms that I am thinking of. Correct. Depending on the language, expectations vary. Lao has been under supported and has only had ICU algorithms for a couple of years now. Thai, on the other hand is closer to the 99.++% Not sure I understand the question about \< and \> being on the same column. Brian -- -- You received this message from the "vim_dev" maillist. Do not top-post! Type your reply below the text you are replying to. For more information, visit http://www.vim.org/maillist.php --- You received this message because you are subscribed to the Google Groups "vim_dev" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
