you might try to identify hanzi as anything above 0xFF: assuming 'nocompatible' mode, searching on [^\x00-\xFF] might do it: this regexp atom matches anything above U+00FF, i.e., any hanzi, but (this is the caveat) also any non-Latin letter, any non-Western Latin letter, and the above-mentioned œ Œ and Ÿ which are Western (French, at least, has a use for them) but not included in Latin 1.
Another caveat is that things such as spaces and punctuation can be used in Chinese text, which are in that lower range (though there are alternatives in the CJK Unicode regions too). This may or may not matter for your text and application, of course. Ben. -- You received this message from the "vim_use" maillist. Do not top-post! Type your reply below the text you are replying to. For more information, visit http://www.vim.org/maillist.php
