On Thu, Oct 29, 2009 at 08:53:47AM +0800, Brian Wang wrote: > On Wed, Oct 28, 2009 at 9:11 PM, Joerg Sonnenberger > <jo...@britannica.bec.de> wrote: > > On Wed, Oct 28, 2009 at 01:41:01PM +0100, Peter Wehrfritz wrote: > >> Your patch assumes that an UTF8 character initiates a new word. That is > >> wrong for the most European language, if not for all. So there must be > >> another solution. > > > > The best bet you can do is likely iswblank(x) || wcwidth(x) == 0. There > > is a common rule to include zero width characters for this purpose for > > languages like Chinese or Japanese, the only alternative would be > > dictionary based. > > Could you elaborate more about this? > Do you mean that the programmer needs to put a non-printable character > after each Chinese/European character?
A non-printable character is a different concept from a zero width character. Think of the zero width character as hyphenation hint or word border. The rule of using blanks works well for Indogermanic languages. The zero width rule works as fallback for other languages that don't have a natural word boundary. The main point is that it doesn't affect the visual representation. Joerg ------------------------------------------------------------------------------ Come build with us! The BlackBerry(R) Developer Conference in SF, CA is the only developer event you need to attend this year. Jumpstart your developing skills, take BlackBerry mobile applications to market and stay ahead of the curve. Join us from November 9 - 12, 2009. Register now! http://p.sf.net/sfu/devconference _______________________________________________ enlightenment-devel mailing list enlightenment-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/enlightenment-devel