Same thing goes for strlen(). It won't break, but it reports bytes, so be aware any validation you're doing using strlen may break. Check out mb_strlen() for an alternative.
http://us2.php.net/manual/en/function.mb-strlen.php On Wed, Mar 5, 2008 at 7:25 PM, John Campbell <[EMAIL PROTECTED]> wrote: > 2008/3/5 Cliff Hirsch <[EMAIL PROTECTED]>: > > > Is there a downside to using utf-8? > > No, but there are issues with some string functions. In php4/5 > functions like substr() operate on bytes, and may break UTF-8 strings > that contain multibyte characters. I'd search your code for things > like "substr" and "pos", and make sure they are not operating on > userdata in an unsafe way. > > see: > http://www.phpwact.org/php/i18n/utf-8 > > Regards, > John Campbell > > > _______________________________________________ > New York PHP Community Talk Mailing List > http://lists.nyphp.org/mailman/listinfo/talk > > NYPHPCon 2006 Presentations Online > http://www.nyphpcon.com > > Show Your Participation in New York PHP > http://www.nyphp.org/show_participation.php > -- realm3 web applications [realm3.com] freelance consulting, application development (917) 512-3594 _______________________________________________ New York PHP Community Talk Mailing List http://lists.nyphp.org/mailman/listinfo/talk NYPHPCon 2006 Presentations Online http://www.nyphpcon.com Show Your Participation in New York PHP http://www.nyphp.org/show_participation.php