On 6 Sep, 15:25, shawnhco...@gmail.com (Shawn H Corey) wrote:
> On Mon, 2010-09-06 at 15:10 +0200, Pierre Nugues wrote:
>
> > I wrote a simple tokenizer for texts containing Latin9 characters. It
> > does not behave as expected with the Swedish text below and I would
> > like to find a workaround.
>
> Add these lines to top of your program:
>
> use strict;
> use warnings;
>
> binmode STDIN, 'encoding(utf8)';
> binmode STDOUT, 'encoding(utf8)';

The Perl source code itself also contains UTF-8 characters, so you should
add this as well:

use utf8;
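
Putting the pieces together, here is a minimal sketch of what the top of such a program might look like. The tokenize() function and the sample sentence are my own illustration, not the original poster's code, and I am assuming the script file is saved as UTF-8. I have written the layer as ':encoding(UTF-8)', with the leading colon, which is the form shown in the binmode documentation.

```perl
use strict;
use warnings;
use utf8;    # the source file itself contains UTF-8 literals

# Encode everything printed to STDOUT as UTF-8.
binmode STDOUT, ':encoding(UTF-8)';

# Hypothetical tokenizer (an assumption, not the original code):
# a run of letters is one token, any other non-space character
# is a token of its own.
sub tokenize {
    my ($text) = @_;
    my @tokens = $text =~ /\p{L}+|[^\p{L}\s]/g;
    return @tokens;
}

my @tokens = tokenize('Göran åt en räksmörgås.');
print join('|', @tokens), "\n";    # prints Göran|åt|en|räksmörgås|.
```

Without use utf8, Perl would treat the literal as a string of bytes, and \p{L} would no longer see å, ä and ö as single letters. If the input really is Latin9 rather than UTF-8, the input layer would presumably need to be ':encoding(iso-8859-15)' instead.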