On Mon, 2010-09-06 at 15:10 +0200, Pierre Nugues wrote:
> 
> I wrote a simple tokenizer for texts containing Latin9 characters. It
> does not behave as expected with the Swedish text below and I would
> like to find a workaround. 

Add these lines to top of your program:

use strict;
use warnings;

binmode STDIN, 'encoding(utf8)';
binmode STDOUT, 'encoding(utf8)';


-- 
Just my 0.00000002 million dollars worth,
  Shawn

Programming is as much about organization and communication
as it is about coding.

The secret to great software:  Fail early & often.

Eliminate software piracy:  use only FLOSS.


-- 
To unsubscribe, e-mail: beginners-unsubscr...@perl.org
For additional commands, e-mail: beginners-h...@perl.org
http://learn.perl.org/


Reply via email to