Detecting/Decoding Unicode Text

2004-04-06 Thread gohaku
Hi everyone, I have some ( actually many ) records in a Database that I want to "clean" Some of these records contain Unicode Text ( Mostly East-Asian ) I have tried matching for "\W+" and "\S+" but that is not what I am looking for because I would like to keep "&" and "-" Thanks in advance. -go

Re: PERL_UNICODE environment variable

2004-04-06 Thread Jarkko Hietaniemi
Jonathan Warden wrote: > As I understand it, the -CSD commandline option should add UTF8 to the > PerlIO layers for all file streams. But it seems only to be applying it to > STDIN and STDOUT, and not other streams. > > Anyone know what's going on? > > my $file = new IO::File($filename) or di

PERL_UNICODE environment variable

2004-04-06 Thread Jonathan Warden
As I understand it, the -CSD commandline option should add UTF8 to the PerlIO layers for all file streams. But it seems only to be applying it to STDIN and STDOUT, and not other streams. Anyone know what's going on? my $file = new IO::File($filename) or die "Error opening delimited file $filen