character occurs
before you can insert a filter.
Any advice? Should Perl ignore a UTF8 BOM on its own?
Martin Hosken
Dear Tim,
"CESU-8 defines an encoding scheme for Unicode identical to UTF-8
except for its representation of supplementary characters. In CESU-8,
supplementary characters are represented as six-byte sequences
resulting from the transformation of each UTF-16 surrogate code
unit into an eight-bit fo
Does anyone have any experience of a bug I've encountered in 5.8.7 only
whereby occasionally (which is what makes it hard to report) this type
of code:
$fh = IO::File->new("< input.dat") || die;
binmode $fh;
# lots of code, even to different package
$fh->read($dat, $num_bytes);
does utf-8
9a-fA-F]{4,6}/ to allow for multi-lingual plane stuff?
Martin Hosken
Dear Georg,
> Isn't it about time to find a good name for crippled character sets
> with ordinals below 256 only? Otherwise Unicode characters will
> continue to be considered the special case...
>
Legacy encodings.
Nicely derogatory and generally accepted.
Yours,
Martin
Dear Gemma,
> I have downloaded ActiveState for windows, and cannot get scripts to run
> other then the perl -v and perl -h command.
> The hello program was saved to the desktop in a folder called "perlscripts".
> The commands typed in the console window are as follows.
> cd \desktop
> cd \perls
Dear All,
Is this me or is it a problem in 5.10?
Code that previously worked for me in 5.8 has stopped working in 5.10. The best
way to show this (not that I have 5.8 now) is that:
perl -e 'use utf8; $t="abc"; pos($t) = 1; print scalar $t =~ m/a\Gb/gcs;'
prints 1 and:
perl -e 'use utf8; $t="\
Dear Karl,
> >> I'm afraid that when it comes to normalization in Perl5, you have to
> >> do it yourself. I hear that Perl6 is much friendlier in this regard,
> >> but I have no personal experience with it. Your $unistring is in
> >> whatever normalization you made it when you typed it into your