Dodging BOMs

2000-04-27 Thread Martin Hosken
character occurs before you can insert a filter. Any advice? Should Perl ignore a UTF8 BOM on its own? Martin Hosken

Re: AL32UTF8

2004-04-30 Thread Martin Hosken
Dear Tim, "CESU-8 defines an encoding scheme for Unicode identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences resulting from the transformation of each UTF-16 surrogate code unit into an eight-bit fo

perlio probs

2005-09-22 Thread Martin Hosken
Does anyone have any experience of a bug I've encountered in 5.8.7 only whereby occasionally (which is what makes it hard to report) this type of code: $fh = IO::File->new("< input.dat") || die; binmode $fh; # lots of code, even to different package $fh->read($dat, $num_bytes); does utf-8

Re: CGI::Util unescape() after escape() loses utf8 flag

2005-09-28 Thread Martin Hosken
9a-fA-F]{4,6}/ to allow for multi-lingual plane stuff? Martin Hosken

Re: good name for characters matching [^\0-\377]?

2007-10-18 Thread Martin Hosken
Dear Georg, > Isn't it about time to find a good name for crippled character sets > with ordinals below 256 only? Otherwise Unicode characters will > continue to be considered the special case... > Legacy encodings. Nicely derogatory and generally accepted. Yours, Martin

Re: loading necessary module to Interpreter

2008-09-11 Thread Martin Hosken
Dear Gemma, > I have downloaded ActiveState for windows, and cannot get scripts to run > other then the perl -v and perl -h command. > The hello program was saved to the desktop in a folder called "perlscripts". > The commands typed in the console window are as follows. > cd \desktop > cd \perls

\G problems in 5.10

2009-06-04 Thread Martin Hosken
Dear All, Is this me or is it a problem in 5.10? Code that previously worked for me in 5.8 has stopped working in 5.10. The best way to show this (not that I have 5.8 now) is that: perl -e 'use utf8; $t="abc"; pos($t) = 1; print scalar $t =~ m/a\Gb/gcs;' prints 1 and: perl -e 'use utf8; $t="\

Re: Comparing inputs with source strings

2016-05-11 Thread Martin Hosken
Dear Karl, > >> I'm afraid that when it comes to normalization in Perl5, you have to > >> do it yourself. I hear that Perl6 is much friendlier in this regard, > >> but I have no personal experience with it. Your $unistring is in > >> whatever normalization you made it when you typed it into your