Hi,
I am using perl-5.8.6 and running unicode
tests for the same on z/OS unix. I have categorised unicode test failures
as below and wish to know which of the below (any or all of them) is/ are
the most important to fix from a customer perspective.
1) Unicode properties. E.g. regular
Hi,
I am running a test which uses PVA_abbr_map and fails
while checking the property EastAsianWidth:A. Is it
possible (the pl file says not allowed) to manually
modify PVA.pl (the order of properties in
PVA_abbr_map) and get perl -d to run according to the
new order ?
e.g. the test originally
Hi,
I have a basic doubt regarding unicode and z/OS
(ebcdic : ibm-1047).
$a = chr(0x00A1);
$b = chr(0xA1);
Should $a and $b be equal or yield different results
on ebcdic ?
$b is definitely the character ~. Is $a also the
same thing or is it the character equivalent to \xAA
?
$a on linux
Hi,
I am running this on z/OS and perl-5.8.6.
$a = 160;
$b = 240;
for($i=$a;$i=$b;$i++)
{
$str = join ,$str, pack 'U*',$i;
}
if ($str =~ /(\p{inlatin1supplement}+)/)
{
print \$1 : $1\n;
}
1) If I pipe the o/p to od -tc -tx, $1 shows me
two bytes for each code pt value (e.g.
Hi,
I run this on z/OS and perl-5.8.6.
$a = 128;
$b = 256;
for ($i=$a;$i=$b;$i++)
{
$str = join '', $str, pack 'U*', $i;
}
if ($str =~ /(\p{inlatin1supplement}+)/)
{
print \$1 : $1\n;
}
I get the following values :
a) for $a = 128
$b = 256
$1 has 1 byte representations for each of
I run the following script :
$a = 160;
$b = 256;
for ($i=$a;$i=$b;$i++)
{
$str = join '', $str, pack 'U*', $i;
}
if ($str =~ /(\p{inlatin1supplement}+)/)
{
print \$1 : $1\n;
}
on redirecting above o/p to od -tc -tx, $1 has 2 bytes
for all matching code point values (160-255).
If the above
Hi,
This is on z/OS and perl-5.8.6.
--
U8 u, *s;
1: s++;
2: u = ((len = 6) ? 0x01 : (0x1F (len-2)));
3: uv = (((u) 5)|(PL_e2utf[(U8)(*s)]
((U8)0x1f)));
---
Before starting, s is set to \x8a\x41 (utf-ebcdic
bytes
Hi,
Following is a snapshot of a bareword test :
--
use utf8;
my %hash = (#1090;#1077; = 123);
is($hash{#1090;#1077;}, $hash{'#1090;#1077;'});
---
It runs on ascii and passes but fails on ebcdic (z/OS,
ibm-1047) with perl-5.8.6.
The above two
hi,
A bareword test :
-
use utf8;
my %hash = (#1090;#1077;#1089;#1090; = 123);
if (($hash{#1090;#1077;#1089;#1090;}) eq
($hash{'#1090;#1077;#1089;#1090;'})) print ok\n;
--
The chars in the hash are \x{0442}, \x{0435},
hi,
A bareword test :
-
use utf8;
my %hash = (#1090;#1077;#1089;#1090; = 123);
if (($hash{#1090;#1077;#1089;#1090;}) eq
($hash{'#1090;#1077;#1089;#1090;'})) print ok\n;
--
The chars in the hash are \x{0442}, \x{0435},
I run the following on an ebcdic platform
(perl-5.8.6),
$BOM = chr(0xFEFF);
open(UTF_PL, :raw:encoding(utf16le), utf.pl)
or die utf.pl($enc,$tag): $!;
print UTF_PL $BOM;
print UTF_PL 1;
should the data that is written using PerlLIO_write,
be \xFF \xFE \xF1 or should it be \xFF \xFE
Hi,
I made the following modifications to utf8.c :
#ifdef EBCDIC
uv = NATIVE_TO_UTF(uv);
#endif
I get the following output from gmake :
--
`sh cflags optimize='-g' utf8.o` utf8.c
CCCMD = c89 -DPERL_CORE -c -DMAXSIG=38
-DOEMVS
--- Nicholas Clark [EMAIL PROTECTED] wrote:
On Tue, Jul 26, 2005 at 07:55:21AM -0700, rajarshi
das wrote:
The change is in the fn Perl_utf8n_to_uvuni :
---
.
#define UTF8_WARN_LONG 8
#define UTF8_WARN_
--- Nicholas Clark [EMAIL PROTECTED] wrote:
On Tue, Jul 26, 2005 at 08:12:16AM -0700, rajarshi
das wrote:
I basically want to know if there are alternate
ways
of representing barewords (as I mentioned in
question
2) above) ?
No. By definition there can not be.
You're failing
--- Dave Mitchell [EMAIL PROTECTED] wrote:
On Tue, Jul 26, 2005 at 03:29:43PM +0100, Nicholas
Clark wrote:
On Tue, Jul 26, 2005 at 07:22:55AM -0700, rajarshi
das wrote:
Hi,
I made the following modifications to utf8.c :
#ifdef EBCDIC
uv = NATIVE_TO_UTF(uv);
#endif
Nicholas Clark [EMAIL PROTECTED] wrote:
On Tue, Jul 26, 2005 at 08:48:10AM -0700, rajarshi das wrote: For the code points being tested ("\x{0442}\x{0435}\x{0441}\x{0442}") does the perl source file contain the correct byte sequence in UTF-EBCDIC? Yes it does, since I ran the test,
--- Nicholas Clark [EMAIL PROTECTED] wrote:
On Thu, Jul 28, 2005 at 02:37:36AM -0700, rajarshi
das wrote:
However, if I change the first instance to :
--- utf8.c 2004-11-17 18:22:09.0
+0530
+++ utf8.c.22005-07-28 13:48:24.0
+0530
@@ -363,6 +363,11
--- Yitzchak Scott-Thoennes [EMAIL PROTECTED] wrote:
On Thu, Jul 28, 2005 at 12:35:13AM -0700, rajarshi
das wrote:
Nicholas Clark wrote:
If you put those 3 bytes directly between the '{'
and '}' characters in
the EBCDIC version of that 1 liner, does it also
print 3500?
I am unable
Hi,
Here's a test on perl-5.8.6 run on z/OS :
$a = '0178';
$b = '00FF';
$a1 = pack("U0U*", hex $a);
$b1 = pack("U0U*", map { hex } split " ", $b);
if (":$b1:" =~ /:[$a1]:/i)
print "ok";
The test runs through the opcodes OP_REGCOMP, OP_MATCH and then gets into the opcode OP_COND_EXPR.
Now, if
Dave Mitchell [EMAIL PROTECTED] wrote:
On Thu, Aug 18, 2005 at 11:19:16PM -0700, rajarshi das wrote: Hi, Here's a test on perl-5.8.6 run on z/OS : $a = '0178'; $b = '00FF'; $a1 = pack("U0U*", hex $a); $b1 = pack("U0U*", map { hex } split " ", $b); if (&quo
Hi,
The following unicode folding test fails on EBCDIC
(perl-5.8.6) :
$a = '0178';
$b = '00FF';
$a1 = pack(U0U*, hex $code);
$b1 = pack(U0U*, map { hex } split , $mapping);
if (:$b1: =~ /:[$a1]:/i) {
print ok\n;
}
Alternately, if $a = '0178', and $b = '00DF', the test
passes.
Why is this so
Hi,
On EBCDIC, I get the following results on perl-5.8.6 :
@t1 = unpack("aU0C/UU", "b\0\341\277\274");
print "t1 : @t1\n";
@t2 = unpack("aU0C/CU", "b\0\341\277\274");
print "t2 : @t2\n";
Gives :
t1 : b 0
t2 : b 0
Are these results correct ?
Thanks,
Rajarshi.
Yahoo! FareChase - Search
22 matches
Mail list logo