request for categorising unicode test failures on z/OS.

2005-04-13 Thread Rajarshi Das
Hi, I am using perl-5.8.6 and running unicode tests for the same on z/OS unix. I have categorised unicode test failures as below and wish to know which of the below (any or all of them) is/ are the most important to fix from a customer perspective. 1) Unicode properties. E.g. regular

modify PVA.pl (z/OS and perl-5.8.6)

2005-05-12 Thread rajarshi das
Hi, I am running a test which uses PVA_abbr_map and fails while checking the property EastAsianWidth:A. Is it possible (the pl file says not allowed) to manually modify PVA.pl (the order of properties in PVA_abbr_map) and get perl -d to run according to the new order ? e.g. the test originally

chr function on z/OS.

2005-05-24 Thread rajarshi das
Hi, I have a basic doubt regarding unicode and z/OS (ebcdic : ibm-1047). $a = chr(0x00A1); $b = chr(0xA1); Should $a and $b be equal or yield different results on ebcdic ? $b is definitely the character ~. Is $a also the same thing or is it the character equivalent to \xAA ? $a on linux

codepoint value 192 on utf-ebcdic.

2005-06-01 Thread rajarshi das
Hi, I am running this on z/OS and perl-5.8.6. $a = 160; $b = 240; for($i=$a;$i=$b;$i++) { $str = join ,$str, pack 'U*',$i; } if ($str =~ /(\p{inlatin1supplement}+)/) { print \$1 : $1\n; } 1) If I pipe the o/p to od -tc -tx, $1 shows me two bytes for each code pt value (e.g.

no. of bytes for each matching code pt in $1

2005-06-02 Thread rajarshi das
Hi, I run this on z/OS and perl-5.8.6. $a = 128; $b = 256; for ($i=$a;$i=$b;$i++) { $str = join '', $str, pack 'U*', $i; } if ($str =~ /(\p{inlatin1supplement}+)/) { print \$1 : $1\n; } I get the following values : a) for $a = 128 $b = 256 $1 has 1 byte representations for each of

confusing no. of bytes in $1 (z/OS, perl-5.8.6)

2005-06-06 Thread rajarshi das
I run the following script : $a = 160; $b = 256; for ($i=$a;$i=$b;$i++) { $str = join '', $str, pack 'U*', $i; } if ($str =~ /(\p{inlatin1supplement}+)/) { print \$1 : $1\n; } on redirecting above o/p to od -tc -tx, $1 has 2 bytes for all matching code point values (160-255). If the above

modifying the processed unicode value.

2005-06-13 Thread rajarshi das
Hi, This is on z/OS and perl-5.8.6. -- U8 u, *s; 1: s++; 2: u = ((len = 6) ? 0x01 : (0x1F (len-2))); 3: uv = (((u) 5)|(PL_e2utf[(U8)(*s)] ((U8)0x1f))); --- Before starting, s is set to \x8a\x41 (utf-ebcdic bytes

bareword test on ebcdic

2005-06-15 Thread rajarshi das
Hi, Following is a snapshot of a bareword test : -- use utf8; my %hash = (#1090;#1077; = 123); is($hash{#1090;#1077;}, $hash{'#1090;#1077;'}); --- It runs on ascii and passes but fails on ebcdic (z/OS, ibm-1047) with perl-5.8.6. The above two

how to run a bareword test on ebcdic ?

2005-06-22 Thread rajarshi das
hi, A bareword test : - use utf8; my %hash = (#1090;#1077;#1089;#1090; = 123); if (($hash{#1090;#1077;#1089;#1090;}) eq ($hash{'#1090;#1077;#1089;#1090;'})) print ok\n; -- The chars in the hash are \x{0442}, \x{0435},

how to run a bareword test on ebcdic ?

2005-06-22 Thread rajarshi das
hi, A bareword test : - use utf8; my %hash = (#1090;#1077;#1089;#1090; = 123); if (($hash{#1090;#1077;#1089;#1090;}) eq ($hash{'#1090;#1077;#1089;#1090;'})) print ok\n; -- The chars in the hash are \x{0442}, \x{0435},

data written on ebcdic

2005-07-07 Thread rajarshi das
I run the following on an ebcdic platform (perl-5.8.6), $BOM = chr(0xFEFF); open(UTF_PL, :raw:encoding(utf16le), utf.pl) or die utf.pl($enc,$tag): $!; print UTF_PL $BOM; print UTF_PL 1; should the data that is written using PerlLIO_write, be \xFF \xFE \xF1 or should it be \xFF \xFE

gmake (perl-5.8.6) fails on z/OS

2005-07-26 Thread rajarshi das
Hi, I made the following modifications to utf8.c : #ifdef EBCDIC uv = NATIVE_TO_UTF(uv); #endif I get the following output from gmake : -- `sh cflags optimize='-g' utf8.o` utf8.c CCCMD = c89 -DPERL_CORE -c -DMAXSIG=38 -DOEMVS

Re: gmake (perl-5.8.6) fails on z/OS

2005-07-26 Thread rajarshi das
--- Nicholas Clark [EMAIL PROTECTED] wrote: On Tue, Jul 26, 2005 at 07:55:21AM -0700, rajarshi das wrote: The change is in the fn Perl_utf8n_to_uvuni : --- . #define UTF8_WARN_LONG 8 #define UTF8_WARN_

Re: bareword test on ebcdic.

2005-07-26 Thread rajarshi das
--- Nicholas Clark [EMAIL PROTECTED] wrote: On Tue, Jul 26, 2005 at 08:12:16AM -0700, rajarshi das wrote: I basically want to know if there are alternate ways of representing barewords (as I mentioned in question 2) above) ? No. By definition there can not be. You're failing

Re: gmake (perl-5.8.6) fails on z/OS

2005-07-26 Thread rajarshi das
--- Dave Mitchell [EMAIL PROTECTED] wrote: On Tue, Jul 26, 2005 at 03:29:43PM +0100, Nicholas Clark wrote: On Tue, Jul 26, 2005 at 07:22:55AM -0700, rajarshi das wrote: Hi, I made the following modifications to utf8.c : #ifdef EBCDIC uv = NATIVE_TO_UTF(uv); #endif

Re: bareword test on ebcdic.

2005-07-28 Thread rajarshi das
Nicholas Clark [EMAIL PROTECTED] wrote: On Tue, Jul 26, 2005 at 08:48:10AM -0700, rajarshi das wrote: For the code points being tested ("\x{0442}\x{0435}\x{0441}\x{0442}") does the perl source file contain the correct byte sequence in UTF-EBCDIC? Yes it does, since I ran the test,

Re: gmake (perl-5.8.6) fails on z/OS

2005-07-28 Thread rajarshi das
--- Nicholas Clark [EMAIL PROTECTED] wrote: On Thu, Jul 28, 2005 at 02:37:36AM -0700, rajarshi das wrote: However, if I change the first instance to : --- utf8.c 2004-11-17 18:22:09.0 +0530 +++ utf8.c.22005-07-28 13:48:24.0 +0530 @@ -363,6 +363,11

Re: bareword test on ebcdic.

2005-07-31 Thread rajarshi das
--- Yitzchak Scott-Thoennes [EMAIL PROTECTED] wrote: On Thu, Jul 28, 2005 at 12:35:13AM -0700, rajarshi das wrote: Nicholas Clark wrote: If you put those 3 bytes directly between the '{' and '}' characters in the EBCDIC version of that 1 liner, does it also print 3500? I am unable

opcode sequence.

2005-08-19 Thread rajarshi das
Hi, Here's a test on perl-5.8.6 run on z/OS : $a = '0178'; $b = '00FF'; $a1 = pack("U0U*", hex $a); $b1 = pack("U0U*", map { hex } split " ", $b); if (":$b1:" =~ /:[$a1]:/i) print "ok"; The test runs through the opcodes OP_REGCOMP, OP_MATCH and then gets into the opcode OP_COND_EXPR. Now, if

Re: opcode sequence.

2005-08-19 Thread rajarshi das
Dave Mitchell [EMAIL PROTECTED] wrote: On Thu, Aug 18, 2005 at 11:19:16PM -0700, rajarshi das wrote: Hi, Here's a test on perl-5.8.6 run on z/OS : $a = '0178'; $b = '00FF'; $a1 = pack("U0U*", hex $a); $b1 = pack("U0U*", map { hex } split " ", $b); if (&quo

case folding failure on EBCDIC

2005-10-03 Thread rajarshi das
Hi, The following unicode folding test fails on EBCDIC (perl-5.8.6) : $a = '0178'; $b = '00FF'; $a1 = pack(U0U*, hex $code); $b1 = pack(U0U*, map { hex } split , $mapping); if (:$b1: =~ /:[$a1]:/i) { print ok\n; } Alternately, if $a = '0178', and $b = '00DF', the test passes. Why is this so

unpack results on EBCDIC

2005-10-28 Thread rajarshi das
Hi, On EBCDIC, I get the following results on perl-5.8.6 : @t1 = unpack("aU0C/UU", "b\0\341\277\274"); print "t1 : @t1\n"; @t2 = unpack("aU0C/CU", "b\0\341\277\274"); print "t2 : @t2\n"; Gives : t1 : b 0 t2 : b 0 Are these results correct ? Thanks, Rajarshi. Yahoo! FareChase - Search