--- Nicholas Clark <[EMAIL PROTECTED]> wrote:
> On Tue, Jul 26, 2005 at 08:12:16AM -0700, rajarshi das wrote:
>
> > I basically want to know if there are alternate ways
> > of representing barewords (as I mentioned in question
> > 2) above)?
>
> No. By definition there can not be.
> You're failing to grasp what is meant by "bareword".
> There is only one representation.
>
> > Also, any pointers that you have regarding where to
> > look to fix this?
>
> Not much better than "in toke.c or utf8.c"
>
> However, based on a comment I've spotted at the top of utfebcdic.h
> I *think* that the internal encoding of perl on an EBCDIC system is
> UTF-EBCDIC rather than UTF-8. The byte sequence in the source file for
> the bareword will need to be valid UTF-EBCDIC.
>
> For the code points being tested ("\x{0442}\x{0435}\x{0441}\x{0442}")
> does the perl source file contain the correct byte sequence in
> UTF-EBCDIC?

Yes it does, since I ran the test

    if (($hash{"\x{0442}\x{0435}\x{0441}\x{0442}"}) eq
        ($hash{eval '"\x{0442}\x{0435}\x{0441}\x{0442}"'})) {
        print "ok\n";
    }

and the test ran fine, if that is what you mean by the source file
containing the correct byte sequence. Or am I mistaken?

> Does the byte sequence in UTF-EBCDIC for those 4 code points differ
> from the byte sequence in UTF-8?

Yes, the byte sequence for the 4 code points in UTF-EBCDIC is different
from the sequence in UTF-8.

> Does the source file happen to have the UTF-8 byte sequence?

It has the UTF-EBCDIC byte sequence on the EBCDIC platform.

> If so, *that* would explain the failures, and be the thing that needs
> correcting. The test file would need if/else with a different test on
> EBCDIC.

What would you suggest be put in the if/else?

> Nicholas Clark

Thanks,
Rajarshi.
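
For reference, one possible shape for the if/else discussed above, as a minimal sketch rather than the actual fix to the test file. It assumes the usual `ord("A") == 193` check to detect an EBCDIC build; the `report` helper and the choice of which checks go in each branch are hypothetical. The expected UTF-EBCDIC bytes are not given in this thread, so that branch is left as a skip instead of a guess.

```perl
use strict;
use warnings;

# ord("A") is 65 on ASCII-based platforms and 193 on EBCDIC code pages.
my $is_ebcdic = ord("A") == 193;

# Key built from code point escapes, so it does not depend on how the
# source file itself happens to be encoded.
my $key  = "\x{0442}\x{0435}\x{0441}\x{0442}";
my %hash = ($key => 1);

# Hypothetical helper: print one TAP-style result line, return the result.
sub report {
    my ($ok, $name) = @_;
    my $status = $ok ? "ok" : "not ok";
    print "$status - $name\n";
    return $ok;
}

# Platform-independent check, along the lines of the test quoted above.
my $evaled = eval '"\x{0442}\x{0435}\x{0441}\x{0442}"';
report(exists $hash{$evaled}, "escape-built key found in hash");

# Platform-specific check on the encoded bytes of the key.
my $bytes = $key;
utf8::encode($bytes);    # characters -> perl's internal multi-byte encoding

if ($is_ebcdic) {
    # Assumption: the expected UTF-EBCDIC byte string would be compared
    # here. It is not stated in the thread, so nothing is filled in.
    print "ok # skip expected UTF-EBCDIC bytes not filled in\n";
} else {
    # UTF-8 encoding of U+0442 U+0435 U+0441 U+0442.
    report($bytes eq "\xD1\x82\xD0\xB5\xD1\x81\xD1\x82",
           "key encodes to the expected UTF-8 bytes");
}
```

The point of the split is that any test depending on literal bytes (including a bareword written directly in the source file) lives in a platform-specific branch, while checks written with `\x{...}` escapes can stay shared.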