(The following is a message I sent to Zoran off-list, but I figured
folks from the list might already know the answer, so I'm sending it to
the list as well.)
Zoran,
Here's a reproducible example of what I'm talking about:
wats:nscp 75> encoding system
iso8859-1
wats:nscp 76> set u ��Ʈ�̵��
��Ʈ�̵��
wats:nscp 77> set u
��Ʈ�̵��
wats:nscp 78> regexp {^(.*)$} $u junk m
1
wats:nscp 79> set m
¾ÆƮ¹̵ð¾î
See how $m isn't the same as $u?
Also, the default encoding is iso8859-1 ... is this the problem?
Doesn't appear to be:
wats:nscp 9> encoding system
iso8859-1
wats:nscp 10> encoding system utf-8
wats:nscp 11> encoding system
utf-8
wats:nscp 12> set u {��Ʈ�̵��}
��Ʈ�̵��
wats:nscp 13> regexp {^(.*)$} $u junk m
1
wats:nscp 14>
¾ÆƮ¹̵ð¾î
Same behavior.
Any idea what I'm doing wrong?
-- Dossy
--
Dossy Shiobara mail: [EMAIL PROTECTED]
Panoptic Computer Network web: http://www.panoptic.com/
"He realized the fastest way to change is to laugh at your own
folly -- then you can let go and quickly move on." (p. 70)