Hi,

From: "Jason R. Mastaler"
Subject: Re: handling internationalized headers
Date: Thu, 17 Oct 2002 11:17:00 -0600

> If it turns out that many other users also don't like having to
> specify the input charset, I may investigate this further (unless
> you'd like to attempt this yourself).

That sounds reasonable.

> > From my perspective, this particular problem arises from an
> > unfortunate coincidence of:
> >
> >   1) A mechanism in Python being a certain way
> >
> >   2) TMDA employing that particular mechanism
> >
> >   3) ISO-2022-JP escape sequences being a certain way
> >
> > It's very unfortunate that Python didn't get a lot of feedback from
> > Japanese-using users early in its development -- I suspect if it
> > had, this particular problem would not exist.
> 
> I'm not sure that it's a problem specific to Python.  
> 
> Because ISO-2022-JP is 7bit, it contains many special characters like
> \, %, &, etc. which I think would cause problems for many languages
> when doing string manipulation.

I was thinking if the manipulation of ISO-2022-JP had been brought up
early on (that is, in the days of 1.3 and 1.4) and there had been
significant interest from users, some way of working with it better
might have been developed.  Then again, may be it wouldn't have been
possible because of the 7-bit issue you mention.

> I've been told that regardless of application or language, EUC-JP is
> the best to use in programming because all the Japanese characters
> are msb set 1 (like UTF-8).

It's my preference as well.

> So, I feel the problem is due more to the extra complexity arising
> from the fact that Japanese has three different encoding schemes.

Certainly a lot of problems are caused by this.  May be you are right.

[ I just remember how much more difficulty I had (years ago) doing
  Japanese manipulation in Python compared to the other languages I
  was using.  I'd be happy if the support has improved as I've started
  using Python recently -- perhaps I'll find out if I start doing
  manipulation of Japanese text. ]

> Anyway, you'd know better than me, but I was under the impression it
> would not be difficult for Japanese users to write their templates in
> EUC-JP.  At least it doesn't seem to be a problem in the Mailman
> world.  Time will tell I guess.

I suspect it won't be a problem -- at least while the users are mostly
UNIX-users.  I hope it remains a non-issue.
_________________________________________________
tmda-workers mailing list ([EMAIL PROTECTED])
http://tmda.net/lists/listinfo/tmda-workers

Reply via email to