"Jason R. Mastaler" <[EMAIL PROTECTED]> writes:
>> I also tried to install cjkcodecs-1.0.1 and use it, but it wasn't
>> solved.
>
> This is because cjkcodecs isn't supported. If you install
> KoreanCodecs 2.0.5, it will probably work though.
>
> However, I also see that the author has retired KoreanCodecs in favor
> of cjkcodecs. Let me take a look at see what needs to be done to
> support cjkcodecs..
Jinhyok, could you please try the attached diff and let me know if it
fixes your problem? This patch adds support for cjkcodecs.
Index: Charset.py
===================================================================
RCS file: /cvsroot/tmda/tmda/TMDA/pythonlib/email/Charset.py,v
retrieving revision 1.7
diff -u -r1.7 Charset.py
--- Charset.py 6 Mar 2003 21:07:43 -0000 1.7
+++ Charset.py 25 Nov 2003 03:13:29 -0000
@@ -88,26 +88,21 @@
'ascii': 'us-ascii',
}
-# Map charsets to their Unicode codec strings. Note that Python doesn't come
-# with any Asian codecs by default. Here's where to get them:
-#
-# Japanese -- http://www.asahi-net.or.jp/~rd6t-kjym/python
-# Korean -- http://sf.net/projects/koco
-# Chinese -- http://sf.net/projects/python-codecs
-#
-# Note that these codecs have their own lifecycle and may be in varying states
-# of stability and useability.
+# Map charsets to their Unicode codec strings. Note that Python
+# doesn't come with any Asian codecs by default. The CJKCodecs
+# package available at http://cjkpython.i18n.org/ provides a unified
+# unicode codec set for Chinese, Japanese and Korean encodings.
CODEC_MAP = {
- 'euc-jp': 'japanese.euc-jp',
- 'iso-2022-jp': 'japanese.iso-2022-jp',
- 'shift_jis': 'japanese.shift_jis',
- 'euc-kr': 'korean.euc-kr',
- 'ks_c_5601-1987': 'korean.cp949',
- 'iso-2022-kr': 'korean.iso-2022-kr',
- 'johab': 'korean.johab',
- 'gb2132': 'eucgb2312_cn',
- 'big5': 'big5_tw',
+ 'euc-jp': 'cjkcodecs.euc-jp',
+ 'iso-2022-jp': 'cjkcodecs.iso-2022-jp',
+ 'shift_jis': 'cjkcodecs.shift_jis',
+ 'euc-kr': 'cjkcodecs.euc-kr',
+ 'ks_c_5601-1987': 'cjkcodecs.cp949',
+ 'iso-2022-kr': 'cjkcodecs.iso-2022-kr',
+ 'johab': 'cjkcodecs.johab',
+ 'gb2312': 'cjkcodecs.gb2312',
+ 'big5': 'cjkcodecs.big5',
'utf-8': 'utf-8',
# Hack: We don't want *any* conversion for stuff marked us-ascii, as all
# sorts of garbage might be sent to us in the guise of 7-bit us-ascii.
_________________________________________________
tmda-workers mailing list ([EMAIL PROTECTED])
http://tmda.net/lists/listinfo/tmda-workers