[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread STINNER Victor
Change by STINNER Victor : -- nosy: -vstinner ___ Python tracker ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Found an "Unlink" bottom at the bottom of the message view. This appears to remove the messages from the issue. -- ___ Python tracker

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Marc-Andre Lemburg
Change by Marc-Andre Lemburg : -- Removed message: https://bugs.python.org/msg367514 ___ Python tracker ___ ___ Python-bugs-list

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Marc-Andre Lemburg
Change by Marc-Andre Lemburg : -- Removed message: https://bugs.python.org/msg367515 ___ Python tracker ___ ___ Python-bugs-list

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Marc-Andre Lemburg
Change by Marc-Andre Lemburg : -- Removed message: https://bugs.python.org/msg320603 ___ Python tracker ___ ___ Python-bugs-list

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: I have marked the messages as spam. Can't seem to remove them, though. -- ___ Python tracker ___

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Antti Haapala
Antti Haapala added the comment: The messages above seem to be a (quite likely a machine) translation of André's comment with a spam link to a paint ad site, so no need to bother to translate it. Also, I invited Hiếu to the nosy list in case this patch needs some info that requires a native

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2020-04-28 Thread Hieu Nguyen
Change by Hieu Nguyen : -- nosy: +hieu.nguyen ___ Python tracker ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2018-06-27 Thread STINNER Victor
STINNER Victor added the comment: Google Translate of msg320603 :-) As far as I can understand, we are "subset" of each other only in the sense that VN1 have extensive map of the characters, but it also overlaps partially with control characters C0 and C1 in the page ISO code - 139

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2018-06-27 Thread Tô Thị Mai
Tô Thị Mai added the comment: Theo tôi có thể hiểu, chúng là "tập hợp con" của nhau chỉ theo nghĩa là VN1 có bản đồ rộng nhất của các ký tự, nhưng điều này cũng trùng lặp một phần với các ký tự điều khiển C0 và C1 trong các trang mã ISO - có 139 nhân vật bổ sung! VN2 sau đó cho phép C0 và C1

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2016-10-21 Thread Antti Haapala
Antti Haapala added the comment: Ah there was something that I overlooked before - the VN1 and VN2 both have combining accents too. If I read correctly, the main letter should precede the combining character, just as in Unicode; VN3 seems to lack combining characters altogether. Thus, for

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2016-10-21 Thread Antti Haapala
Antti Haapala added the comment: I found the full document on SlideShare: http://www.slideshare.net/sacobat/tcvn-5712-1993-cng-ngh-thng-tin-b-m-chun-8bit-k-t-vit-dng-trong-trao-i-thng-tin As far as I can understand, they're "subsets" of each other only in the sense that VN1 has the widest

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2016-10-21 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: Since no Unicode mapping table is found at the Unicode website, we need at least the link to public official document that specifies the encoding. If VN3 is a subset of VN2, which itself is a subset of VN1, VN1 definitely looks the most preferable choice

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2016-10-13 Thread orban
orban added the comment: Here this is a patch to added vietnamese codec tcvn. I am not sure about the name of the codecs...tcvn5712, tcvn5712_3 ? test_xml_etree, test_codesc, test_unicode is running. Is it enough for the doc? -- keywords: +patch nosy: +matorban Added file:

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-29 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Jean Christophe: Please have a look at the patch for ticket http://bugs.python.org/issue22681 as example of the doc patch. Thanks. -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-29 Thread Serhiy Storchaka
Serhiy Storchaka added the comment: Or issue22682. Needed: * The codec itself (in Lib/encodings/ directory). * Entries in aliases table (Lib/encodings/aliases.py). * A row in encodings table (Doc/library/codecs.rst). * An entry in What's New (Doc/whatsnew/3.5.rst). * May be addition in

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jakub Wilk
Changes by Jakub Wilk jw...@jwilk.net: -- nosy: +jwilk ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081 ___ ___ Python-bugs-list mailing list

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jean Christophe André
Changes by Jean Christophe André pyt...@andrele.org: Added file: http://bugs.python.org/file37054/TCVN5712-1.TXT ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081 ___

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jean Christophe André
Changes by Jean Christophe André pyt...@andrele.org: Added file: http://bugs.python.org/file37055/TCVN5712-2.TXT ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081 ___

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jean Christophe André
Changes by Jean Christophe André pyt...@andrele.org: Added file: http://bugs.python.org/file37056/TCVN5712-3.TXT ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081 ___

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jean Christophe André
Jean Christophe André added the comment: I failed to find anything about TCVN 5712:1999 except the official announcement of it superseding TCVN 5712:1993 on TCVN's website. I also was not able to find any material using TCVN 5712:1999. My guess is that TCVN 6909:2001 having been released only

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jean Christophe André
Jean Christophe André added the comment: Marc-Andre, about “Please also provide a patch for the documentation”, could you please guide me on this? I can write some documentation, but I simply don't know in what form you expect it. Could you point me to some examples please? --

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-28 Thread Jean Christophe André
Changes by Jean Christophe André pyt...@andrele.org: Removed file: http://bugs.python.org/file34644/vntime_tcvn.py ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081 ___

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-20 Thread Jean Christophe André
Jean Christophe André added the comment: A note to inform about my progress. (I had a long period without free time at hand) While seeking (again) official documents on the topic, I mainly found a lot of non-official ones, but some are notorious enough to use them as references. I am now in

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-10-05 Thread Serhiy Storchaka
Changes by Serhiy Storchaka storch...@gmail.com: -- nosy: +serhiy.storchaka stage: - needs patch ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081 ___

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-03-28 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Some comments: * Please provide some background information how widely the encoding is used. I get less than 1000 hits in Google when looking for TCVN 5712:1993. Now, the encoding was a standard in Vietnam, but it has been updated in 1999 to TCVN

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-03-28 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Retargeting to 3.5, since all other releases don't allow addition of new features. -- versions: +Python 3.5 -Python 2.7, Python 3.2 ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue21081

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-03-28 Thread Jean Christophe André
Jean Christophe André added the comment: * Please provide some background information how widely the encoding is used. I get less than 1000 hits in Google when looking for TCVN 5712:1993. Here is the background for the need for this encoding. The recent laws[0] in Vietnam have set TCVN

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-03-28 Thread Marc-Andre Lemburg
Marc-Andre Lemburg added the comment: Thanks for your answers. I think the best way forward would be to some up with an official encoding map of the TCVN 5712:1999 encoding, translate that into a format that gencodec.py can use and then add the generated codec to Python 3.5. We can then add the

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-03-28 Thread Jean Christophe André
Jean Christophe André added the comment: I will prepare the official encoding map(s) based on the standard(s). I'll also have to check which encoding correspond to my current encoding map, since this is the one useful in real life. Please also provide a patch for the documentation I

[issue21081] missing vietnamese codec TCVN 5712:1993 in Python

2014-03-27 Thread Jean Christophe André
New submission from Jean Christophe André: In Python version 2.x and at least 3.2 there no Vietnamese encoding support for TCVN 5712:1993. This encoding is currently largely used in Vietnam and I think it would be usefull to add it to the python core encodings. I already wrote some codec