[issue8024] upgrade to Unicode 5.2

2010-03-19 Thread Marc-Andre Lemburg
Marc-Andre Lemburg m...@egenix.com added the comment: Florent Xicluna wrote: Florent Xicluna florent.xicl...@gmail.com added the comment: Reverted in 3.x: it triggers some failures. Symptoms: * repr('\uaaa') gives an empty string * test_bigmem fails repr() for Unicode doesn't use

[issue8024] upgrade to Unicode 5.2

2010-03-19 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: The bug was a side-effect of the update. Code point \u is now assigned to a printable character: ;TAI VIET LETTER LOW VO;Lo;0;L;N; And test_bigmem relies on this code point being non-printable. I changed it for a
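A minimal sketch of how the effect described above can be observed from Python 3, assuming the interpreter bundles a Unicode database of version 5.2 or later. The character is looked up by the name quoted in the comment, so no code point is hard-coded; on an older database, unicodedata.lookup() raises KeyError instead.

```python
import unicodedata

# Look the character up by the name quoted in the comment above.
ch = unicodedata.lookup("TAI VIET LETTER LOW VO")

print(unicodedata.unidata_version)  # version of the bundled Unicode database
print(unicodedata.category(ch))     # general category of the character
print(ch.isprintable())             # whether repr() may emit it unescaped
print(ascii(ch))                    # escaped form, independent of printability
```

A test that needs a non-printable code point can guard itself with str.isprintable() rather than assuming a particular code point stays unassigned across database upgrades.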

[issue8024] upgrade to Unicode 5.2

2010-03-19 Thread Amaury Forgeot d'Arc
Amaury Forgeot d'Arc amaur...@gmail.com added the comment: Looking closer at the patch, you also changed the unicodetype mappings and since this removes a lot of entries, it looks like the Unicode consortium either moved some mappings out of the UCD file into a separate file or made some

[issue8024] upgrade to Unicode 5.2

2010-03-19 Thread Marc-Andre Lemburg
Marc-Andre Lemburg m...@egenix.com added the comment: Florent Xicluna wrote: Florent Xicluna florent.xicl...@gmail.com added the comment: The bug was a side-effect of the update. Code point \u is now assigned to a printable character: ;TAI VIET LETTER LOW

[issue8024] upgrade to Unicode 5.2

2010-03-19 Thread Marc-Andre Lemburg
Marc-Andre Lemburg m...@egenix.com added the comment: Amaury Forgeot d'Arc wrote: Amaury Forgeot d'Arc amaur...@gmail.com added the comment: Looking closer at the patch, you also changed the unicodetype mappings and since this removes a lot of entries, it looks like the Unicode

[issue8024] upgrade to Unicode 5.2

2010-03-19 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: Merged with r79093 -- resolution: accepted -> fixed stage: commit review -> committed/rejected status: open -> closed ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue8024

[issue8024] upgrade to Unicode 5.2

2010-03-18 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: Done with r79059 and r79062. -- resolution: -> fixed stage: patch review -> committed/rejected status: open -> closed

[issue8024] upgrade to Unicode 5.2

2010-03-18 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: Reverted in 3.x: it triggers some failures. Symptoms: * repr('\uaaa') gives an empty string * test_bigmem fails -- resolution: fixed -> accepted stage: committed/rejected -> commit review status: closed -> open

[issue8024] upgrade to Unicode 5.2?

2010-03-15 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: It is just a matter of running makeunicodedata after changing 5.1 -> 5.2. It generates the 3 db files: * Modules/unicodedata_db.h * Modules/unicodename_db.h * Objects/unicodetype_db.h Then you adjust the expectedchecksum in
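To illustrate why an expected checksum has to be adjusted after the database files are regenerated, here is a rough sketch of a checksum over per-character properties. This is not the hashing used by the actual test suite; the function name database_checksum, the chosen properties, and the limit parameter are all assumptions made for illustration.

```python
import hashlib
import unicodedata


def database_checksum(limit=0x10000):
    """Hash a few properties of every code point below `limit` (hypothetical helper)."""
    h = hashlib.sha1()
    for cp in range(limit):
        ch = chr(cp)
        record = "%s|%s|%s|%s\n" % (
            unicodedata.category(ch),       # general category, e.g. "Lo"
            unicodedata.bidirectional(ch),  # bidi class, may be empty
            unicodedata.combining(ch),      # canonical combining class
            unicodedata.name(ch, ""),       # empty string for unnamed code points
        )
        h.update(record.encode("ascii"))
    return h.hexdigest()


print(unicodedata.unidata_version, database_checksum())
```

Any newly assigned character changes at least one hashed record, so a value recorded against one database version is expected to differ after an upgrade and has to be updated alongside the generated files.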

[issue8024] upgrade to Unicode 5.2?

2010-03-15 Thread Marc-Andre Lemburg
Marc-Andre Lemburg m...@egenix.com added the comment: Florent Xicluna wrote: Florent Xicluna florent.xicl...@gmail.com added the comment: It is just a matter of running makeunicodedata after changing 5.1 -> 5.2. It generates the 3 db files: * Modules/unicodedata_db.h *

[issue8024] upgrade to Unicode 5.2?

2010-03-15 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: So the Unicode database format itself has not changed ? No. The changes listed below have no impact afai-have-tested. - - - - - - - F. Unicode Character Database Changes The

[issue8024] upgrade to Unicode 5.2?

2010-03-15 Thread Marc-Andre Lemburg
Marc-Andre Lemburg m...@egenix.com added the comment: Florent Xicluna wrote: Florent Xicluna florent.xicl...@gmail.com added the comment: So the Unicode database format itself has not changed ? No. The changes listed below have no impact afai-have-tested. Ok, so +1 for updating to 5.2.

[issue8024] upgrade to Unicode 5.2

2010-03-15 Thread Florent Xicluna
Changes by Florent Xicluna florent.xicl...@gmail.com: -- title: upgrade to Unicode 5.2? -> upgrade to Unicode 5.2

[issue8024] upgrade to Unicode 5.2?

2010-02-26 Thread Florent Xicluna
Florent Xicluna florent.xicl...@gmail.com added the comment: Excerpt of the release note: http://www.unicode.org/versions/Unicode5.2.0/ The Unicode Standard, Version 5.2, adds 6,648 characters and significantly improves the documentation of conformance requirements for the specification of

[issue8024] upgrade to Unicode 5.2?

2010-02-26 Thread Marc-Andre Lemburg
Marc-Andre Lemburg m...@egenix.com added the comment: Have you checked how big the structural changes are between 5.2 and 5.1? If we only have to rerun the makeunicodedata.py script, then I'd be +1 on going with 5.2. Otherwise, I think it's better to wait another release before upgrading to