STINNER Victor victor.stin...@haypocalc.com added the comment:
I added a cp65001 codec to Python 3.3: see issue #13216.
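A minimal sketch of how one might check whether a given interpreter knows the codec name. The helper `describe_codec` is hypothetical and not part of the issue; only `codecs.lookup` from the standard library is used. Whether `cp65001` resolves, and to what, depends on the Python version and platform, so the sketch makes no assumption about the result.

```python
import codecs


def describe_codec(name):
    """Return the canonical codec name for `name`, or None if it is unknown."""
    try:
        info = codecs.lookup(name)
    except LookupError:
        # The interpreter has no codec (or alias) registered under this name.
        return None
    return info.name


if __name__ == "__main__":
    # The output varies by Python version and platform.
    print("cp65001 ->", describe_codec("cp65001"))
    print("utf-8   ->", describe_codec("utf-8"))
```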
--
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue6058
___
STINNER Victor victor.stin...@haypocalc.com added the comment:
Different tests proved that cp65001 can *not* be set as an alias to utf-8, and
that's why I'm closing this issue.
Anyway, I don't think that cp65001 is configured by default on any Windows
setup. It is only set by the user, using
Changes by David Sankel cam...@gmail.com:
--
nosy: +David.Sankel
Changes by Michael Foord mich...@voidspace.org.uk:
--
nosy: -michael.foord
David-Sarah Hopwood david-sa...@jacaranda.org added the comment:
This problem causes {{{os.getcwdu()}}} to fail when the console code page is
set to 65001 (always, I think):
{{{
t:\>ver
Microsoft Windows [Version 6.0.6002]
t:\>chcp
Active code page: 65001
t:\python -c import os; print
David-Sarah Hopwood david-sa...@jacaranda.org added the comment:
I said: "There is only one correct way to encode/decode UTF-8." This is true
modulo differences in the treatment of initial byte order marks.
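As an illustration of the byte order mark caveat, here is a small sketch using Python's own `utf-8` and `utf-8-sig` codecs. It says nothing about how Windows code page 65001 treats a BOM; it only shows that two UTF-8 decoders can legitimately differ on this one point.

```python
import codecs

# UTF-8 data that starts with a byte order mark (EF BB BF).
data = codecs.BOM_UTF8 + "abc".encode("utf-8")

# The plain codec keeps the BOM as the character U+FEFF.
plain = data.decode("utf-8")

# The "-sig" variant strips a leading BOM on decode and adds one on encode.
stripped = data.decode("utf-8-sig")
with_bom = "abc".encode("utf-8-sig")

print(repr(plain), repr(stripped), with_bom)
```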
--
David-Sarah Hopwood david-sa...@jacaranda.org added the comment:
I meant to say that the os.getcwdu() test in msg119440 was done with Windows
native Python 2.6.2.
--
David-Sarah Hopwood david-sa...@jacaranda.org added the comment:
Oops, false alarm. python -c "import os; print repr(os.getcwdu())" works as
expected, so the exception is part of issue 1602.
(My comment about there being no need to distinguish this codepage from UTF-8
stands.)
--
Changes by Terry J. Reedy tjre...@udel.edu:
--
versions: -Python 2.6, Python 2.7, Python 3.1
STINNER Victor victor.stin...@haypocalc.com added the comment:
Would it be possible to implement a cp65001 codec in Python using
MultiByteToWideChar() / WideCharToMultiByte() with codepage=CP_UTF8?
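A rough sketch of what the decoding half of that idea could look like from Python via `ctypes`. This is an illustration only, not the codec that was eventually written; the function name `decode_with_cp_utf8` is hypothetical, and the sketch passes no conversion flags and does not handle incremental decoding. It runs only on Windows and raises `OSError` elsewhere.

```python
import sys

CP_UTF8 = 65001  # Windows code page identifier for UTF-8


def decode_with_cp_utf8(data):
    """Decode bytes by calling MultiByteToWideChar with CP_UTF8 (Windows only)."""
    if sys.platform != "win32":
        raise OSError("MultiByteToWideChar is only available on Windows")

    import ctypes
    from ctypes import wintypes

    kernel32 = ctypes.WinDLL("kernel32", use_last_error=True)
    mb2wc = kernel32.MultiByteToWideChar
    mb2wc.argtypes = [wintypes.UINT, wintypes.DWORD, wintypes.LPCSTR,
                      ctypes.c_int, wintypes.LPWSTR, ctypes.c_int]
    mb2wc.restype = ctypes.c_int

    if not data:
        return ""  # the API rejects a zero-length input, so handle it here

    # First call: ask how many wide characters the result needs.
    needed = mb2wc(CP_UTF8, 0, data, len(data), None, 0)
    if needed == 0:
        raise ctypes.WinError(ctypes.get_last_error())

    # Second call: perform the conversion into a buffer of that size.
    buf = ctypes.create_unicode_buffer(needed)
    written = mb2wc(CP_UTF8, 0, data, len(data), buf, needed)
    if written == 0:
        raise ctypes.WinError(ctypes.get_last_error())
    return ctypes.wstring_at(buf, written)
```

The encoding direction would follow the same two-call pattern with `WideCharToMultiByte`.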
--
nosy: +haypo
Marc-Andre Lemburg m...@egenix.com added the comment:
I created two scripts for exporting the IronPython findings and checking them
in CPython.
These are the results:
Checking code Page 28591 against encoding 'iso-8859-1' using file
'iso-8859-1.map'
0 errors
Checking code Page 28592
Changes by Marc-Andre Lemburg m...@egenix.com:
Added file: http://bugs.python.org/file15858/export-encodings.py
Changes by Marc-Andre Lemburg m...@egenix.com:
Added file: http://bugs.python.org/file15859/check-encodings.py
Marc-Andre Lemburg m...@egenix.com added the comment:
What we could do is add new codecs based on the .NET tables for cp65000 et al.
However, before doing this, I'd like to know where these code page settings can
occur and what exact names are used for them. If they only appear in .NET and
Changes by Antoine Pitrou pit...@free.fr:
--
priority: -> high
stage: -> patch review
Stefan Krah stefan-use...@bytereef.org added the comment:
I wrote a small C application that converts all possible
wchar_t to multibyte strings, using code page 65001.
Usage:
cl.exe gen65001.c
python check65001.py
Except for the newline character and a sequence from
55296-57343, this code
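The range 55296-57343 is U+D800 to U+DFFF, the UTF-16 surrogate code points. A short sketch of how Python's own `utf-8` codec treats a lone surrogate, for comparison with whatever the Windows code page produces (this snippet does not test code page 65001 itself):

```python
lone = "\ud800"  # code point 55296, the first surrogate

# The strict utf-8 codec refuses to encode a lone surrogate.
try:
    lone.encode("utf-8")
    strict_encodes = True
except UnicodeEncodeError:
    strict_encodes = False

# The "surrogatepass" error handler emits the three-byte form instead.
passed = lone.encode("utf-8", "surrogatepass")

# Decoding those bytes is likewise rejected unless surrogatepass is used.
try:
    passed.decode("utf-8")
    strict_decodes = True
except UnicodeDecodeError:
    strict_decodes = False

print(strict_encodes, passed, strict_decodes)
```

Any difference in how surrogates are handled is one way a platform codec can diverge from Python's `utf-8`.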
Changes by Stefan Krah stefan-use...@bytereef.org:
Added file: http://bugs.python.org/file15662/check65001.py
Martin v. Löwis mar...@v.loewis.de added the comment:
This report is really about the issues reported in #1602 and #7441, i.e.
where console output fails if the terminal encoding is 65001. Rather
than adding the alias, I would prefer to find out why terminal output
fails in that code page.
Χρήστος Γεωργίου (Christos Georgiou) t...@users.sourceforge.net added the
comment:
re Martin's question, I can offer the indirect wisdom of Michael Kaplan
in this blog post:
http://blogs.msdn.com/michkap/archive/2008/03/18/8306597.aspx
where he mentions that the easiest way to output unicode
Martin v. Löwis mar...@v.loewis.de added the comment:
I also wonder whether stdin/stdout/stderr should be streams on Windows
that use WriteConsole instead of WriteFile. Then the entire issue of
console CP would go away for Unicode output.
--
Antoine Pitrou pit...@free.fr added the comment:
(I tried running your script under IronPython 2.6 with Mono but I got a
bunch of errors; since I don't know IronPython at all I can't really
investigate)
--
nosy: +pitrou
Marc-Andre Lemburg m...@egenix.com added the comment:
Could you provide some official reference defining the alias?
Thanks.
--
Marc-Andre Lemburg m...@egenix.com added the comment:
Never mind, I found this reference:
http://msdn.microsoft.com/en-us/library/system.text.encoding(VS.80).aspx
Looks like we could add a few more aliases for other encodings as well.
--
Martin v. Löwis mar...@v.loewis.de added the comment:
> http://msdn.microsoft.com/en-us/library/system.text.encoding(VS.80).aspx
> Looks like we could add a few more aliases for other encodings as well.
I wouldn't trust this table. Microsoft is on record of implementing the
code pages with
Marc-Andre Lemburg m...@egenix.com added the comment:
Martin v. Löwis wrote:
> Martin v. Löwis mar...@v.loewis.de added the comment:
>> http://msdn.microsoft.com/en-us/library/system.text.encoding(VS.80).aspx
>> Looks like we could add a few more aliases for other encodings as well.
I
Changes by Antoine Pitrou pit...@free.fr:
--
nosy: +michael.foord
Marc-Andre Lemburg m...@egenix.com added the comment:
Here's a script for IronPython 2.6 that checks a few encoders.
Since IronPython doesn't appear to come with the full set of Python
codecs and it's also not clear whether the implemented codecs actually
match the default Python ones, I'm not
Changes by flox la...@yahoo.fr:
--
versions: +Python 2.6, Python 3.1, Python 3.2
New submission from Χρήστος Γεωργίου (Christos Georgiou)
t...@users.sourceforge.net:
Add 'cp65001' (Microsoft term for UTF-8) as an alias to 'utf_8'
--
components: Library (Lib), Unicode
files: alias_cp65001.diff
keywords: patch
messages: 88060
nosy: tzot
severity: normal
status: open
Changes by Χρήστος Γεωργίου (Christos Georgiou) t...@users.sourceforge.net:
--
components: +Windows
Changes by Χρήστος Γεωργίου (Christos Georgiou) t...@users.sourceforge.net:
Removed file: http://bugs.python.org/file14013/alias_cp65001.diff
Changes by Χρήστος Γεωργίου (Christos Georgiou) t...@users.sourceforge.net:
Added file: http://bugs.python.org/file14014/alias_cp65001.diff
Changes by Antoine Pitrou pit...@free.fr:
--
nosy: +lemburg, loewis
Changes by Ezio Melotti ezio.melo...@gmail.com:
--
nosy: +ezio.melotti