[issue1046] HTMLCalendar.formatyearpage not behaving as documented

2007-08-28 Thread Walter Dörwald
Walter Dörwald added the comment: Fixed in r57620 -- nosy: +doerwalter resolution: -> fixed status: open -> closed __ Tracker <[EMAIL PROTECTED]> <http://bugs.pytho

[issue1125] bytes.split shold have same interface as str.split, or different name

2007-09-07 Thread Walter Dörwald
Walter Dörwald added the comment: Because it's not clear whether b'\xa0' *is* whitespace or not. Bytes have no meaning, characters do. -- nosy: +doerwalter __ Tracker <[EMAIL PROTECTED]> <http://b

[issue1399] XML codec

2007-11-07 Thread Walter Dörwald
Walter Dörwald added the comment: "xml-auto-detect" sounds OK to me, it even makes sense for the encoder, because it normally detects the encoding to use for writing from the XML declaration. We could put "xml-auto-detect" into the alias mapping and keep xml as the module na

[issue1399] XML codec

2007-11-08 Thread Walter Dörwald
Walter Dörwald added the comment: OK, I've changed the name of the codec to xml_auto_detect and added support for EBCDIC. Added file: http://bugs.python.org/file8717/diff2.txt __ Tracker <[EMAIL PROTECTED]> <http://bugs.python

[issue1427] Error in standard module calendar

2007-11-12 Thread Walter Dörwald
Walter Dörwald added the comment: Fixed in r58942 (trunk) and r58943 (2.5). Closing the issue. -- resolution: -> fixed status: open -> closed __ Tracker <[EMAIL PROTECTED]> <http://bugs.pytho

[issue1328] feature request: force BOM option

2007-11-15 Thread Walter Dörwald
Walter Dörwald added the comment: jgsack wrote: > > If codec utf_8 or utf_8_sig were to accept input with or without the > 3-byte BOM, and write it as currently specified without/with the BOM > respectively, then _I_ can reread again with either utf_8 or utf_8_sig. That's

[issue1328] feature request: force BOM option

2007-11-15 Thread Walter Dörwald
Walter Dörwald added the comment: > For utf16, (arguably) a missing BOM should merely assume machian endianess. > For utf_16_le, utf_16_be input, both should accept & discard a BOM. > On output, I'm not sure; maybe all should write a BOM unless passed a flag > signifying no

[issue1444] utf_8_sig streamreader bug, patch, and test

2007-11-19 Thread Walter Dörwald
Walter Dörwald added the comment: Checked in your change and the test as r59049 (trunk) and r59050 (2.5). Thanks for the patch. -- resolution: -> fixed status: open -> closed __ Tracker <[EMAIL PROTECTED]> <http://bugs.pytho

[issue1521] string.decode() fails on long strings

2007-11-29 Thread Walter Dörwald
Walter Dörwald added the comment: Can you attach a (small) example that demonstrates the bug? -- nosy: +doerwalter __ Tracker <[EMAIL PROTECTED]> <http://bugs.python.org/

[issue12171] Reset method of the incremental encoders of CJK codecs calls the decoder reset function

2011-05-26 Thread Walter Dörwald
Walter Dörwald added the comment: +1 on the documentation changes. -- nosy: +doerwalter ___ Python tracker <http://bugs.python.org/issue12171> ___ ___ Python-bug

[issue10087] HTML calendar is broken

2010-10-13 Thread Walter Dörwald
Walter Dörwald added the comment: Does the following patch fix your problems? -- keywords: +patch nosy: +doerwalter Added file: http://bugs.python.org/file19217/calendar.diff ___ Python tracker <http://bugs.python.org/issue10

[issue10038] json.loads() on str should return unicode, not str

2010-11-02 Thread Walter Dörwald
Walter Dörwald added the comment: The following patch (against the release27-maint branch) seems to fix the problem. -- keywords: +patch nosy: +doerwalter Added file: http://bugs.python.org/file19468/json.diff ___ Python tracker <h

[issue10329] trace.py and unicode in Python 3

2010-11-05 Thread Walter Dörwald
New submission from Walter Dörwald : It seems that on Python 3 (i.e. the py3k branch) trace.py can not handle source that includes Unicode characters. Running the test suite with code coverage info via ./python Lib/test/regrtest.py -T -N -uurlfetch,largefile,network,decimal sometimes

[issue10329] trace.py and unicode in Python 3

2010-11-08 Thread Walter Dörwald
Walter Dörwald added the comment: Using the original encoding of the Python source file might be the politically correct thing to do, but it complicates handling of the output of trace.py. For each file you have to do the encoding detection dance again. It would be great if I could specify

[issue10329] trace.py and unicode in Python 3

2010-11-09 Thread Walter Dörwald
Walter Dörwald added the comment: > STINNER Victor added the comment: > >> ... it complicates handling of the output of trace.py. >> For each file you have to do the encoding detection dance again ... > > What? You just have to call one function! tokenize.open() :

[issue10541] regrtest.py -T broken

2010-11-26 Thread Walter Dörwald
New submission from Walter Dörwald : Running regrtest.py with coverage option seems to be broken for the py3k branch at the moment. Run the following commands on the shell: wget http://svn.python.org/snapshots/python3k.tar.bz2 tar xjf python3k.tar.bz2 cd python ./configure --enable-unicode

[issue10541] regrtest.py -T broken

2010-11-29 Thread Walter Dörwald
Walter Dörwald added the comment: OK, I reran the test with:: ./python -mtest.regrtest -T -N test_urllib and this does indeed produce coverage files (for _abcoll, _weakrefset, abc, base64, codecs, collections, contextlib, functools, genericpath, hashlib, locale, mimetypes, os, posixpath

[issue1706460] access to unicodedata (via codepoints or 2-char surrogates)

2008-06-02 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: Fixed for 2.6 in r63899. -- nosy: +doerwalter resolution: -> fixed status: open -> closed ___ Python tracker <[EMAIL PROTECTED]> <http://bugs.py

[issue1706460] access to unicodedata (via codepoints or 2-char surrogates)

2008-06-03 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: Fixed for 3.0 in r63918 ___ Python tracker <[EMAIL PROTECTED]> <http://bugs.python.org/issue1706460> ___ __

[issue701743] Reloading pseudo modules

2008-06-23 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: AFAIK reload() is gone in 3.0 anyway, so I don't think this patch is relevant any longer. ___ Python tracker <[EMAIL PROTECTED]> <http://bugs.py

[issue3739] unicode-internal encoder reports wrong length

2008-08-30 Thread Walter Dörwald
New submission from Walter Dörwald <[EMAIL PROTECTED]>: The encoder for the "unicode-internal" codec reports the wrong length: Python 3.0b3+ (py3k, Aug 30 2008, 11:55:21) [GCC 4.0.1 (Apple Inc. build 5484)] on darwin Type "help", "copyright", "c

[issue7309] crasher in str(Exception())

2010-02-24 Thread Walter Dörwald
Walter Dörwald added the comment: On 24.02.10 15:28, Eric Smith wrote: > Eric Smith added the comment: > > Fixed: > > trunk: r78418 > release26-maint: r78419 > > Still working on porting to py3k and release31-maint. A much better solution would IMHO be to for

[issue8014] Setting a T_INT attribute raises internal error

2010-02-24 Thread Walter Dörwald
New submission from Walter Dörwald : In the current py3k branch setting an attribute of an object with PyMemberDefs raises an internal error: $ ./python.exe Python 3.2a0 (py3k:78419M, Feb 24 2010, 17:56:06) [GCC 4.2.1 (Apple Inc. build 5646)] on darwin Type "help", "copyright

[issue8092] utf8, backslashreplace and surrogates

2010-03-09 Thread Walter Dörwald
Walter Dörwald added the comment: After the patch the comment: /* Implementation limitations: only support error handler that return bytes, and only support up to four replacement bytes. */ no longer applies. Also I would like to see a version of this patch where the length limitation

[issue8377] Errata on page:http://docs.python.org/library/stdtypes.html

2010-04-12 Thread Walter Dörwald
Walter Dörwald added the comment: This is a common thinko. ;) If i is negative then len(s) - i would be greater than len(s). However len(s) + i is correct. Example: foo[-1] is foo[len(foo) + (-1)] is foo[len(foo)-1] -- nosy: +doerwalter resolution: -> invalid status: o
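The index arithmetic in this comment can be checked with a small sketch. The helper name normalize_index is hypothetical and used only for illustration; it is not part of the documentation under discussion.

```python
def normalize_index(i, length):
    """Map a possibly negative index to its non-negative equivalent."""
    # For negative i the offset is added to the length, never subtracted.
    return length + i if i < 0 else i


foo = ["a", "b", "c"]
last_via_negative = foo[-1]
last_via_length = foo[normalize_index(-1, len(foo))]
```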

[issue7651] Python3: guess text file charset using the BOM

2010-04-13 Thread Walter Dörwald
Walter Dörwald added the comment: Yes, that's the posting I was referring to. I wonder why the link is gone. -- ___ Python tracker <http://bugs.python.org/i

[issue2017] Calendar.yeardatescalendar etc. do not take 'month' argument

2008-02-06 Thread Walter Dörwald
Walter Dörwald added the comment: Fixed in r60618 (trunk) and r60619 (release25-maint) -- nosy: +doerwalter resolution: -> fixed status: open -> closed __ Tracker <[EMAIL PROTECTED]> <http://bugs.pytho

[issue2018] TextCalendar.formatmonth is not influenced by setfirstweekday

2008-02-06 Thread Walter Dörwald
Walter Dörwald added the comment: setfirstweekday() isn't supposed to have any influence on calendar objects created explicitly. The function setfirstweekday() is only for the module level interface. The documentation is wrong here. However you *can* change the first weekday wit

[issue2018] TextCalendar.formatmonth is not influenced by setfirstweekday

2008-02-07 Thread Walter Dörwald
Walter Dörwald added the comment: You're supposed to use firstweekday as a property instead of using the getter method getfirstweekday(). Anyway this is fixed now in r60651 (trunk) and r60652 (release25-maint) -- resolution: accepted -> fixed status: open -
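A minimal sketch of the distinction drawn in this thread, assuming a current Python 3 calendar module: the module-level setfirstweekday() only affects the module-level interface, while an explicitly created calendar is configured through its own firstweekday property.

```python
import calendar

# Module-level interface: setfirstweekday() changes the shared default calendar.
calendar.setfirstweekday(calendar.SUNDAY)
module_first = calendar.firstweekday()

# An explicitly created calendar keeps its own setting (Monday by default).
cal = calendar.TextCalendar()
explicit_first = cal.firstweekday  # read as a property, not via a getter call

# The property is writable, so the instance can be changed directly.
cal.firstweekday = calendar.SUNDAY
weekday_order = list(cal.iterweekdays())

# Restore the module default so the sketch has no lasting side effect.
calendar.setfirstweekday(calendar.MONDAY)
```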

[issue2018] TextCalendar.formatmonth is not influenced by setfirstweekday

2008-02-07 Thread Walter Dörwald
Walter Dörwald added the comment: The documentation is here: http://docs.python.org/dev/library/calendar.html#calendar.TextCalendar.formatmonth (or in Doc/library/calendar.rst in the source). Anyway the first of those documentation bugs is fixed now in r60649 (trunk) and r60650 (release25-maint

[issue1399] XML codec

2008-03-17 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: There was resistance in python-dev against this patch (see the thread at http://mail.python.org/pipermail/python-dev/2007-November/075138.html), so this issue should probably be closed as rejected. However there was consensus,

[issue1328] Force BOM option in UTF output.

2008-03-20 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: I don't see exactly what James is proposing. > For my needs, I would like the decoding parts of the utf_8 module > to treat an initial BOM as an optional signature and skip it if > there is one (just like the utf_8_sig

[issue1477] UnicodeDecodeError that cannot be caught in narrow unicode builds

2008-03-20 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: For a wide build, the code if (x <= 0x) *p++ = (Py_UNICODE) x; else { *p++ = (Py_UNICODE) x; looks strange. Furthermore with the patch applied Python no longer complains about ill

[issue1477] UnicodeDecodeError that cannot be caught in narrow unicode builds

2008-03-22 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: The patch looks good to me now. Go ahead and check it in. -- assignee: doerwalter -> amaury.forgeotdarc __ Tracker <[EMAIL PROTECTED]> <http://bugs.py

[issue1328] Force BOM option in UTF output.

2008-03-22 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: If you want to use UTF-8-sig for decoding and UTF-8 for encoding and have this available as one codec you can define your own codec for this: import codecs def search_function(name): if name == "myutf8": utf8

[issue1328] Force BOM option in UTF output.

2008-03-22 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: Oops, that code was supposed to read: import codecs def search_function(name): if name == "myutf8": utf8 = codecs.lookup("utf-8") utf8_sig = codecs.lookup("utf-8-sig") retur
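The snippet above is cut off after the two lookups. One way such a search function could be completed is sketched here; everything from the return statement onward is an assumption for illustration, not the code from the original message.

```python
import codecs


def search_function(name):
    if name == "myutf8":
        utf8 = codecs.lookup("utf-8")
        utf8_sig = codecs.lookup("utf-8-sig")
        # Assumed completion: decoding parts come from utf-8-sig (BOM is
        # skipped if present), encoding parts from utf-8 (no BOM written).
        return codecs.CodecInfo(
            name="myutf8",
            encode=utf8.encode,
            decode=utf8_sig.decode,
            incrementalencoder=utf8.incrementalencoder,
            incrementaldecoder=utf8_sig.incrementaldecoder,
            streamreader=utf8_sig.streamreader,
            streamwriter=utf8.streamwriter,
        )
    return None


codecs.register(search_function)

with_bom = b"\xef\xbb\xbfabc".decode("myutf8")
without_bom = b"abc".decode("myutf8")
encoded = "abc".encode("myutf8")
```

With this registration, input with or without the 3-byte signature decodes to the same text, and output never carries a BOM.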

[issue4178] codecs: Documentation Inconsistency

2008-10-23 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: I agree that the documentation should be fixed to read "encode/decode" instead of "encoder/decoder". ___ Python tracker <[EMAIL PROTECTED]> &

[issue4178] codecs: Documentation Inconsistency

2008-10-23 Thread Walter Dörwald
Walter Dörwald <[EMAIL PROTECTED]> added the comment: Fixed in r67005 (trunk) and r67006 (py3k). -- resolution: -> fixed status: open -> closed ___ Python tracker <[EMAIL PROTECTED]> <http://bugs

[issue5135] Expose simplegeneric function in functools module

2009-02-04 Thread Walter Dörwald
Walter Dörwald added the comment: The patch looks fine to me. Tests pass. I have no opinion about the name. Both "simplegeneric" and "generic" are OK to me. I wonder if being able to use register() directly instead of as a decorator should be dropped. Also IMHO the

[issue1076233] distutils.core.setup() with unicode arguments broken

2009-02-11 Thread Walter Dörwald
Walter Dörwald added the comment: It does indeed work with Python 2.6 (however not with 2.5). Closing. -- resolution: -> out of date status: open -> closed ___ Python tracker <http://bugs.python.org/iss

[issue5094] datetime lacks concrete tzinfo impl. for UTC

2009-02-11 Thread Walter Dörwald
Walter Dörwald added the comment: The patch doesn't include any changes to the documentation. -- nosy: +doerwalter ___ Python tracker <http://bugs.python.org/i

[issue38482] BUG in codecs.BufferedIncrementalDecoder

2019-10-15 Thread Walter Dörwald
Walter Dörwald added the comment: The documentation might be unclear here. But the argument iterator of iterdecode(iterator, encoding, errors='strict', **kwargs) *is* supposed to be an iterable over bytes objects. In fact iterencode() transforms an iterator over strings into a
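As a small illustration of the point above, codecs.iterdecode() expects an iterable of bytes objects and codecs.iterencode() an iterable of strings. The sample chunks are made up for this sketch; the first one deliberately ends in the middle of a multi-byte sequence.

```python
import codecs

# An iterable over bytes objects; the euro sign is split across two chunks.
chunks = [b"\xe2\x82", b"\xac 1", b"0"]
decoded = "".join(codecs.iterdecode(chunks, "utf-8"))

# The reverse direction takes an iterable over strings.
pieces = ["\u20ac", " 10"]
encoded = b"".join(codecs.iterencode(pieces, "utf-8"))
```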

[issue38482] BUG in codecs.BufferedIncrementalDecoder

2019-10-17 Thread Walter Dörwald
Walter Dörwald added the comment: codecs.iterencode()/iterdecode() are just shallow 10-line wrappers around incremental codecs (which are used as the basis of io streams). Note that the doc string for iterencode() contains: Encodes the input strings from the iterator using an

[issue36819] Crash during encoding using UTF-16/32 and custom error handler

2021-09-29 Thread Walter Dörwald
Walter Dörwald added the comment: The original specification (PEP 293) required that an error handler called for encoding *must* return a replacement string (not bytes). This returned string must then be encoded again. Only if this fails an exception must be raised. Returning bytes from the
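A minimal sketch of an encoding error handler in the PEP 293 style described here: it returns a replacement string (not bytes) plus the position at which encoding should resume. The handler name "example-qmark" is hypothetical and chosen only for this illustration.

```python
import codecs


def to_question_mark(exc):
    """Replace every unencodable character with '?'."""
    if isinstance(exc, UnicodeEncodeError):
        # Return a replacement *string*; the codec encodes it again.
        return ("?" * (exc.end - exc.start), exc.end)
    raise exc


codecs.register_error("example-qmark", to_question_mark)

result = "a\u20acb".encode("ascii", "example-qmark")
```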

[issue39939] Add str methods to remove prefixes or suffixes

2020-03-20 Thread Walter Dörwald
Walter Dörwald added the comment: IMHO the names don't fit Python's current naming scheme, so what about naming them "lchop" and "rchop"? -- nosy: +doerwalter ___ Python tracker <https:

[issue38352] In typing docs, note explicit import needed for IO and Pattern/Match

2020-12-04 Thread Walter Dörwald
Walter Dörwald added the comment: Shadowing the real modules `re` and `io` by from typing import * would indeed be bad, but that argument IMHO doesn't hold for the types `IO`, `TextIO` and `BinaryIO`, yet they are not listed in `typing.__all__`. Is there a reason for that? And i

[issue42930] xml.parsers.expat results differ buffer_text and / or buffer_size

2021-01-15 Thread Walter Dörwald
Walter Dörwald added the comment: Just a guess, but the buffer size might be so small that the text that you expect gets passed via **two** calls to _char_data(). You should refactor your code to simply collect all the text in _char_data() and act on it in the _end_element() handler. So
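The suggested refactoring might look roughly like the sketch below. The class name TextCollector and its attributes are assumptions; only the handler names _char_data() and _end_element() come from the comment. The sketch handles flat (non-nested) elements only.

```python
import xml.parsers.expat


class TextCollector:
    def __init__(self, buffer_text=False):
        self.parser = xml.parsers.expat.ParserCreate()
        self.parser.buffer_text = buffer_text
        self.parser.StartElementHandler = self._start_element
        self.parser.CharacterDataHandler = self._char_data
        self.parser.EndElementHandler = self._end_element
        self._parts = []
        self.texts = []

    def _start_element(self, name, attrs):
        self._parts = []

    def _char_data(self, data):
        # May be called several times for one run of text: just collect.
        self._parts.append(data)

    def _end_element(self, name):
        # Act on the complete text only once the element is closed.
        self.texts.append((name, "".join(self._parts)))
        self._parts = []

    def parse(self, data):
        self.parser.Parse(data, True)
        return self.texts


unbuffered = TextCollector(buffer_text=False).parse("<a>hello &amp; world</a>")
buffered = TextCollector(buffer_text=True).parse("<a>hello &amp; world</a>")
```

Because the text is joined only in _end_element(), the result does not depend on how the parser happens to split the character data.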

[issue35078] Allow customization of CSS class name of a month in calendar module

2020-06-02 Thread Walter Dörwald
Walter Dörwald added the comment: New changeset 85339f5c220a5e79c47c3a33c93f1dca5c59c52e by Srinivas Reddy Thatiparthy (శ్రీనివాస్ రెడ్డి తాటిపర్తి) in branch 'master': bpo-35078: Allow customization of CSS class name of a month in calendar module (gh-10137) https://github.

[issue35078] Allow customization of CSS class name of a month in calendar module

2020-06-02 Thread Walter Dörwald
Change by Walter Dörwald : -- resolution: -> fixed stage: patch review -> resolved status: open -> closed ___ Python tracker <https://bugs.python.or

[issue41115] Codecs should raise precise UnicodeDecodeError or UnicodeEncodeError

2020-06-25 Thread Walter Dörwald
Walter Dörwald added the comment: UnicodeEncodeError and UnicodeDecodeError are used to report un(en|de)codedable ranges in the source object, so it wouldn't make sense to use them for errors that have nothing to do with problems in the source object. Their constructor requires 5 argu
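To illustrate why these exceptions are tied to a range in the source object, here is how a UnicodeEncodeError is constructed from its five arguments (encoding, object, start, end, reason). The sample values are made up for this sketch.

```python
err = UnicodeEncodeError(
    "ascii",                      # encoding
    "a\u20acb",                   # the object being encoded
    1,                            # start of the unencodable range
    2,                            # end of the unencodable range (exclusive)
    "ordinal not in range(128)",  # reason
)

bad_part = err.object[err.start:err.end]
```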

[issue41465] io.TextIOWrapper.errors not writable

2020-08-03 Thread Walter Dörwald
New submission from Walter Dörwald : PEP 293 states the following: """ For stream readers/writers the errors attribute must be changeable to be able to switch between different error handling methods during the lifetime of the stream reader/writer. This is current

[issue41465] io.TextIOWrapper.errors not writable

2020-08-06 Thread Walter Dörwald
Walter Dörwald added the comment: I guess that is good enough. "Being changeable" does not necessarily mean "being changeable via attribute assignment". Thanks for your research. Closing the issue as "not a bug". -- resolution: -> not a bug

[issue13830] codecs error handler is called with a UnicodeDecodeError with the same args

2012-02-03 Thread Walter Dörwald
Walter Dörwald added the comment: See this ancient posting about this problem: http://mail.python.org/pipermail/python-dev/2002-August/027661.html (see point 4.). So I guess somebody did finally complain! ;) The error attributes are documented in PEP 293. The existence of the attributes

[issue34443] enum repr should use __qualname__

2019-07-16 Thread Walter Dörwald
Walter Dörwald added the comment: Can we at least get the __qualname__ in exception messages? Currently enum.Enum.__new__() and enum.Enum._missing_() use: raise ValueError("%r is not a valid %s" % (value, cls.__name__)) IMHO this should be: raise ValueError("%r is

[issue34443] enum repr should use __qualname__

2019-07-17 Thread Walter Dörwald
Change by Walter Dörwald : -- pull_requests: +14603 stage: needs patch -> patch review pull_request: https://github.com/python/cpython/pull/14809 ___ Python tracker <https://bugs.python.org/issu

[issue30733] Typo in Document What's New: Calendar

2017-06-26 Thread Walter Dörwald
Changes by Walter Dörwald : -- pull_requests: +2463 ___ Python tracker <http://bugs.python.org/issue30733> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue30733] Typo in Document What's New: Calendar

2017-06-26 Thread Walter Dörwald
Walter Dörwald added the comment: New changeset f5c58c781aa0bb296885baf62f4f39100f2cd93d by Walter Dörwald in branch 'master': bpo-30733: Fix typos in "What's New" entry (GH-2414) https://github.com/python/cpython/commit/f5c58c781aa0bb296885baf62f4f39100f2cd93d ---

[issue30733] Typo in Document What's New: Calendar

2017-06-26 Thread Walter Dörwald
Walter Dörwald added the comment: Should be fixed now. Thanks for noticing it. -- resolution: -> fixed ___ Python tracker <http://bugs.python.org/issu

[issue30733] Typo in Document What's New: Calendar

2017-06-26 Thread Walter Dörwald
Changes by Walter Dörwald : -- stage: -> resolved status: open -> closed ___ Python tracker <http://bugs.python.org/issue30733> ___ ___ Python-bugs-list

[issue2661] Mapping tests cannot be passed by user implementations

2018-12-14 Thread Walter Dörwald
Change by Walter Dörwald : -- pull_requests: +10390 stage: needs patch -> patch review ___ Python tracker <https://bugs.python.org/issue2661> ___ ___ Python-

[issue2661] Mapping tests cannot be passed by user implementations

2018-12-14 Thread Walter Dörwald
Walter Dörwald added the comment: OK, I've created the pull request (11157). -- ___ Python tracker <https://bugs.python.org/issue2661> ___ ___ Python-bugs-l

[issue18059] Add multibyte encoding support to pyexpat

2017-03-27 Thread Walter Dörwald
Walter Dörwald added the comment: This looks to me like a limited reimplementation of the codec machinery. Why not use incremental codecs as a preprocessor? Would this be too slow? -- ___ Python tracker <http://bugs.python.org/issue18

[issue30095] HTMLCalendar allow custom classes

2017-04-19 Thread Walter Dörwald
Walter Dörwald added the comment: IMHO this could all be done by overwriting the relevant methods. But this might be overkill. I think a solution might be to move the CSS classes into class attributes of HTMLCalendar. Customizing the CSS classes would then be done by subclassing HTMLCalendar
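Customizing through a subclass, as proposed here, could look like the following sketch. It uses the cssclasses attribute that HTMLCalendar provides in the standard library; the subclass name and the "day-" prefix are assumptions for illustration.

```python
import calendar


class PrefixedCalendar(calendar.HTMLCalendar):
    # Override the per-weekday CSS classes (Monday first).
    cssclasses = [
        "day-" + name
        for name in ("mon", "tue", "wed", "thu", "fri", "sat", "sun")
    ]


# formatday(day, weekday) renders a single table cell.
cell = PrefixedCalendar().formatday(1, 0)
default_cell = calendar.HTMLCalendar().formatday(1, 0)
```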

[issue30095] HTMLCalendar allow custom classes

2017-04-19 Thread Walter Dörwald
Walter Dörwald added the comment: OK, go ahead. I'm looking forward to what you come up with. -- ___ Python tracker <http://bugs.python.org/issue30095> ___ ___

[issue30095] HTMLCalendar allow custom classes

2017-04-24 Thread Walter Dörwald
Walter Dörwald added the comment: The second link is a 404. For the v1 patch: The variable names are a bit inconsistent: The first uses "classes" all others use "styles". This should be consistent within itself and with the existing code, i.e. "classes" sh

[issue30095] HTMLCalendar allow custom classes

2017-05-05 Thread Walter Dörwald
Walter Dörwald added the comment: See comments on Github -- ___ Python tracker <http://bugs.python.org/issue30095> ___ ___ Python-bugs-list mailing list Unsub

[issue30095] HTMLCalendar allow custom classes

2017-05-31 Thread Walter Dörwald
Walter Dörwald added the comment: See my comments on the pull request: https://github.com/python/cpython/pull/1439 After you address those, IMHO this is ready to be merged. -- ___ Python tracker <http://bugs.python.org/issue30

[issue30095] HTMLCalendar allow custom classes

2017-06-01 Thread Walter Dörwald
Walter Dörwald added the comment: See comments on the pull request. Also it seems that currently the pull request can't be merged because of merge conflicts. -- ___ Python tracker <http://bugs.python.org/is

[issue30095] HTMLCalendar allow custom classes

2017-06-06 Thread Walter Dörwald
Walter Dörwald added the comment: New changeset 8b7a4cc40e9b2f34da94efb75b158da762624015 by Walter Dörwald (Oz N Tiram) in branch 'master': bpo-30095: Make CSS classes used by calendar.HTMLCalendar customizable (GH-1439) https://github.com/python/cpyt

[issue30095] HTMLCalendar allow custom classes

2017-06-06 Thread Walter Dörwald
Walter Dörwald added the comment: Closing the issue. The patch has been merged. -- resolution: -> fixed stage: patch review -> resolved status: open -> closed ___ Python tracker <http://bugs.python.or

[issue33850] Json.dump() bug when using generator

2018-06-13 Thread Walter Dörwald
Walter Dörwald added the comment: The problem here is that StreamArray lies about the length of the iterator. This confuses json.encoder._make_iterencode._iterencode_list(), (which is called by json.dump()), because it first does a check for "if not lst" and then assumes in the lo

[issue33967] functools.singledispatch: Misleading exception when calling without arguments

2018-06-26 Thread Walter Dörwald
New submission from Walter Dörwald : When I call a function decorated with functools.singledispatch without an argument, I get the following: $ python Python 3.6.5 (default, Jun 17 2018, 12:13:06) [GCC 4.2.1 Compatible Apple LLVM 9.1.0 (clang-902.0.39.2)] on darwin Type "help",

[issue34443] enum repr should use __qualname__

2018-08-20 Thread Walter Dörwald
New submission from Walter Dörwald : The __repr__ output of an enum class should use __qualname__ instead of __name__. The following example shows the problem: import enum class X: class I: pass class Y: class I(enum.Enum): pass print(X.I) print(Y.I) This prints: I
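The difference between the two attributes can be seen directly with the classes from the report. This sketch only compares __name__ and __qualname__; it makes no claim about what repr() prints in any particular Python version.

```python
import enum


class X:
    class I:
        pass


class Y:
    class I(enum.Enum):
        pass


# __name__ is ambiguous for nested classes, __qualname__ is not.
plain_names = (X.I.__name__, X.I.__qualname__)
enum_names = (Y.I.__name__, Y.I.__qualname__)
print(plain_names)
print(enum_names)
```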

[issue34443] enum repr should use __qualname__

2018-08-20 Thread Walter Dörwald
Change by Walter Dörwald : -- keywords: +easy ___ Python tracker <https://bugs.python.org/issue34443> ___ ___ Python-bugs-list mailing list Unsubscribe:

[issue34935] Misleading error message in str.decode()

2018-10-08 Thread Walter Dörwald
New submission from Walter Dörwald : The following code issues a misleading exception message: >>> b'\xed\xa0\xbd\xed\xb3\x9e'.decode("utf-8") Traceback (most recent call last): File "", line 1, in UnicodeDecodeError: 'utf-8' codec can't

[issue34935] Misleading error message in str.decode()

2018-10-08 Thread Walter Dörwald
Walter Dörwald added the comment: OK, I see, http://www.unicode.org/versions/Unicode5.2.0/ch03.pdf (Table 3-7 on page 93) states that the only valid 3-bytes UTF-8 sequences starting with the byte 0xED have a value for the second byte in the range 0x80 to 0x9F. 0xA0 is just beyond that range
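The range rule quoted from the Unicode standard can be checked with a short sketch. The byte strings are the one from the report plus one boundary case; the "surrogatepass" error handler is a standard handler and is used here only to show what the rejected bytes would otherwise decode to.

```python
data = b"\xed\xa0\xbd\xed\xb3\x9e"

# Strict decoding rejects the sequence at the first byte 0xED, because the
# following byte 0xA0 is outside the allowed range 0x80..0x9F.
try:
    data.decode("utf-8")
    error = None
except UnicodeDecodeError as exc:
    error = exc

# 0x9F is still inside the allowed range, so this sequence is valid.
boundary = b"\xed\x9f\xbf".decode("utf-8")

# With surrogatepass the same bytes decode to two lone surrogates.
surrogates = data.decode("utf-8", "surrogatepass")
```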

[issue1470548] Bugfix for #1470540 (XMLGenerator cannot output UTF-16)

2012-05-28 Thread Walter Dörwald
Walter Dörwald added the comment: An alternative would be to use an incremental encoder instead of a StreamWriter. (Which is what TextIOWrapper does internally). -- nosy: +doerwalter ___ Python tracker <http://bugs.python.org/issue1470

[issue15408] os.fork/os.popen behaviour change between 2.7 and 3.2

2012-07-20 Thread Walter Dörwald
New submission from Walter Dörwald : The attached script behaves differently on Python 2.7.2 and Python 3.2.3. With Python 2.7 the script runs for ca. 30 seconds and then I get back my prompt. With Python 3.2 the script runs in the background, I get back my prompt immediately and can type

[issue15408] os.fork/os.popen behaviour change between 2.7 and 3.2

2012-07-23 Thread Walter Dörwald
Walter Dörwald added the comment: So is this simply a documentation issue, or can we close the bug as "won't fix"? -- ___ Python tracker <http://bugs.pyt

[issue15278] UnicodeDecodeError when readline in codecs.py

2012-10-10 Thread Walter Dörwald
Walter Dörwald added the comment: > >>> codecs.utf_8_decode('\u20ac'.encode('utf8')[:2]) > ('', 0) > > Oh... codecs.CODEC_decode are incremental decoders? I misunderstood completly > this. No, those functions are not decoders, the

[issue21400] Code coverage documentation is out-of-date.

2014-04-30 Thread Walter Dörwald
Walter Dörwald added the comment: The cronjob that produces this information has been deactivated, because it currently produces broken output. The code for that job is available from here: https://pypi.python.org/pypi/pycoco It would be great to have up to date coverage info for Python again

[issue16577] Suspect test.test_codeccallbacks.test_mutatingdecodehandler

2012-11-29 Thread Walter Dörwald
Walter Dörwald added the comment: True, the second test uses the wrong error handler. And yes, you're correct, bytes are now immutable. And even if I try to decode a bytearray, what the callback gets to see is still an immutable bytes object:: import codecs def mutating(exc):

[issue16585] surrogateescape broken w/ multibytecodecs' encode

2012-12-02 Thread Walter Dörwald
Walter Dörwald added the comment: And returning bytes is documented in PEP 383, as an extension to the PEP 293 machinery: """To convert non-decodable bytes, a new error handler ([2]) "surrogateescape" is introduced, which produces these surrogates. On encoding, the er

[issue16613] ChainMap.new_child could use improvement

2013-01-10 Thread Walter Dörwald
Walter Dörwald added the comment: I'd like to have this feature too. However the code should use d if d is not None else {} instead of d or {} For example I might want to use a subclass of dict (lowerdict) that converts all keys to lowercase. When I use an empty lowerdict in new_
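The difference between the two expressions shows up as soon as an empty mapping is passed in. In this sketch, LowerDict stands in for the lowerdict subclass mentioned above, and the two helper functions are hypothetical stand-ins for the logic inside new_child().

```python
class LowerDict(dict):
    """A dict subclass that converts all keys to lowercase on assignment."""

    def __setitem__(self, key, value):
        super().__setitem__(key.lower(), value)


def child_with_or(d=None):
    # An empty mapping is falsy, so it is silently replaced by a plain dict.
    return d or {}


def child_with_none_check(d=None):
    # Only a missing argument is replaced; an empty mapping is kept.
    return d if d is not None else {}


empty = LowerDict()
from_or = child_with_or(empty)
from_check = child_with_none_check(empty)
```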

[issue19585] Frame annotation

2013-11-14 Thread Walter Dörwald
New submission from Walter Dörwald: This patch adds frame annotations, i.e. it adds an attribute f_annotation to frame objects, a decorator to set this attribute on exceptions and extensions to the traceback machinery that display the annotation in the traceback. -- components

[issue19585] Frame annotation

2013-11-14 Thread Walter Dörwald
Walter Dörwald added the comment: See http://bugs.python.org/issue18861 and the discussion started here: https://mail.python.org/pipermail/python-dev/2013-November/130155.html. Basically it allows to add context information to a traceback without changing the type of the exception. In the

[issue19585] Frame annotation

2013-11-25 Thread Walter Dörwald
Walter Dörwald added the comment: Here is a new version of the patch. The annotation is done on the code object instead of on the frame object. This avoids two problems: There is no runtime overhead, as the decorator returns the original function and no additional frames show up in the

[issue19585] Frame annotation

2013-11-25 Thread Walter Dörwald
Walter Dörwald added the comment: Do you have an example where code objects are shared? We could attach the annotation formatter to the function object, but unfortunately the function object is not accessible in the traceback. Note the co_annotation is not the annotation string, rather it is

[issue19834] Unpickling exceptions pickled by Python 2

2013-11-29 Thread Walter Dörwald
New submission from Walter Dörwald: Exception objects that have been pickled with Python 2 can not be unpickled with Python 3, even when fix_imports=True is specified: $ python2.7 Python 2.7.2 (default, Aug 30 2011, 11:04:13) [GCC 3.3.5 (Debian 1:3.3.5-13)] on linux2 Type "

[issue19834] Unpickling exceptions pickled by Python 2

2013-11-30 Thread Walter Dörwald
Walter Dörwald added the comment: OK, here is a patch. Instead of mapping the exceptions module to builtins, it does the mapping for each exception class separately. I've excluded StandardError, because I think there's no appropriate equivalent in Python 3. -- keywords: +p

[issue19834] Unpickling exceptions pickled by Python 2

2013-12-01 Thread Walter Dörwald
Walter Dörwald added the comment: Here's an updated version of the patch, addressing most of Alexandre's comments. -- Added file: http://bugs.python.org/file32918/python-2-exception-pickling-2.diff ___ Python tracker <http://bu

[issue19834] Unpickling exceptions pickled by Python 2

2013-12-02 Thread Walter Dörwald
Changes by Walter Dörwald : -- resolution: -> fixed ___ Python tracker <http://bugs.python.org/issue19834> ___ ___ Python-bugs-list mailing list Unsubscrib

[issue12808] Coverage of codecs.py

2014-06-28 Thread Walter Dörwald
Walter Dörwald added the comment: The requirement that getstate() returns a (buffer, int) tuple has to do with the fact that for text streams seek() and tell() somehow have to take the state of the codec into account. See _pyio.TextIOWrapper.(seek|tell|_pack_cookie|_unpack_cookie). However I
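A short sketch of the (buffer, int) state described above, using the UTF-8 incremental decoder: the pending bytes of an incomplete sequence are part of the state and can be transferred to another decoder with setstate().

```python
import codecs

make_decoder = codecs.getincrementaldecoder("utf-8")

first = make_decoder()
partial = first.decode(b"\xe2\x82")  # incomplete euro sign, nothing decoded yet
state = first.getstate()             # (pending bytes, integer flags)

# A fresh decoder can resume from the saved state.
second = make_decoder()
second.setstate(state)
rest = second.decode(b"\xac")
```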

[issue2661] Mapping tests cannot be passed by user implementations

2014-06-29 Thread Walter Dörwald
Walter Dörwald added the comment: Here is a patch that implements suggestion 2 and 3. -- keywords: +patch Added file: http://bugs.python.org/file35800/mapping-tests.diff ___ Python tracker <http://bugs.python.org/issue2

[issue19806] smtpd crashes when a multi-byte UTF-8 sequence is split between consecutive data packets

2014-07-11 Thread Walter Dörwald
Walter Dörwald added the comment: I don't know anything about SMTP, but would it make sense to use an incremental decoder for decoding UTF-8? -- nosy: +doerwalter ___ Python tracker <http://bugs.python.org/is

[issue21968] 'abort' object is not callable

2014-07-15 Thread Walter Dörwald
Walter Dörwald added the comment: The problem seems to be in that line: except imaplib.IMAP4_SSL.abort, imaplib.IMAP4.abort: This does *not* catch both exception classes, but catches only IMAP4_SSL.abort and stores the exception object in imaplib.IMAP4.abort. What you want is: except
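The snippet is cut off before the corrected clause, so the following is only a sketch of the general Python 3 form for catching several exception classes at once: a parenthesized tuple, optionally followed by "as". The two exception classes are hypothetical stand-ins, not the imaplib classes.

```python
class FirstAbort(Exception):
    pass


class SecondAbort(Exception):
    pass


def classify(exc_to_raise):
    try:
        raise exc_to_raise
    except (FirstAbort, SecondAbort) as exc:
        # The tuple catches either class; exc is the exception instance.
        return type(exc).__name__


first_result = classify(FirstAbort("connection lost"))
second_result = classify(SecondAbort("connection lost"))
```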

[issue19100] Use backslashreplace in pprint

2013-12-11 Thread Walter Dörwald
Walter Dörwald added the comment: This is not the fault of pprint. IMHO it doesn't make sense to fix anything here, at least not for pprint specifically. print() has the same "problem": $ LANG= ./python -c

[issue19100] Use backslashreplace in pprint

2013-12-14 Thread Walter Dörwald
Walter Dörwald added the comment: sys.displayhook doesn't fail, because it uses the backslashreplace error handler, and for sys.displayhook that's OK, because it's only used for screen output and there some output is better than no output. However print and pprint.pprint mi
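The effect of the backslashreplace error handler mentioned here can be seen on a small sample string (made up for this sketch): unencodable characters are turned into escape sequences instead of raising an exception.

```python
text = "caf\u00e9 \u20ac"

# Strict encoding to ASCII fails on the first non-ASCII character.
try:
    text.encode("ascii")
    strict_failed = False
except UnicodeEncodeError:
    strict_failed = True

# backslashreplace produces some output instead of no output.
escaped = text.encode("ascii", "backslashreplace")
```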

[issue20132] Many incremental codecs don’t handle fragmented data

2014-01-10 Thread Walter Dörwald
Walter Dörwald added the comment: The best solution IMHO would be to implement real incremental codecs for all of those. Maybe iterencode() with an empty iterator should never call encode()? (But IMHO it would be better to document that iterencode()/iterdecode() should only be used with

[issue13881] Stream encoder for zlib_codec doesn't use the incremental encoder

2014-01-10 Thread Walter Dörwald
Walter Dörwald added the comment: The stream part of the codecs isn't used that much in Python 3 any more, so I'm not sure if this is worth fixing. -- ___ Python tracker <http://bugs.python.o
