[issue9985] difflib.SequenceMatcher has slightly buggy and undocumented caching behavior

2010-10-01 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: Here's a test case and a fix for get_matching_blocks() to return the same content on subsequent calls. -- keywords: +patch nosy: +christoph Added file: http://bugs.python.org/file19084/get_matching_blocks.diff

[issue9985] difflib.SequenceMatcher has slightly buggy and undocumented caching behavior

2010-10-01 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: BTW, here's the commit that broke the behavior in the first place: http://svn.python.org/view/python/trunk/Lib/difflib.py?r1=54230r2=59907 -- ___ Python tracker rep...@bugs.python.org http

[issue6412] Titlecase as defined in Unicode Case Mappings not followed

2010-08-04 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: @Terry How is the behavior changed? To me it seems the same to as initially reported. The results are consistent but nonetheless wrong. It's not about whether your agree with the result, but rather about following the Unicode standard

[issue1602] windows console doesn't print utf8 (Py30a2)

2010-06-19 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: Will this bug be tackled or Python2.7? And is there a way to get hold of the access denied error? Here are my steps to reproduce: I started the console with cmd /u /k chcp 65001

[issue8192] SQLite3 PRAGMA table_info doesn't respect database on Win32

2010-03-21 Thread Christoph Burgmer
New submission from Christoph Burgmer cburg...@ira.uka.de: 'PRAGMA database.table_info(SOME_TABLE_NAME)' will report table metadata for the given database. The main database called 'main', can be extended by attaching further databases via 'ATTACH DATABASE'. The above PRAGMA should respect

[issue6412] Titlecase as defined in Unicode Case Mappings not followed

2009-09-14 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: Implementing full patch solving it the old way (UTR#21). The correct way for the latest Unicode version would be to implement the word breaking algorithm described in (UAX#29) [1] first. [1] http://www.unicode.org/reports/tr29

[issue6412] Titlecase as defined in Unicode Case Mappings not followed

2009-09-14 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: I should add that I didn't include the two header files generated by Tools/unicode/makeunicodedata.py -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org/issue6412

[issue6656] locale.format_string fails on escaped percentage

2009-08-06 Thread Christoph Burgmer
New submission from Christoph Burgmer cburg...@ira.uka.de: locale.format_string doesn't return same result as a normal string % format directive, but raises a TypeError. See attached test case for Python 2.6. locale.format_string('%f%%', 1.0) Traceback (most recent call last): File stdin

[issue6656] locale.format_string fails on escaped percentage

2009-08-06 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: This patch removes '%%' entities from the regex results and only replaces other matches with '%s' which later then get replaced by localized versions so that escaped percentage entities don't show up in localized parsing anymore. Removing

[issue6625] UnicodeEncodeError on pydoc's CLI

2009-08-05 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: Here is a diff for test/test_pydoc.py (against Python2.6) which though doesn't trigger due to how Python handles output encoding. This test here will pass, but pydoc will still fail: $ pydoc test/pydoc_mod.py /dev/null Traceback (most

[issue6625] UnicodeEncodeError on pydoc's CLI

2009-08-02 Thread Christoph Burgmer
New submission from Christoph Burgmer cburg...@ira.uka.de: pydoc fails with a UnicodeEncodeError for properly specified Unicode docstrings (u...) on the command line interface. See attached patch that encodes the output with the system's encoding. -- components: Extension Modules files

[issue6412] Titlecase as defined in Unicode Case Mappings not followed

2009-07-16 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: Casing algorithms should follow Section 3.13 Default Case Algorithms in the standard itself, not UTR#21. See http://www.unicode.org/Public/5.2.0/ucd/DerivedCoreProperties-5.2.0d11. Unicode 5.2. A nice mail on the Unicode mail list has

[issue6412] Titlecase as defined in Unicode Case Mappings not followed

2009-07-03 Thread Christoph Burgmer
New submission from Christoph Burgmer cburg...@ira.uka.de: Titlecase, i.e. istitle() and title(), is buggy when the string includes combining diacritical marks. u'H\u0301ngh'.istitle() False u'H\u0301ngh'.title() u'H\u0301Ngh' The string given already is in titlecase so that the following

[issue6412] Titlecase as defined in Unicode Case Mappings not followed

2009-07-03 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: Adding a incomplete patch in need of a function Py_UNICODE_ISCASEIGNORABLE defining the case-ignorable class. I don't want to touch capitalize() as I don't fully understand the semantics, where it is different to title(). It seems though

[issue1293741] doctest runner cannot handle non-ascii characters

2009-07-01 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: My last patch only changed the encoding used in DocTestRunner.run(). This new patch will apply the same to DocTestCase.runTest(). -- Added file: http://bugs.python.org/file14422/doctest.unicode.patch

[issue3955] maybe doctest doesn't understand unicode_literals?

2009-07-01 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: JFTR: To yield the results of my last comment, you need to apply the patch posted in http://bugs.python.org/issue1293741 -- ___ Python tracker rep...@bugs.python.org http://bugs.python.org

[issue3955] maybe doctest doesn't understand unicode_literals?

2009-06-30 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: This problem seems more severe as the appended test case shows. That gives me: Expected: u'ī' Got: u'\u012b' Both literals are the same. Unicode literals in doc strings are not treated as other escaped characters: repr(r'\n

[issue1293741] doctest runner cannot handle non-ascii characters

2009-06-30 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: See attached patch which works for error reporting and verbose output. -- keywords: +patch nosy: +christoph Added file: http://bugs.python.org/file14407/doctest.unicode.patch ___ Python tracker

[issue3955] maybe doctest doesn't understand unicode_literals?

2009-06-29 Thread Christoph Burgmer
Christoph Burgmer cburg...@ira.uka.de added the comment: OutputChecker.check_output() seems to be responsible for comparing 'example.want' and 'got' literals and this is obviously done literally. So as u'1' is different to '1' this is reflected in the result. This gets more complicated

[issue2517] Error when printing an exception containing a Unicode string

2008-04-02 Thread Christoph Burgmer
Christoph Burgmer [EMAIL PROTECTED] added the comment: JFTR: print unicode(e.message).encode(utf-8) only works for Python 2.5, not downwards. __ Tracker [EMAIL PROTECTED] http://bugs.python.org/issue2517

[issue2517] Error when printing an exception containing a Unicode string

2008-03-31 Thread Christoph Burgmer
Christoph Burgmer [EMAIL PROTECTED] added the comment: To be more precise: I see no way to convert the encapsulated non-ASCII data from the string in an easy way. Taking e from my last post none of the following will work: str(e) # UnicodeDecodeError e.__str__() # UnicodeDecodeError e

[issue2517] Error when printing an exception containing a Unicode string

2008-03-31 Thread Christoph Burgmer
Christoph Burgmer [EMAIL PROTECTED] added the comment: Thanks, this does work. But, where can I find the piece of information you just gave to me in the docs? I couldn't find any interface definition for Exceptions. Further more will this be regarded as a bug? From [1] I understand

[issue2517] Error when printing an exception containing a Unicode string

2008-03-31 Thread Christoph Burgmer
Christoph Burgmer [EMAIL PROTECTED] added the comment: Though I welcome the reopening of the bug for Python 3.0 I must say that plans of not fixing a core element rather surprises me. I never believed Python to be a programming language with good Unicode integration. Several points were

[issue2517] Error when printing an exception containing a Unicode string

2008-03-30 Thread Christoph Burgmer
New submission from Christoph Burgmer [EMAIL PROTECTED]: Python seems to have problems when an exception is thrown that contains non-ASCII text as a message and is converted to a string. try: ... raise Exception(u'Error when printing ü') ... except Exception, e: ... print e