Christoph Burgmer cburg...@ira.uka.de added the comment:
Here's a test case and a fix for get_matching_blocks() to return the same
content on subsequent calls.
--
keywords: +patch
nosy: +christoph
Added file: http://bugs.python.org/file19084/get_matching_blocks.diff
Christoph Burgmer cburg...@ira.uka.de added the comment:
BTW, here's the commit that broke the behavior in the first place:
http://svn.python.org/view/python/trunk/Lib/difflib.py?r1=54230r2=59907
--
___
Python tracker rep...@bugs.python.org
http
Christoph Burgmer cburg...@ira.uka.de added the comment:
@Terry
How is the behavior changed? To me it seems the same to as initially reported.
The results are consistent but nonetheless wrong. It's not about whether your
agree with the result, but rather about following the Unicode standard
Christoph Burgmer cburg...@ira.uka.de added the comment:
Will this bug be tackled or Python2.7?
And is there a way to get hold of the access denied error?
Here are my steps to reproduce:
I started the console with cmd /u /k chcp 65001
New submission from Christoph Burgmer cburg...@ira.uka.de:
'PRAGMA database.table_info(SOME_TABLE_NAME)' will report table metadata for
the given database. The main database called 'main', can be extended by
attaching further databases via 'ATTACH DATABASE'. The above PRAGMA should
respect
Christoph Burgmer cburg...@ira.uka.de added the comment:
Implementing full patch solving it the old way (UTR#21).
The correct way for the latest Unicode version would be to implement
the word breaking algorithm described in (UAX#29) [1] first.
[1] http://www.unicode.org/reports/tr29
Christoph Burgmer cburg...@ira.uka.de added the comment:
I should add that I didn't include the two header files generated by
Tools/unicode/makeunicodedata.py
--
___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue6412
New submission from Christoph Burgmer cburg...@ira.uka.de:
locale.format_string doesn't return same result as a normal
string % format
directive, but raises a TypeError. See attached test case for Python
2.6.
locale.format_string('%f%%', 1.0)
Traceback (most recent call last):
File stdin
Christoph Burgmer cburg...@ira.uka.de added the comment:
This patch removes '%%' entities from the regex results and only
replaces other matches with '%s' which later then get replaced by
localized versions so that escaped percentage entities don't show up in
localized parsing anymore.
Removing
Christoph Burgmer cburg...@ira.uka.de added the comment:
Here is a diff for test/test_pydoc.py (against Python2.6) which though
doesn't trigger due to how Python handles output encoding. This test
here will pass, but pydoc will still fail:
$ pydoc test/pydoc_mod.py /dev/null
Traceback (most
New submission from Christoph Burgmer cburg...@ira.uka.de:
pydoc fails with a UnicodeEncodeError for properly specified Unicode
docstrings (u...) on the command line interface.
See attached patch that encodes the output with the system's encoding.
--
components: Extension Modules
files
Christoph Burgmer cburg...@ira.uka.de added the comment:
Casing algorithms should follow Section 3.13 Default Case Algorithms
in the standard itself, not UTR#21.
See
http://www.unicode.org/Public/5.2.0/ucd/DerivedCoreProperties-5.2.0d11.
Unicode 5.2. A nice mail on the Unicode mail list has
New submission from Christoph Burgmer cburg...@ira.uka.de:
Titlecase, i.e. istitle() and title(), is buggy when the string
includes combining diacritical marks.
u'H\u0301ngh'.istitle()
False
u'H\u0301ngh'.title()
u'H\u0301Ngh'
The string given already is in titlecase so that the following
Christoph Burgmer cburg...@ira.uka.de added the comment:
Adding a incomplete patch in need of a function
Py_UNICODE_ISCASEIGNORABLE defining the case-ignorable class.
I don't want to touch capitalize() as I don't fully understand the
semantics, where it is different to title(). It seems though
Christoph Burgmer cburg...@ira.uka.de added the comment:
My last patch only changed the encoding used in DocTestRunner.run().
This new patch will apply the same to DocTestCase.runTest().
--
Added file: http://bugs.python.org/file14422/doctest.unicode.patch
Christoph Burgmer cburg...@ira.uka.de added the comment:
JFTR: To yield the results of my last comment, you need to apply the
patch posted in http://bugs.python.org/issue1293741
--
___
Python tracker rep...@bugs.python.org
http://bugs.python.org
Christoph Burgmer cburg...@ira.uka.de added the comment:
This problem seems more severe as the appended test case shows.
That gives me:
Expected:
u'ī'
Got:
u'\u012b'
Both literals are the same.
Unicode literals in doc strings are not treated as other escaped
characters:
repr(r'\n
Christoph Burgmer cburg...@ira.uka.de added the comment:
See attached patch which works for error reporting and verbose output.
--
keywords: +patch
nosy: +christoph
Added file: http://bugs.python.org/file14407/doctest.unicode.patch
___
Python tracker
Christoph Burgmer cburg...@ira.uka.de added the comment:
OutputChecker.check_output() seems to be responsible for comparing
'example.want' and 'got' literals and this is obviously done literally.
So as u'1' is different to '1' this is reflected in the result.
This gets more complicated
Christoph Burgmer [EMAIL PROTECTED] added the comment:
JFTR:
print unicode(e.message).encode(utf-8)
only works for Python 2.5, not downwards.
__
Tracker [EMAIL PROTECTED]
http://bugs.python.org/issue2517
Christoph Burgmer [EMAIL PROTECTED] added the comment:
To be more precise: I see no way to convert the encapsulated non-ASCII
data from the string in an easy way.
Taking e from my last post none of the following will work:
str(e) # UnicodeDecodeError
e.__str__() # UnicodeDecodeError
e
Christoph Burgmer [EMAIL PROTECTED] added the comment:
Thanks, this does work.
But, where can I find the piece of information you just gave to me in
the docs? I couldn't find any interface definition for Exceptions.
Further more will this be regarded as a bug?
From [1] I understand
Christoph Burgmer [EMAIL PROTECTED] added the comment:
Though I welcome the reopening of the bug for Python 3.0 I must say
that plans of not fixing a core element rather surprises me.
I never believed Python to be a programming language with good Unicode
integration. Several points were
New submission from Christoph Burgmer [EMAIL PROTECTED]:
Python seems to have problems when an exception is thrown that
contains non-ASCII text as a message and is converted to a string.
try:
... raise Exception(u'Error when printing ü')
... except Exception, e:
... print e
24 matches
Mail list logo