[issue14874] Faster charmap decoding

2012-06-16 Thread Antoine Pitrou

Changes by Antoine Pitrou pit...@free.fr:


--
nosy: +loewis

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14874
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14874] Faster charmap decoding

2012-06-16 Thread Antoine Pitrou

Antoine Pitrou pit...@free.fr added the comment:

Thank you for the patch! Now pushed to 3.3.

--
resolution:  -> fixed
stage: patch review -> committed/rejected
status: open -> closed




[issue14874] Faster charmap decoding

2012-06-16 Thread Roundup Robot

Roundup Robot devn...@psf.upfronthosting.co.za added the comment:

New changeset 8f3a5308f50b by Antoine Pitrou in branch 'default':
Issue #14874: Restore charmap decoding speed to pre-PEP 393 levels.
http://hg.python.org/cpython/rev/8f3a5308f50b

--
nosy: +python-dev




[issue14874] Faster charmap decoding

2012-06-15 Thread Antoine Pitrou

Changes by Antoine Pitrou pit...@free.fr:


--
stage:  -> patch review




[issue14874] Faster charmap decoding

2012-05-21 Thread Serhiy Storchaka

New submission from Serhiy Storchaka storch...@gmail.com:

Charmap decoders are not as important as the UTF decoders, but they are still
widely used. In Python 3.3, with PEP 393, they became about 4x slower. The
proposed patch restores the performance.

The patch optimizes only the most common case, in which the decoder is
specified by a UCS2 table with length = 256. Map-based decoders are translated
to table-based ones. UCS1 tables are widened to UCS2 by adding a fake 257th
character.
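
For readers unfamiliar with the two decoder forms, here is a minimal
Python-level sketch of the difference between a table-based and a map-based
charmap decoder. It is only an illustration, not the C code in the patch.
It assumes CPython's `encodings.cp1251` module and its `decoding_table`
attribute; `map_to_table` is a hypothetical helper written for this example.

```python
import codecs
import encodings.cp1251 as cp1251

# Table-based decoder: a str in which index i holds the character for byte i.
table = cp1251.decoding_table

# Map-based decoder: a dict from byte value to code point.
# Built from the table here purely for illustration.
mapping = {i: ord(ch) for i, ch in enumerate(table)}


def map_to_table(mapping, size=256):
    """Hypothetical helper: flatten a byte -> code point map into a table.

    Missing entries become U+FFFE, which charmap tables use to mark
    an undefined byte.
    """
    return ''.join(chr(mapping.get(i, 0xFFFE)) for i in range(size))


# One ASCII byte, one byte mapped into the Latin-1 range,
# and one byte mapped above U+00FF.
data = b'A\xa0\x80'

via_table, consumed = codecs.charmap_decode(data, 'strict', table)
via_map, _ = codecs.charmap_decode(data, 'strict', mapping)

print(ascii(via_table), consumed)
print(via_table == via_map == data.decode('cp1251'))
```

Both forms give the same result; the table form only needs an index lookup
per byte, whereas the map form needs a dictionary lookup per byte.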

Benchmark results:

                      3.2          3.3 (vanilla)   3.3 (patched)

cp1251 'A'*1          111 (+10%)   31 (+294%)      122
cp1251 '\xa0'*1       111 (+8%)    29 (+314%)      120
cp1251 '\u0402'*1     111 (+6%)    25 (+372%)      118
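
The percentages in the table are consistent with the gain of the patched
build over each baseline (for example, 122 against 31 is about +294%). The
sketch below checks that arithmetic and shows one way such a measurement
could be taken with `timeit`. It is not the attached charmapdecodebench.py,
whose contents are not shown in this thread. The names `gain_percent` and
`decode_rate`, the repeat count `N`, and the MB/s unit are all assumptions
made for this example.

```python
import timeit


def gain_percent(baseline, patched):
    """Percentage gain of the patched figure over a baseline figure."""
    return round((patched / baseline - 1) * 100)


def decode_rate(encoding, text, number=200):
    """Rough decoding throughput in MB/s (unit chosen for this sketch)."""
    data = text.encode(encoding)
    # Take the best of three runs to reduce timing noise.
    seconds = min(timeit.repeat(lambda: data.decode(encoding),
                                number=number, repeat=3))
    return len(data) * number / seconds / 1e6


N = 10000  # assumed repeat count, not taken from the report

for sample in ('A', '\xa0', '\u0402'):
    rate = decode_rate('cp1251', sample * N)
    print('cp1251 %-10s %8.0f MB/s' % (ascii(sample), rate))

print(gain_percent(31, 122), gain_percent(111, 122))
```

Absolute numbers depend on the machine and build, so only the ratios between
builds are meaningful.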

--
components: Interpreter Core, Unicode
files: decode_charmap.patch
keywords: patch
messages: 161301
nosy: ezio.melotti, haypo, lemburg, pitrou, storchaka
priority: normal
severity: normal
status: open
title: Faster charmap decoding
type: performance
versions: Python 3.3
Added file: http://bugs.python.org/file25664/decode_charmap.patch




[issue14874] Faster charmap decoding

2012-05-21 Thread Serhiy Storchaka

Changes by Serhiy Storchaka storch...@gmail.com:


Added file: http://bugs.python.org/file25665/charmapdecodebench.py




[issue14874] Faster charmap decoding

2012-05-21 Thread Serhiy Storchaka

Changes by Serhiy Storchaka storch...@gmail.com:


Added file: http://bugs.python.org/file25666/bench-diff.py
