[issue14654] Faster utf-8 decoding

2012-05-07 Thread Éric Araujo

Changes by Éric Araujo mer...@netwok.org:


--
nosy: +eric.araujo

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-07 Thread Serhiy Storchaka

Serhiy Storchaka storch...@gmail.com added the comment:

See issue14738 for advanced optimization.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-06 Thread Jesús Cea Avión

Jesús Cea Avión j...@jcea.es added the comment:

I understand Martin point, but I think 95% of issues in the bugtracker are 
work in progress, mine included.

Maybe the issue is that Serhiy hasn't made a concrete proposal to be tested  
integrated. It seems to be more an exploratory work.

I am in the nosy list because I am interested in this work.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-06 Thread Serhiy Storchaka

Serhiy Storchaka storch...@gmail.com added the comment:

Martin, sorry to have wasted your time. I understand that you are busy,
so I'm not too worried not receiving a feedback for ten days.

 At this point, it appears that you don't intend to submit any of these 
 patches for inclusion into Python.

I'm at a loss. What causes such an impression? I quickly reacting to the
comments, responding by the new patches. I take into account your
comments, and if I do not agree, reinforce my opinion by the benchmarks.
I suggest only cleaned and well-tested code. I provide tools for
benchmarking. What am I doing wrong? May be my bad English has been
misunderstood?

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-05 Thread Serhiy Storchaka

Serhiy Storchaka storch...@gmail.com added the comment:

Well, it seems, 64-bit processors are smart enough to not feel the need
for this optimization. On 32-bit platforms I see a noticeable increase
in speed.

I am now working on a more advanced optimization, which now shows a gain
of +20-60% compared with the previous patches, but I hope to increase a
gain by +50%-100%.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-05 Thread Martin v . Löwis

Martin v. Löwis mar...@v.loewis.de added the comment:

I'll be closing this issue at this point. Serhiy: I don't think the bug tracker 
should be used to evolve work in progress (except when responding to reviews 
received). Use a Mercurial clone for that instead. By posting a patch here, you 
are requesting that it be reviewed and considered - please understand that you 
consume a lot of people's time by such a posting.

At this point, it appears that you don't intend to submit any of these patches 
for inclusion into Python. If you ever do want to contribute something in this 
area, please create a new issue.

--
resolution:  - rejected
status: open - closed

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-05 Thread Antoine Pitrou

Antoine Pitrou pit...@free.fr added the comment:

 I'll be closing this issue at this point. Serhiy: I don't think the
 bug tracker should be used to evolve work in progress (except when
 responding to reviews received). Use a Mercurial clone for that
 instead. By posting a patch here, you are requesting that it be
 reviewed and considered - please understand that you consume a lot of
 people's time by such a posting.

That's not very nice. If Serhiy wants feedback on his work, he
definitely has to post *somewhere*. The bug tracker sounds like a
reasonable place (certainly more reasonable than python-dev).

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-05 Thread Martin v . Löwis

Martin v. Löwis mar...@v.loewis.de added the comment:

 That's not very nice. If Serhiy wants feedback on his work, he
 definitely has to post *somewhere*. The bug tracker sounds like a
 reasonable place (certainly more reasonable than python-dev).

I completely disagree (and I really tried to be nice).

It is my utmost belief that the tracker must not be used for
work-in-progress. For any open issue, numerous people review the
issue, and even if they spend only a few minutes, this easily adds
up to a lot of wasted time if there isn't anything to be done about
an issue.

OTOH, discussing it on python-dev indeed seems more appropriate:
even though the readership is larger, people know that they can safely
skip over messages that clearly don't need their attention. So if
Serhiy posts a message titled UTF-8 performance, people will hit
the delete button very quickly if they are not interested.

However, it would really be best in this case if Serhiy takes a step
back, and analyzes the performance of the current decoder carefully,
then proposes a patch which undoubtedly improves the performance and
is meanwhile also maintainable.

He may come to the conclusion that further improvement isn't really
possible or reasonable, in which case it would be good if he posted
his findings to python-dev.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-04 Thread Serhiy Storchaka

Serhiy Storchaka storch...@gmail.com added the comment:

 title: More fast utf-8 decoding - Faster utf-8 decoding

Éric, there is already an issue (#4868) with this title.

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-04 Thread Martin v . Löwis

Martin v. Löwis mar...@v.loewis.de added the comment:

There is nothing wrong with two issues having the same title. Of course, it 
would be best if the title reflected the *actual* defect or change, such as 
specialize UTF-8 decoding by character width, or some such.

In any case, the title change is desirable since the original title was 
ungrammatical. If you wanted to point out that this really is an augmented, 
escalated rise, then Even faster utf-8 decoded, amazingly faster UTF-8 
decoding, or unbelievably faster utf-8 decoding could have worked :-)

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-04 Thread Serhiy Storchaka

Serhiy Storchaka storch...@gmail.com added the comment:

Thank you, Martin, this is what I had in mind. Lost in translation. ;)

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-04 Thread Antoine Pitrou

Antoine Pitrou pit...@free.fr added the comment:

64-bit Linux, Intel Core i5-2500K CPU @ 3.30GHz:

  vanilla 3.3   patch 2 patch 3

utf-8 'A'*1   6931 (+3%)7115 (+0%)  7117
utf-8 'A'*+'\x80' 2347 (+1%)2410 (-2%)  2360
utf-8 'A'*+'\u0100'   2279 (+1%)2282 (+1%)  2310
utf-8 'A'*+'\u8000'   2264 (+2%)2275 (+1%)  2300
utf-8 'A'*+'\U0001'   2351 (+0%)2283 (+3%)  2359
utf-8 '\x80'*1516 (+8%) 558 (+0%)   559
utf-8   '\x80'+'A'*   859 (+0%) 868 (-1%)   860
utf-8 '\x80'*+'\u0100'526 (+6%) 558 (+0%)   558
utf-8 '\x80'*+'\u8000'535 (+4%) 558 (+0%)   558
utf-8 '\x80'*+'\U0001'525 (+6%) 559 (-0%)   558
utf-8 '\u0100'*1  517 (+6%) 548 (+0%)   548
utf-8   '\u0100'+'A'* 818 (+0%) 820 (+0%)   821
utf-8   '\u0100'+'\x80'*  517 (+6%) 548 (+0%)   548
utf-8 '\u0100'*+'\u8000'  525 (+4%) 548 (+0%)   548
utf-8 '\u0100'*+'\U0001'  517 (+6%) 549 (+0%)   549
utf-8 '\u8000'*1  490 (-8%) 433 (+4%)   451
utf-8   '\u8000'+'A'* 818 (+0%) 819 (+0%)   821
utf-8   '\u8000'+'\x80'*  529 (+4%) 548 (+0%)   548
utf-8   '\u8000'+'\u0100'*529 (+4%) 548 (+0%)   548
utf-8 '\u8000'*+'\U0001'  470 (-4%) 451 (+0%)   451
utf-8 '\U0001'*1  554 (-18%)427 (+6%)   453
utf-8   '\U0001'+'A'* 938 (+0%) 927 (+2%)   941
utf-8   '\U0001'+'\x80'*  572 (+4%) 595 (+0%)   595
utf-8   '\U0001'+'\u0100'*571 (+4%) 595 (+0%)   595
utf-8   '\U0001'+'\u8000'*503 (-4%) 481 (+0%)   482

--

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com



[issue14654] Faster utf-8 decoding

2012-05-03 Thread Éric Araujo

Changes by Éric Araujo mer...@netwok.org:


--
title: More fast utf-8 decoding - Faster utf-8 decoding

___
Python tracker rep...@bugs.python.org
http://bugs.python.org/issue14654
___
___
Python-bugs-list mailing list
Unsubscribe: 
http://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com