[Python-ideas] Re: Python 4000: Have stringlike objects provide sequence views rather than being sequences

Random832 Sat, 26 Oct 2019 20:02:00 -0700

On Sat, Oct 26, 2019, at 20:26, David Mertz wrote:
> Absolutely, utf-8 is a wonderful encoding. And indeed, worst case is 
> the same storage requirement as utf-16 or utf-32. For O(1) random 
> access into all strings, we have to eat 32-bits per character, one way 
> or the other, but of course there are space/speed trade-offs one could 
> make for intermediate behavior.


A string representation considering of (say) a UTF-8 string, plus an auxiliary 
list of byte indices of, say, 256-codepoint-long chunks [along with perhaps a 
flag to say that the chunk is all-ASCII or not] would provide O(1) random 
access, though, of course, despite both being O(1), "single index access" vs 
"single index access then either another index access or up to 256 
iterate-forward operations" aren't *really* the same speed.
_______________________________________________
Python-ideas mailing list -- [email protected]
To unsubscribe send an email to [email protected]
https://mail.python.org/mailman3/lists/python-ideas.python.org/
Message archived at 
https://mail.python.org/archives/list/[email protected]/message/N4ONH5O443FWB7M7E2FF24QR32HXAPAD/
Code of Conduct: http://python.org/psf/codeofconduct/

[Python-ideas] Re: Python 4000: Have stringlike objects provide sequence views rather than being sequences

Reply via email to