Re: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)

Steven D'Aprano Thu, 05 Jul 2012 08:59:47 -0700

anatoly techtonik wrote:

On Wed, Jul 4, 2012 at 9:31 PM, Terry Reedy <tjre...@udel.edu> wrote:

A sliding window for a generic iterable requires a deque or ring buffer
approach that is quite different from the zip-longest -- grouper approach.


That's why I'd like to drastically reduce the scope of proposal.
itertools doesn't seem to be the best place anymore. How about
sequence method?

   string.chunks(size)  -> ABC DEF G
   list.chunks(size) -> [A,B,C], [C,D,E],[G]

-1

This is a fairly trivial problem to solve, and there are many variations onit. Many people will not find the default behaviour helpful, and will need towrite their own. Why complicate the API for all sequence types with this?

I don't believe that we should enshrine one variation as a built-in method,without any evidence that it is the most useful or common variation. Even ifthere is one variation far more useful than the others, that doesn'tnecessarily mean we ought to make it a builtin method unless it is afundamental sequence operation, has wide applicability, and is genuinely hardto write. I don't believe chunking meets *any* of those criteria, let aloneall three.


Not every six line function needs to be a builtin.

I believe that splitting a sequence (or a string) into fixed-size chunks ismore of a programming exercise problem than a genuinely useful tool. That doesnot mean that there is never any real use-cases for splitting into fixed-sizechunks, only that this is the function that *seems* more useful in theory thanit turns out in practice.

Compare this with more useful sequence/iteration tools, like (say) zip. Youcan hardly write a hundred lines of code without using zip at least once. ButI bet you can write tens of thousands of lines of code without needing tosplit sequences into fixed chunks like this.

Besides, the name "chunks" is more general than how you are using it. Forexample, I consider chunking to be splitting a sequence up at a variousdelimiters or separators, not at fixed character positions. E.g. "the thirdword of item two of the fourth line" is a chunk.

This fits more with the non-programming use of the term chunk or chunking, andhas precedence in Apple's Hypertalk language, which literally allowed you totalk about words, items and lines of text, each of which are described as chunks.

This might be a good candidate for a utility module made up of assorted usefulfunctions, but not for the string and sequence APIs.




--
Steven

_______________________________________________
Python-Dev mailing list
Python-Dev@python.org
http://mail.python.org/mailman/listinfo/python-dev
Unsubscribe: 
http://mail.python.org/mailman/options/python-dev/archive%40mail-archive.com

Re: [Python-Dev] [Python-ideas] itertools.chunks(iterable, size, fill=None)

Reply via email to