I had a look at the C solution: it delegates the summation over one axis from the axis tuple to the C helper, and the remaining axes are then summed in _methods.py. Worst case: if the axis delegated to the helper is very short compared to the other axes, I would expect hardly any speed-up, and the savings on memory usage would also be limited.
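To put rough numbers on that worst case, here is a minimal sketch in plain Python. The shape, the axis tuple, and both function names are made up for illustration and are not taken from the PR. It only counts how many elements are left over for the Python-level reduction after a single axis has been handed to the helper.

```python
from math import prod


def pick_longest_axis(shape, axes):
    """Return the axis from `axes` with the largest extent in `shape`.

    Hypothetical helper, not part of NumPy or the PR.
    """
    return max(axes, key=lambda ax: shape[ax])


def intermediate_size(shape, delegated_axis):
    """Number of elements remaining once `delegated_axis` is reduced away.

    This is the size of the intermediate array that the remaining
    (non-delegated) reductions still have to process.
    """
    return prod(n for ax, n in enumerate(shape) if ax != delegated_axis)


# Made-up example: reduce over axes (0, 1) of a (2, 1000, 500) array.
shape = (2, 1000, 500)
axes = (0, 1)

# Delegating the short axis leaves almost all of the work behind.
worst = intermediate_size(shape, 0)

# Delegating the longest axis in the tuple leaves far less.
best = intermediate_size(shape, pick_longest_axis(shape, axes))

print(worst, best)
```

With these assumed numbers, handing the length-2 axis to the helper leaves an intermediate of 500000 elements, while handing over the length-1000 axis leaves only 1000.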
Sticking with this solution, it would be better, from the point of view of speed and memory use, to delegate the longest axis from the axis tuple to the C code. In my view, a solution with which many would be happier (https://github.com/numpy/numpy/pull/13263#issuecomment-1048122467) would probably delegate all the axes to the helper function.