Seriously, nobody will ever get peak performance from the single-element iterator/range pattern - it stalls the CPU!

Anytime you read one byte from a typical hard disk, the system reads a full sector - typically 512 bytes, no more, no less. Anytime you read one byte from memory, the CPU loads an entire cache line from RAM into the cache (64 bytes on all modern Intel CPUs). Why not exploit that with ranges?

Ranges could potentially return arrays from front() (or from frontVector/whatever), so basically they would become ranges of ranges where the innermost range is always a RandomAccessRange. This has obvious performance benefits: traversing contiguous memory is faster than iterating element by element with popFront(). On the other hand, plain memory lacks the flexibility of ranges - so let's make a hybrid range!
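To make the idea concrete, here is a minimal sketch (in C++ for concreteness, since the idea itself is language-agnostic; the names front/popFront/empty are borrowed from D's range API, and ChunkedRange is a hypothetical type, not an existing library):

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical "hybrid" range: front() yields a whole contiguous
// chunk instead of a single element, so the outer loop follows the
// range protocol while the inner loop runs over plain memory.
struct ChunkedRange {
    const int* data;
    std::size_t len;
    std::size_t chunkSize;

    bool empty() const { return len == 0; }

    // The innermost "range" is just a pointer + length: random access
    // over contiguous memory, which compilers optimize aggressively.
    std::pair<const int*, std::size_t> front() const {
        std::size_t n = len < chunkSize ? len : chunkSize;
        return {data, n};
    }

    void popFront() {
        std::size_t n = len < chunkSize ? len : chunkSize;
        data += n;
        len -= n;
    }
};

// Consumer: one popFront() per chunk instead of one per element;
// the inner loop is a plain array walk the compiler can vectorize.
long long sum(ChunkedRange r) {
    long long total = 0;
    for (; !r.empty(); r.popFront()) {
        auto [p, n] = r.front();
        for (std::size_t i = 0; i < n; ++i)
            total += p[i];
    }
    return total;
}
```

Note that D's std.range already ships a chunks() adapter that slices a range into fixed-size sub-ranges, which is close in spirit to the outer half of this design.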

Advantages:
* performance (!)
* limited lookahead/backtracking

How?

1. instruction-level parallelism: CPUs can execute several instructions in parallel as long as they operate on different data (different vector elements)
2. SIMD: load an entire vector into an SSE/AVX register, then run the operation on all elements at once
3. use of the L1 cache: traversing a vector in memory is way faster than calling popFront() for each vector element - probably even when popFront() is inlined
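Point 1 can be illustrated directly (again in C++ for concreteness; sumChunkILP is a hypothetical helper, not an existing API). Once a whole chunk is in hand, the summation can use several independent accumulators, so the CPU can run the additions in parallel instead of serializing on a single dependency chain - which is exactly what a one-element-per-popFront() loop forces:

```cpp
#include <cassert>
#include <cstddef>

// Four independent accumulators break the loop-carried dependency:
// a, b, c and d can all be updated in the same cycle on a
// superscalar CPU. A strict element-at-a-time range cannot offer
// the compiler this opportunity.
long long sumChunkILP(const int* p, std::size_t n) {
    long long a = 0, b = 0, c = 0, d = 0;
    std::size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        a += p[i];
        b += p[i + 1];
        c += p[i + 2];
        d += p[i + 3];
    }
    long long total = a + b + c + d;
    for (; i < n; ++i)  // scalar tail for the leftover elements
        total += p[i];
    return total;
}
```

The same contiguous layout is what lets an auto-vectorizer apply point 2 (SIMD) to this loop without any further help.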

Disadvantages:
* potential inconvenience (consumers have to deal with a range of ranges)

I know it can't be an easy drop-in addition to the current range implementation, but who knows...
