Re: [Numpy-discussion] index partition

Daπid Tue, 15 Apr 2014 01:36:14 -0700

On 14 April 2014 18:17, Alan G Isaac <alan.is...@gmail.com> wrote:

> I find it rather more convenient to use boolean arrays,
> but I wonder if arrays of indexes might have other
> advantages (which would suggest using the set operations
> instead). In particular, might a[boolean_array] be slower
> that a[indexes]?  (I'm just asking, not suggesting.)



Indexing is generally faster, but convert from boolean to indexes gets more
expensive:

In [2]: arr =np.random.random(1000)

In [3]: mask = arr>0.7

In [4]: mask.sum()
Out[4]: 290

In [5]: %timeit arr[mask]
100000 loops, best of 3: 4.01 µs per loop

In [6]: %%timeit
   ...: wh = np.where(mask)
   ...: arr[wh]
   ...:
100000 loops, best of 3: 6.47 µs per loop

In [8]: wh = np.where(mask)

In [9]: %timeit arr[wh]
100000 loops, best of 3: 2.57 µs per loop

In [10]: %timeit np.where(mask)
100000 loops, best of 3: 3.89 µs per loop

In [14]: np.all(arr[wh] == arr[mask])
Out[14]: True


If you want to apply the same mask to several arrays, it is then worth
(performance-wise) to do it.


/David.

_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@scipy.org
http://mail.scipy.org/mailman/listinfo/numpy-discussion

Re: [Numpy-discussion] index partition

Reply via email to