It is pretty weird that these two statements don't necessarily produce the
same result:

someufunc(*inputs, out=out_arr)
out_arr[...] = someufunc(*inputs)

On Fri, Sep 27, 2019, 15:02 Sebastian Berg <sebast...@sipsolutions.net>
wrote:

> On Fri, 2019-09-27 at 11:50 -0700, Sebastian Berg wrote:
> > Hi all,
> >
> > Looking at the ufunc dispatching rules with an `out` argument, I was
> > a
> > bit surprised to realize this little gem is how things work:
> >
> > ```
> > arr = np.arange(10, dtype=np.uint16) + 2**15
> > print(arr)
> > # array([ 0,  2,  4,  6,  8, 10, 12, 14, 16, 18], dtype=uint16)
> >
>
> Whoops, copied that print wrong of course.
>
> Just to be clear, I personally will consider this an accuracy/precision
> bug and assume that we can just switch the behaviour failry
> unceremoniously at some point (and if someone feels that should be a
> major release, I do not mind).
> It seems like one of those things that will definitely fix some bugs
> but could break the odd system/assumption somewhere. Similar to fixing
> the memory overlap issues.
>
> - Sebastian
>
>
> > out = np.zeros(10)
> >
> > np.add(arr, arr, out=out)
> > print(repr(out))
> > # array([ 0.,  2.,  4.,  6.,  8., 10., 12., 14., 16., 18.])
> > ```
> >
> > This is strictly speaking correct/consistent. What the ufunc tries to
> > ensure is that whatever the loop produces fits into `out`.
> > However, I still find it unexpected that it does not pick the full
> > precision loop.
> >
> > There is currently only one way to achieve that, and this by using
> > `dtype=out.dtype` (or similar incarnations) which specify the exact
> > dtype [0].
> >
> > Of course this is also because I would like to simplify things for a
> > new dispatching system, but I would like to propose to disable the
> > above behaviour. This would mean:
> >
> > ```
> > # make the call:
> > np.add(arr, arr, out=out)
> >
> > # Equivalent to the current [1]:
> > np.add(arr, arr, out=out, dtype=(None, None, out.dtype))
> >
> > # Getting the old behaviour requires (assuming inputs have same
> > dtype):
> > np.add(arr, arr, out=out, dtypes=arr.dtype)
> > ```
> >
> > and thus force the high precision loop. In very rare cases, this
> > could
> > lead to no loop being found.
> >
> > The main incompatibility is if someone actually makes use of the
> > above
> > (integer over/underflow) behaviour, but wants to store it in a higher
> > precision array.
> >
> > I personally currently think we should change it, but am curious if
> > we
> > think that we may be able to get away with an accelerate process and
> > not a year long FutureWarning.
> >
> > Cheers,
> >
> > Sebastian
> >
> >
> > [0] You can also use `casting="no"` but in all relevant cases that
> > should find no loop, since the we typically only have homogeneous
> > loop
> > definitions, and
> >
> > [1] Which is normally the same as the shorter spelling
> > `dtype=out.dtype` of course.
> > _______________________________________________
> > NumPy-Discussion mailing list
> > NumPy-Discussion@python.org
> > https://mail.python.org/mailman/listinfo/numpy-discussion
> _______________________________________________
> NumPy-Discussion mailing list
> NumPy-Discussion@python.org
> https://mail.python.org/mailman/listinfo/numpy-discussion
>
_______________________________________________
NumPy-Discussion mailing list
NumPy-Discussion@python.org
https://mail.python.org/mailman/listinfo/numpy-discussion

Reply via email to