Re: [Rd] NA in doc for options(matprod="default")

Tomas Kalibera Mon, 17 Feb 2020 10:41:10 -0800

On 2/17/20 6:18 PM, Serguei Sokol wrote:

On 17/02/2020 at 17:50, Tomas Kalibera wrote:
On 2/17/20 5:36 PM, Serguei Sokol wrote:
Hi,
A colleague of mine pointed me to a passage in the ?options doc talking about the Inf and NaN check in the 'matprod=default' section:
https://stat.ethz.ch/R-manual/R-devel/library/base/html/options.html
I am wondering whether NA should be mentioned too, as the check seems to include this "value" as well. NA being different from Inf and NaN, it is worth mentioning, isn't it?
Yes, NA is handled, too. NA is one of the NaN values for the purpose of this text
Thanks for the clarification. It was not clear to me from the text itself.

(and it is also implemented that way, see ?NaN).

 Indeed, the text of ?NaN says "... systems typically have
     many different NaN values.  One of these is used for the numeric
     missing value ‘NA’, and ‘is.nan’ is false for that value."

However, R can return both NA and NaN symbols, e.g.

> mean(c(1, NA))
[1] NA
> mean(c(1, NaN))
[1] NaN
which does not help to understand their relationship.
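
For reference, the predicates described in ?NaN do separate these values. The following is a minimal sketch using only base R; the variable name x is illustrative.

```r
# One ordinary number plus the three special values discussed in this thread.
x <- c(1, NA, NaN, Inf)

# is.na() is TRUE for both NA and NaN.
is.na(x)      # FALSE  TRUE  TRUE FALSE

# is.nan() is TRUE only for NaN; it is FALSE for NA.
is.nan(x)     # FALSE FALSE  TRUE FALSE

# is.finite() is FALSE for NA, NaN and Inf alike.
is.finite(x)  # TRUE FALSE FALSE FALSE
```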

That's why I continue to think that it would be clearer to mention NA explicitly in options(matprod="default"). It could be phrased like "... ensure correct propagation of Inf and NaN (including NA) ..."

I've intentionally left that out from this part of the text in ?options. It is irrelevant to talk about how NA propagates through computation because NaNs may become NAs and vice versa (see ?NaN). Intuitively it would be nice if NAs were different, if a computation of, say, a pure function would result in NA iff at least one of its inputs was an NA. In more complicated situations it would be hard to define what should be the correct result, but even in the simpler cases this does not work in R anymore. We lost this with architectural changes in CPUs that no longer defined the payload of NaNs resulting from elementary floating point operations, so we would have to always check explicitly, with a lot of effort and additional performance overhead. Also, we could not hope for the distinction to work through external code (such as BLAS or LAPACK) that is not aware of R's notion of NA.
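
The loss of the NA/NaN distinction can be observed with a sketch like the following, which uses only base R. No expected output is given for the is.nan() calls, since ?NaN states that whether such operations return NA or NaN is not guaranteed and may depend on the platform. The names r1 and r2 are illustrative.

```r
# Mix NA and NaN in elementary arithmetic; the operand order may matter.
r1 <- NA_real_ + NaN
r2 <- NaN + NA_real_

# Both results count as missing for is.na() ...
is.na(r1)
is.na(r2)

# ... but whether each one is NA or NaN is not guaranteed.
is.nan(r1)
is.nan(r2)
```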

All of ?options for "matprod" is about propagation of standard floating point non-finite values (NaN, Inf) through matrix multiplication. The naive 3-loop algorithm with a correct compiler (following the standard in implementing floating point operations) is regarded as producing the correct results. Some BLAS implementations produce different results due to optimizations in code (not the naive 3-loop algorithm) and likely aggressive compiler optimizations that violate the standard. R users can choose based on their preference; the differences in performance can be significant.
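
As an illustration of the reference behaviour, here is a minimal sketch of a naive 3-loop product written in plain R. The function name naive_matprod is hypothetical and not part of R; R's own implementation is in C. The comparison at the end assumes R >= 3.4.0, where the "matprod" option accepts the value "internal".

```r
# Naive 3-loop matrix product (hypothetical helper, for illustration only).
naive_matprod <- function(a, b) {
  stopifnot(is.matrix(a), is.matrix(b), ncol(a) == nrow(b))
  out <- matrix(0, nrow(a), ncol(b))
  for (i in seq_len(nrow(a))) {
    for (j in seq_len(ncol(b))) {
      s <- 0
      for (k in seq_len(ncol(a))) {
        s <- s + a[i, k] * b[k, j]  # Inf * 0 gives NaN, which then propagates
      }
      out[i, j] <- s
    }
  }
  out
}

a <- matrix(c(1, Inf, 0, 1), nrow = 2)  # rows: (1, 0) and (Inf, 1)
b <- matrix(c(0, 1, 1, 0), nrow = 2)    # rows: (0, 1) and (1, 0)

ref <- naive_matprod(a, b)
ref  # element [2, 1] is Inf * 0 + 1 * 1, i.e. NaN; element [2, 2] is Inf

# Compare with R's own unoptimized code path, then restore the old setting.
old <- options(matprod = "internal")
res <- a %*% b
options(old)
identical(is.nan(ref), is.nan(res))
```

Setting the option to "blas" instead would send the same product to the BLAS library in use, whose handling of NaN and Inf may differ.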


Best
Tomas


Best,
Serguei.


______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel
