On 25.03.2013 11:31, Matthew Dowle wrote:
On 25.03.2013 11:27, Matthew Dowle wrote:
On 25.03.2013 09:20, Prof Brian Ripley wrote:
On 24/03/2013 15:01, Duncan Murdoch wrote:
On 13-03-23 10:20 AM, Matthew Dowle wrote:
On 23.03.2013 12:01, Prof Brian Ripley wrote:
On 20/03/2013 12:56, Matthew Dowle wrote:
Hi,
Please consider the following:
x = as.integer(2^30-1)
x
[1] 1073741823
sum(c(rep(x, 10000000), rep(-x, 9999999)))
[1] 1073741824
Tested on 2.15.2 and a recent R-devel (r62132).
I'm wondering if s in isum could be LDOUBLE instead of double, like rsum, to fix this edge case?
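
For context on why this goes wrong: once the running total passes 2^53, a double can no longer represent every integer, so adding an odd value like 2^30-1 has to round. A minimal standalone C sketch of the effect (illustrative only, not R's actual isum code):

    #include <stdio.h>
    #include <stdint.h>

    int main(void)
    {
        const int32_t x = (1 << 30) - 1;   /* 1073741823 */
        double s = 0.0;                    /* double accumulator, as in isum */

        /* Mirror sum(c(rep(x, 1e7), rep(-x, 1e7 - 1))): the running total
         * peaks near 1.07e16, above 2^53 ~ 9.01e15, where consecutive
         * representable doubles are 2 apart, so additions there round. */
        for (int64_t i = 0; i < 10000000; i++) s += x;
        for (int64_t i = 0; i <  9999999; i++) s -= x;

        /* Exact answer is x = 1073741823; with IEEE-754 doubles this
         * prints 1073741824, matching the R output above. */
        printf("%.0f\n", s);
        return 0;
    }
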
No, because there is no guarantee that LDOUBLE differs from double (and there are platforms on which it does not).
That's a reason for not using LDOUBLE at all, isn't it? Yet src/main/*.c has 19 lines using LDOUBLE, e.g. arithmetic.c and cum.c as well as summary.c. I'd assumed LDOUBLE was being used by R to benefit from long double (or equivalent) on platforms that support it (which is all modern Unix, Mac and Windows as far as I know). I do realise that the edge case wouldn't be fixed on platforms where LDOUBLE is defined as double.
Actually, you don't know. In reality it is only (almost all) Intel ix86 that has it in hardware: most other current CPUs do not. C99/C11 require long double, but they do not require the accuracy you are thinking of, and it can be implemented in software.
This is very interesting, thanks. Which of the CRAN machines don't support LDOUBLE with higher accuracy than double, either in hardware or software? Yes, I had assumed that all CRAN machines would. It would be useful to know for something else I'm working on as well.
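
One way to check any given machine is a tiny probe of the standard <float.h> macros. It reports only the advertised precision of long double, not whether the arithmetic is done in hardware or software:

    #include <stdio.h>
    #include <float.h>

    int main(void)
    {
        /* DBL_MANT_DIG is 53 for IEEE double; LDBL_MANT_DIG is 64 for
         * x87 extended precision, 113 for quad, and equals DBL_MANT_DIG
         * on platforms where long double is just double. */
        printf("DBL_MANT_DIG  = %d\n", DBL_MANT_DIG);
        printf("LDBL_MANT_DIG = %d\n", LDBL_MANT_DIG);
        return 0;
    }

If the two values match, LDOUBLE buys no extra accuracy on that machine.
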
I think the problem is that there are two opposing targets in R: we want things to be as accurate as possible, and we want them to be consistent across platforms. Sometimes one goal wins, sometimes the other. Inconsistencies across platforms give false positives in tests, which tend to make us miss true bugs. Some people think we should never use LDOUBLE because of that. In other cases, the extra accuracy is so helpful that it's worth it. So I think you'd need to argue that the case you found is one where the benefit outweighs the costs. Since almost all integer sums are done exactly with the current code, is it really worth introducing inconsistencies in the rare inexact cases?

Duncan Murdoch
But as I said lower down, a 64-bit integer accumulator would be helpful: C99/C11 requires one at least that large, and it is implemented in hardware on all known R platforms. So there is a way to do this pretty consistently across platforms.
That sounds much better. Is it just a matter of changing s to be declared as uint64_t?
Typo. I meant int64_t.
But even a 64-bit integer might under- or overflow. Which is one of the reasons for accumulating in double (or LDOUBLE), isn't it? To save a test for over/underflow on each iteration.
What have I misunderstood?
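
To make the trade-off concrete, here is a sketch of what a 64-bit accumulator with a per-element guard might look like (hypothetical code, not a patch to summary.c; isum64 is an illustrative name). The two comparisons per iteration are exactly the cost in question:

    #include <stdint.h>

    /* Returns 1 and writes the exact sum to *out, or returns 0 on
     * int64_t overflow, in which case a caller would have to fall
     * back to double (or LDOUBLE) accumulation. */
    int isum64(const int32_t *x, int64_t n, int64_t *out)
    {
        int64_t s = 0;
        for (int64_t i = 0; i < n; i++) {
            /* Guard before adding: would s + x[i] leave int64_t range? */
            if ((x[i] > 0 && s > INT64_MAX - x[i]) ||
                (x[i] < 0 && s < INT64_MIN - x[i]))
                return 0;
            s += x[i];
        }
        *out = s;
        return 1;
    }

Note that while vectors are capped at 2^31 - 1 elements, each at most 2^31 - 1 in magnitude, |s| stays below 2^62 and the guard can never actually fire; it only earns its keep once longer vectors are allowed.
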
Users really need to take responsibility for the numerical stability of calculations they attempt. Expecting to sum 20 million large numbers exactly is unrealistic.
I am trying to take responsibility, but you said no. Changing from double to LDOUBLE would mean that something that wasn't realistic before would become realistic (on platforms that support long double). And it would bring open source R into line with TERR, which gets the answer right, on 64-bit Windows at least. But I'm not sure I should be as confident in TERR as I am in open source R, because I can't see its source code.
There are cases where 64-bit integer accumulators would be beneficial, and this is one. Unfortunately C11 does not require them, but some optional moves in that direction are planned.
https://svn.r-project.org/R/trunk/src/main/summary.c
Thanks,
Matthew
______________________________________________
R-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-devel