Re: [users] HTML Post Re: Problem with sum in Calc

Harold Fuchs Mon, 08 Mar 2010 17:31:30 -0800

Daniel Lewis wrote:

Harold Fuchs wrote:
Daniel Lewis wrote:
Harold Fuchs wrote:
Twayne wrote:
<snip>
I managed to create a tiny spreadsheet that exhibits the
problem: 1.456    1.46    1.46
2.345    2.35    2.35
4.567    4.57    4.57
8.368    8.37    8.38

The first column shows how I entered the data [the last row
shows the result of =sum(..)]; the second row shows how it
and the =sum(...) appears under default (2 decimal places)
formatting; the third shows the result of the procedure I
describe below.
Please try the following, assuming your values are in
column A and that column B is spare - adjust as appropriate:

  1. in B1 enter the formula =INT((A1+0.005)*100)/100
  2. copy B1 down column B for as many rows as are
necessary to cover all the values
  3. in the *next* row in column B enter the formula
=SUM(B1:Bn) where "n" is the number of values
  4. compare this result to your previous one - the sum of
column A.
The formula in #1 above is designed to remove any rounding
errors in column A and put the result in column B. If the
sum in column B is the same as that in column A and if that
sum is wrong, then I'd say you've found a bug :-(
Harold,
Haven't been in a classroom in a lonnnnnggg time, but I fail tosee the significance of adding 0.005 to each manually user enterednumber in the leftmost column to get column B. It is of course,going to make any digits after the decimal point round upwards,even when it shouldn't: e.g. 123.111 becomes 123.116 or, if oneexposes one more decimal places, 123.1160. That's incorrect, asrounding will either leave a number as is, or increase it by 1 atthe relevant decimal place. 123.111 should round to 123.11 or123.1 or 123, but never 123.116 or 123.17 because that createsonly a round-up. Each additional entry will then increase thesummation errors very substantially with a list of more than a fewdigits to add up.
123.111 will round to 123.11 as you say, but 123.115 will round to123.12 which is what we want. After rounding, the formulamultiplies by 100, giving 12311.1 and 12312.0 respectively; the INTthen gives 12311 and 12312 respectively and then dividing by 100gives the correct results. The idea is to make the 3rd decimalplace have its correct effect.
Perhaps you can enlighten me as to the specific point you'retrying to make with the second column where you add 0.005 to eachmanual entry? 1.4560 1.4600 1.46 2.3450 2.35002.35 4.5670 4.5700 4.57 8.3680 8.3800 8.37Sums in column B is incorrect for what was entered, but notincorrect for the data Calc would use. 4 Dec Plcs 4 Dec Plcs
      Ea +.005 2 Dec Plcs
The second column shows how Calc displays the values. The displayfor rows 1-3 are correct to 2 decimal places but the 4th row, the=SUM(...) is **wrong**. If you add up the *displayed* values you do*not* get the displayed result !!! The reason is that if you add1.456, 2.345 and 4.567 you get 8.368 which rounds to 8.37.
The third column shows the results of my formula and thecorresponding =SUM(...) . The displayed values are the same but nowthe sum is correct.
1.456 1.46 1.456 2.345 2.352.345 4.567 4.57 4.567 8.368 8.38 8.37Sums. Again, column B is no longer the same as A. Sum(a7:a9) sum(b7:b9) Sum a7:a9
Exactly. Same problem. You need to round *before* you add; Calcadds before rounding *because* it uses the more precise values inits calculation. Calc (correctly) does not use the displayedvalues. Asking for a display of 2 decimal places doesn't change theunderlying value, only its appearance.
     Ea + .005 in middle column
First Column, all 3 dec places, same as yours; typed numbers andsumming.Second Column, 4 - 2 decimal places resp top and bottom, usingyour formula and summing.
Third Column, 2 - 3 dec places resp, top to bottom, and summed.
Summing the numbers in Calc or Excel, 8.368 sum for column 1 sumscorrectly to rounded off 8.37 in Column 3, for a 2 dec placesummation.
You also said:
" ...#1 above is designed to remove any rounding
errors in column A and put the result in column B...
"
But I fail to see the relevence of it. Perhaps I'm blind but Ialso don't see a "#1", but I think I know what you meant.
I gave a numbered list of 3 items. It is common practice to referto these separately as #1, #2 and #3. The # sign is pronounced"number" in all the business contexts I've ever encountered.
Trouble is, adding 0.005 to each number is going to force eachnumber to round to the next higher, whether it should roundupwards or not. If you start with 1 and add 5, =6, rounding offat that decimal place is going to happen. Same for 0 thru 4 andeven does a rollover at 5 and above. How can that possibly beintended to account for rounding errors in Column A? What itreally does is make the more like integers than rounded numbers.Whatever precision the numbers are typed in as is the precisionused in calculations, regardless of how many digits (and resultingrounding) you choose to have display.
The point is you don't want to use the precise values for theaddition. You want to use the rounded values. Otherwise you get theludicrous effect shown in my column 2 where 1.46, 2.35 and 4.57 addto 8.37 which is clearly wrong and arises exactly because you areadding the more precise values.
The difference between my columns 2 and 3 is that in column 2 theactual values and the displayed values are different. In column 3they are the same. They have been forced to be the same by roundingeach row separately.
1.5 1.5 1.5 2.3 2.4 2.3 4.6 4.6 4.6 8.48.4 8.4 Note that, with 3 or more decimalplaces, the numbers as originally entered are still used in thecalculation. One simply has to be careful to consider the effectsof rounding vs # digits to display. Other than the apparenterror by adding 0.005 to each entry, the sums are indeed the same.Excel treats these numbers the same way. What did I miss in thepoint you wanted to make?
The point I want to make is that if you want your invoices to*appear* to add up correctly you need to round each item beforeusing it in the total. An invoice showing 1.46, 2.35 and 4.57adding to 8.37 is ridiculous.
<snip>
One of the problems is that INT((A1+.005)*100)/100 is not doingwhat you want it to do. You are introducing an error which dependsupon the number of entries whose last digit is 5 or more. You need aminimum of 3 numbers whose last digit is at least 5. This causes thesum of the INT((A1+.005)*100)/100 to be .01 more than it would be ifyou were adding the numbers using two decimal places. It appearsthat the error happens when there are an odd quantity of numberswhose last digit is 5 thought 9. For an even quantity of thesenumbers, INT((A1+.005)*100) will round half of the numbers lower andhalf higher. Thus the sums are the same to 2 decimal places. *BUT*,when the quantity of these numbers is odd, there is one more numberbeing rounded up than being rounded down. Hence, that one numberrounded up produces the error.
Dan
I don't think this is right. The following shows the same error eventhough there are 4 values whose third digit is 5 or more:
 1.234      1.23      1.23
 2.345      2.35      2.35
 3.456      3.46      3.46
 4.567      4.57      4.57
 5.678      5.68      5.68
17.280     17.28     17.29
Again, the first column (rows 1-5) is as I entered the data; thesecond is as Calc displays the values with 2dp set; the third isafter the application of my original formula (note that I havedeliberately not used the better formula offered by Franz Wein'se-mail in this thread so as not to confuse things with discussionsabout whether or not his formula is equivalent to mine). Again thesum in column 2 (row 6) is *wrong* if you add the displayed valueswhile that in column 3 is right.
I don't think the number of values has anything to do with theprinciple that if the *displayed* total is always to be the correctsum of the *displayed* values then those values must be roundedbefore the addition is performed. I learnt this principle when I usedmy first spreadsheet application in the mid 1970's (yes, really) andI don't think it's changed - because it's fundamental to the wayarithmetic works. Ask an accountant.
I, personally doubt this principle. Anytime rounding is done, an erroris introduced. In the 1956-1960 school years, I learned this. Toreduce the error, here is what we were taught. If 5 is the finaldigit, look at the digit to its left. If that digit is even, thenround down. If that digit is odd, then round up. So, 2.345 becomes2.34. 2.355 becomes 2.36. In long columns of numbers being added, theprobability is that approximately half of the numbers ending in 5will have an even digit to its left and the other half of the numbershave an odd digit preceding the final 5.
Here is why I doubt the principle you stated: I took the same numbersas you did and changed all the final digits to a 5 if the originalfinal digit was greater than 5. Rather interesting results. Roundingresults in a difference of 2 when rounding to
 1.234      1.23          1.23    two decimal places.
 2.345      2.35          2.35
 3.455      3.46          3.46
 4.565      4.57          4.57
 5.675      5.68          5.68
17.274     17.27         17.29
What is the problem? Why is one sum 17.27 and the other 17.29? Simpleanswer: the first sum was rounded after adding the numbers; the secondsum came from rounding the numbers first and adding them afterward. Inmath lingo, These two operations are not commutative. The resultsdepend directly on the which order the operations are performed.

Exactly. As you say "The results depend directly on the which order theoperations are performed. "

Just look at your own numbers. The sum in column 2 (17.27) is patently*not* the sum of the values in column 2 whereas the sum in column 3 *is*the sum of the values in column 3. This is precisely my point.

If the numbers in column 1 represented monetary amounts based on somecalculation using numbers not shown, and if you had to pay each value*separately*, the total of your outgoings would be as shown in column 3,*not* as shown in column 2. That's because each of your payments wouldhave been rounded according to my (or Franz Wein's) formula.

In fact, the *method* used to round the individual numbers is not theissue. The point is that the rounding must be done before the addition.If not, then the "printed" numbers may well not add to the "printed" total.

As an aside, Google shows me that the odd/even method of rounding youdescribe is sometimes called "banker's rounding". All I can say is thatI have spent a long time (since the mid '70s) helping major banks allover the world implement computerised spreadsheets and have never beenasked for this method nor seen it used either on computers or oncalculators. Google also shows me that "Banker's rounding" is not available


   * on any HP or Casio calculator
   * as a built-in function in OpenOffice.org Calc
   * as a built-in function in Microsoft Excel, although it is
     available in Microsoft Visual Basic for Applications (VBA).


--
Harold Fuchs
London, England
Please reply *only* to [email protected]

Re: [users] HTML Post Re: Problem with sum in Calc

Reply via email to