Hi Alessandro,

A lot has been written about this. Google "precision of summing floating point values" and read the .pdfs on the first page for some analysis. Follow the citations for more.

Somewhere there is a paper that analyses the performance of different methods and suggests the optimal approach. I think the best choice is signal-dependent.


On 10/12/2012 11:32 AM, Alessandro Saccoia wrote:
It's going to be a sort of cumulative process that goes on in time,
so I won't necessarily know N in advance. If I had strong evidence
that I should prefer one method over the other, I could decide to
keep all the temporary X1, X2,… and recompute everything each time.
Performance and storage are not a strong concern in this case, but
the quality of the final and intermediate results is.

Dividing by N only when you need the mean sounds like a good idea; that way you won't be trashing the precision of each value prior to summing it.
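
Something like this, as a rough sketch in C++ (the names are just illustrative):

    // Keep the raw sum and the count; divide only when the mean is
    // actually needed, so no precision is spent on per-value divides.
    struct RunningMean {
        double sum = 0.0;
        long long n = 0;
        void add(double x) { sum += x; ++n; }
        double mean() const { return n ? sum / n : 0.0; }
    };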

To avoid throwing out any information you could use high-precision arithmetic. Did you say whether the signals are originally integers or floating point? If they're integers, can you keep the sum in a 64-bit int? Otherwise maybe a double. You can easily compute when you will start to lose precision in floating point, based on the range of the input and the number of elements in the sum (summing 2 values requires 1 extra bit of headroom, 4 values requires 2 bits, 8 values 3 bits, etc.). So for 1024 32-bit values you'll need 10 more bits to avoid any loss of precision due to truncation. There is also arbitrary-precision arithmetic if you don't want to throw any bits away, and there is something called "double-double", a software 128-bit floating point type that maybe isn't too expensive.
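
For example, if your samples really are 32-bit ints, a 64-bit accumulator keeps every addition exact (a sketch; the function name is made up):

    #include <cstddef>
    #include <cstdint>

    // Exact sum of 32-bit integer samples in a 64-bit accumulator:
    // 32 bits of headroom covers up to 2^32 summands with no loss.
    std::int64_t sumExact(const std::int32_t* x, std::size_t n)
    {
        std::int64_t acc = 0;
        for (std::size_t i = 0; i < n; ++i)
            acc += x[i];    // each addition is exact
        return acc;
    }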

One problem with floating point is that adding a large value and a small value will truncate the small value (first it needs to be shifted so it has the same exponent as the large value). You didn't say much about your values, but assuming that you're adding values distributed around a non-zero mean, the accumulator will increase in value as you add more values, so later summands will be truncated more than earlier ones.

One way to minimise this is to maintain multiple accumulators and only sum a certain number of values into each one (or sum into successive accumulators, kind of like a ring buffer of accumulators), then sum the accumulators together at the end; there's a sketch below. This reduces truncation effects, since each accumulator has a limited range (hence higher precision of summation), and when you sum the final accumulators together they will (hopefully) all have a similar range. A variation on this, if you know your signals have different magnitudes (e.g. you are summing both X and X^2), is to use separate accumulators for each magnitude class, since these are obviously going to have vastly different domains.
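
A rough sketch of the multiple-accumulator idea (the accumulator count is arbitrary and the names are made up):

    #include <cstddef>
    #include <vector>

    // Round-robin the values into several accumulators so each one
    // stays a factor numAccs smaller than a single running sum would,
    // which means later summands get truncated less. The partial sums
    // are combined only at the end, when they all have similar magnitude.
    double sumCascaded(const std::vector<double>& x, std::size_t numAccs = 16)
    {
        std::vector<double> acc(numAccs, 0.0);
        for (std::size_t i = 0; i < x.size(); ++i)
            acc[i % numAccs] += x[i];   // successive accumulators, ring-buffer style
        double total = 0.0;
        for (double a : acc)
            total += a;                 // final combine
        return total;
    }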

You also need to consider what form the final output will take. If the output is low precision then the best you can hope for is that each input makes an equal contribution to the output -- you need enough precision in your accumulator to ensure this.

For some uses you could also consider dithering when quantising down to the final output precision.
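
A sketch of TPDF dither, assuming you know the size of one output quantisation step (the names here are made up):

    #include <cmath>
    #include <cstdlib>

    // TPDF dither before quantisation: add triangular noise of +/- 1 LSB,
    // then round to the nearest output step, so the quantisation error is
    // decorrelated from the signal instead of being correlated distortion.
    // 'lsb' is the size of one output step (set it from your format).
    double ditherAndQuantise(double x, double lsb)
    {
        double r1 = std::rand() / (double)RAND_MAX;      // uniform in [0,1]
        double r2 = std::rand() / (double)RAND_MAX;
        double tpdf = (r1 - r2) * lsb;                   // triangular in [-lsb, +lsb]
        return lsb * std::floor((x + tpdf) / lsb + 0.5); // round to nearest step
    }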

Ross.
