On 8/9/15 5:07 PM, Sampo Syreeni wrote:
On 2015-07-18, robert bristow-johnson wrote:

even so, Shannon information theory is sorta static. it does not deal with the kind of redundancy of a repeated symbol or string.

In fact it does so fully,

really?  like run-length encoding?

and i've always thunk that the data reduction you get with LPC (to reduce the word size), which depends on the spectrum of the data, was a different thing from what Shannon information theory was looking at, the thing that might lead to Huffman coding.

and I believe Peter's problem is that he hasn't figured out how it does this. (Sorry about arriving this late and reviving a heated thread.)

The basic Shannonian framework works on top of a single, structureless probability space.

yes, and you can model dependent probabilities, i s'pose. so you can put together compound messages that need less information to represent than you would need if they were kept separate.


Any message sent is a *single* word from that space with a given a priori probability, and the information it conveys is inversely related to the probability of the symbol. Thus, the basic framework is that we're computing the probabilities and the resulting information based on *entire* signals being the symbols that we send.
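
(a quick numpy sketch of that "whole signal as one symbol" view, with made-up a priori probabilities just for illustration:)

import numpy as np

# made-up a priori probabilities for three possible "entire signals"
p = {"signal_A": 0.5, "signal_B": 0.25, "signal_C": 0.25}

for name, prob in p.items():
    surprisal = -np.log2(prob)   # bits of information conveyed by receiving that one message
    print(name, surprisal, "bits")

# degenerate case: a message sent with probability 1 carries -log2(1) = 0 bits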

Shannon entropy is then revolutionary because under certain conditions it allows us to decompose the space into smaller parts. If the entire signal can be broken down into a Cartesian product of separate, *independent* part signals, the whole information content of the signal can be calculated from the partwise surprisals. That's how the uniqueness of the entropy measure is proven: if you want the information in the parts to sum to the information of the whole, and each part's information content is obviously related to the number of combinations of values it can take, then the only possible measure is a logarithm of probabilities. At the bottom that's a simple homomorphism argument: there is no (continuous) homomorphism from products to sums other than the logarithm.
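
(and a hedged sketch of that additivity, with made-up distributions: when the joint distribution factors as an outer product, the entropy of the whole is the sum of the partwise entropies:)

import numpy as np

def entropy(p):
    p = np.asarray(p, dtype=float)
    p = p[p > 0]                      # treat 0*log(0) as 0
    return -np.sum(p * np.log2(p))

px = np.array([0.5, 0.25, 0.25])      # distribution of part X
py = np.array([0.7, 0.3])             # distribution of part Y

pxy = np.outer(px, py)                # independent parts: Cartesian product of the spaces

print(entropy(px) + entropy(py))      # sum of the partwise entropies
print(entropy(pxy.ravel()))           # entropy of the whole -- the same number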

But notice that there was an independence assumption there. If you plan to decompose your signal into smaller parts, Shannon's formula of additive entropy only holds if the parts don't affect each other. With periodic signals this assumption is violated maximally, by the very assumption of periodicity: every period is the same, so that a single period corresponds to the entire signal. For the purposes of talking about the entire signal and its entropy, you only ever need one period, and the underlying probability space it varies within.
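
(to see the collapse concretely, a toy sketch with a made-up distribution over single periods: only the "every period identical" signals can occur, so the entropy of the whole N-period signal equals the entropy of one period:)

import numpy as np
from itertools import product

def entropy(p):
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

p_period = {"A": 0.5, "B": 0.3, "C": 0.2}   # made-up distribution over possible single periods
N = 4                                        # number of periods in the whole signal

# joint distribution over whole signals: anything that isn't N identical periods
# has probability zero
p_signal = [p_period[tup[0]] if len(set(tup)) == 1 else 0.0
            for tup in product(p_period, repeat=N)]

print(entropy(list(p_period.values())))      # ~1.485 bits for one period
print(entropy(p_signal))                     # same ~1.485 bits for the entire signal
print(N * entropy(list(p_period.values())))  # what pretending the periods were independent would claim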

one thing that makes this encoding thing difficult is deciding how to communicate to the receiver the data that the signal is close to periodic and what the period is. and when that changes, how to communicate something different. it's the issue of how to define your codebook and whether the receiver knows about it in advance or not. you could have a variety of different codebooks established in advance, and then send a single short word to tell the receiver which codebook to use. maybe for non-white signals, the LPC coefs for that neighborhood of audio.
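
(something like this kind of scheme, with totally hypothetical numbers and names, just to sketch the "send an index into a pre-agreed codebook" idea:)

import numpy as np

# hypothetical pre-agreed codebooks: each row is a set of 2nd-order LPC-ish coefficients
# that both the encoder and the receiver already know
codebooks = np.array([
    [1.8, -0.9],    # strongly resonant / near-periodic neighborhood
    [0.9, -0.2],    # milder coloration
    [0.0,  0.0],    # roughly white
])

def choose_codebook(estimated_coefs):
    # encoder: pick the nearest pre-agreed codebook and send only its index
    d = np.sum((codebooks - estimated_coefs) ** 2, axis=1)
    return int(np.argmin(d))

index = choose_codebook(np.array([1.7, -0.85]))
bits_for_index = int(np.ceil(np.log2(len(codebooks))))   # the single short word that picks the codebook
print(index, bits_for_index)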


More generally, if there are any statistical correlations between the decomposed parts of any entire signal, they will, in a certain probabilistic sense, mean that the entire symbol space is more limited than it would at first seem, and/or that its probability distribution clumps further away from the flat, maximum-entropy distribution Shannon's machinery first expects. The surprisal of the whole thing is lowered.
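
(a made-up two-part example of that: the marginals alone would say 2 bits, but the correlation between the parts pulls the surprisal of the whole down:)

import numpy as np

def entropy(p):
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

# made-up joint distribution where the two parts are strongly correlated
pxy = np.array([[0.45, 0.05],
                [0.05, 0.45]])

px = pxy.sum(axis=1)                  # marginal of part X
py = pxy.sum(axis=0)                  # marginal of part Y

print(entropy(px) + entropy(py))      # 2.0 bits if the parts were treated as independent
print(entropy(pxy))                   # ~1.47 bits for the whole -- the surprisal is lowered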

In the limiting case you could have *any* signal *at all*, but always sent with 100% certainty: when you saw it, its surprisal would be precisely zero. That's the trivial case of the underlying probability space for the entire signal being composed of a single message with probability 1, *whatever* that one signal might be.

Peter's problem then seems to be that he doesn't specify that underlying probability space, nor state his assumptions fully. He calculates on signals as though their successive part-symbols or periods were independent, as if Shannon's decomposition worked. But between the lines he assumes that signals from an entirely different kind of source, one far more correlated between the parts of his chosen partition, were an equally valid option.

Obviously that leads to teeth grinding and misery, because it isn't even mathematically coherent.

it deals just with the probability of occurrence (which is measured from the relative frequency of occurrence) of messages. run-length coding isn't in there. maybe there is something in Shannon information theory about an information measure with conditional probability (that might obviate what LPC can do to compress data).
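
(for what it's worth, conditional probability does show up in the theory as conditional entropy; a sketch with a made-up first-order Markov source, where long runs are likely, so the "given the previous symbol" entropy is much lower than the memoryless one:)

import numpy as np

def entropy(p):
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

# made-up two-symbol Markov source that tends to produce long runs
P = np.array([[0.9, 0.1],     # transition probabilities from symbol 0
              [0.1, 0.9]])    # transition probabilities from symbol 1
pi = np.array([0.5, 0.5])     # its stationary distribution

H_memoryless  = entropy(pi)                                   # 1.0 bit/symbol, ignoring the dependence
H_conditional = sum(pi[i] * entropy(P[i]) for i in range(2))  # ~0.47 bits/symbol given the previous one

print(H_memoryless, H_conditional)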

In fact LPC and every other DSP transformation we use in codecs are well within Shannon's framework.

i didn't know that. it appears to me to be different, almost orthogonal to the Shannon thing.

There, the basic message is the whole continuous-time signal. If you really push it, you can model pretty much anything with any kind of noise and serial correlation (corresponding directly to any LTI DSP process in continuous time) with (huge) multivariate distributions as well. Of course, modulo a number of measure-theoretic quirks...but you can.
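
(a toy numpy sketch of that "huge multivariate distribution" idea: differential entropy of an 8-sample Gaussian block, once with independent samples and once with made-up AR(1)-style serial correlation; the correlated block carries noticeably fewer bits:)

import numpy as np

n = 8
rho = 0.9                                               # made-up serial correlation coefficient
idx = np.arange(n)
Sigma = rho ** np.abs(idx[:, None] - idx[None, :])      # Toeplitz covariance, unit variance per sample

def gaussian_entropy_bits(cov):
    # differential entropy of a zero-mean multivariate Gaussian, in bits
    k = cov.shape[0]
    return 0.5 * np.log2((2.0 * np.pi * np.e) ** k * np.linalg.det(cov))

print(gaussian_entropy_bits(np.eye(n)))   # ~16.4 bits for 8 independent unit-variance samples
print(gaussian_entropy_bits(Sigma))       # ~8 bits for the serially correlated block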

It's just that we don't want to. Instead we take the back alley and model the salient psychoacoustical correlations we see using well-understood LTI math, plus the sampling theorem, which lets us go to a countable basis that retains most of the properties of the continuous domain (mainly shift invariance; that's another neat homomorphism, basically). Combined with the noisy channel coding theorem, we're set to do some real calculation of the MP3 kind.
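
(for the LPC side of that, a little sketch with a synthetic resonant signal and a 2nd-order predictor fit to the data, everything made up for illustration; the point is that the prediction residual needs roughly 0.5*log2(variance ratio) fewer bits per sample at the same quantization step:)

import numpy as np

rng = np.random.default_rng(0)

# synthetic "colored" signal: white noise through a made-up resonant two-pole filter
a1, a2 = 1.8, -0.9
e = rng.standard_normal(10000)
x = np.zeros_like(e)
for n in range(2, len(x)):
    x[n] = a1 * x[n - 1] + a2 * x[n - 2] + e[n]

# estimate a 2nd-order linear predictor from the signal itself
A = np.column_stack([x[1:-1], x[:-2]])
b = x[2:]
coefs, *_ = np.linalg.lstsq(A, b, rcond=None)
residual = b - A @ coefs

print(np.var(x), np.var(residual))        # the residual has far less variance than the signal
print(0.5 * np.log2(np.var(x) / np.var(residual)), "bits/sample saved, roughly")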

And really, this ain't rocket science when you get it. It's just that you have to delve into the structure behind it before it gets easy. Peter doesn't seem to have done quite that, but instead jumped straight into formulae and simulations. That sort of thing of course sidetracks you from really getting the wider picture... and, as here, particularly from the edge cases like periodicity and such. :)


--

r b-j                  r...@audioimagination.com

"Imagination is more important than knowledge."



_______________________________________________
music-dsp mailing list
music-dsp@music.columbia.edu
https://lists.columbia.edu/mailman/listinfo/music-dsp
