On 6/26/17, David Rowe <da...@rowetel.com> wrote:

> I'm currently using a 2D DCT approach (bit like JPG) to code a version
> of the spectrum.  VQ could also be used.
>
> We don't really need to re-generate the upper 4kHz as it doesn't take
> many bits to encode it faithfully (perhaps 20% more than the first
> 4kHz).  The log(f) response of the ear means there isn't much info there
> we can actually perceive.

You might try a non-linear scaling of the frequency axis prior to the 2D DCT.
Squish and stretch it as you would when plotting on a semi-log graph,
so that your spectrum plot shows octaves with equal spacing
and you could line it up with a musical keyboard.

A transform of that nature lets you handle the higher frequencies
with an appropriate level of detail.
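
A rough sketch of the kind of warping I mean, not taken from the codec2
sources. The function names, the 125 Hz lower edge, the 16 kHz sample rate
and the linear interpolation are all assumptions for illustration. Each
frame's linearly spaced magnitude spectrum is resampled onto points that are
equally spaced in log(f); the warped frames would then be stacked and fed to
the 2D DCT, which is not shown here.

```python
import math


def log_axis(f_min_hz, f_max_hz, n_points):
    """Return n_points frequencies equally spaced in log(f) (hypothetical helper)."""
    ratio = f_max_hz / f_min_hz
    return [f_min_hz * ratio ** (i / (n_points - 1)) for i in range(n_points)]


def log_warp(spectrum, sample_rate_hz, f_min_hz, n_out):
    """Resample a linearly spaced spectrum onto a log-frequency axis.

    Assumes spectrum[k] is the magnitude at k * bin_hz, with the last
    bin sitting at sample_rate_hz / 2.
    """
    n_in = len(spectrum)
    f_max_hz = sample_rate_hz / 2.0
    bin_hz = f_max_hz / (n_in - 1)
    warped = []
    for f in log_axis(f_min_hz, f_max_hz, n_out):
        # Fractional position of this frequency on the input grid,
        # clamped so rounding error cannot run past the last bin.
        pos = min(f / bin_hz, n_in - 1)
        lo = min(int(pos), n_in - 2)
        frac = pos - lo
        # Linear interpolation between the two neighbouring bins.
        warped.append((1.0 - frac) * spectrum[lo] + frac * spectrum[lo + 1])
    return warped


if __name__ == "__main__":
    # Toy input: 65 bins covering 0..8 kHz (125 Hz per bin), a simple ramp.
    frame = [float(k) for k in range(65)]
    print(log_warp(frame, sample_rate_hz=16000, f_min_hz=125.0, n_out=7))
```

With those assumed numbers the axis spans six octaves (125 Hz to 8 kHz), so
the 4 kHz to 8 kHz octave gets one sixth of the output points instead of half
of the linear bins.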

_______________________________________________
Freetel-codec2 mailing list
Freetel-codec2@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freetel-codec2
