Thanks Ricardo,
The demo was interesting, some high frequency reconstruction. I wonder
if they mean "kHz" in the article rather than "kbps) when they referred
to down sampling. The banner at the bottom made reading the article
really annoying.
As it happens I playing with some similar ideas right now. Speech does
have some correlation across the frequency axis. So if you know one
part of the spectrum of a frame of speech, you also have some
information about the other part.
This correlation can be found using neural nets, or vector quantisation.
Or we can explicitly remove the correlation using techniques like a
DCT, leaving us much less information to sen dover the channel.
Cheers,
David
On 24/06/17 08:41, Ricardo Andere de Mello wrote:
Hi,
Recently I have been working with deep learning, but mainly focused in
image recognition.
I found one article related to audio, and I thought you would find it
interesting:
https://blog.insightdatascience.com/using-deep-learning-to-reconstruct-high-resolution-audio-29deee8b7ccd
I was wondering if it would be possible to use codec2 frames as inputs
for training.
[]s, Ricardo Mello
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Freetel-codec2 mailing list
Freetel-codec2@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freetel-codec2
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Freetel-codec2 mailing list
Freetel-codec2@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freetel-codec2