Re: [Freetel-codec2] facebook speech codec at 365bps

Random via Freetel-codec2 Mon, 13 Sep 2021 00:05:07 -0700

Is it speaker-independent?


As for amateur radio use, I would worry about the computational complexity and
the hardware requirements.




------------------ Original ------------------
From: "freetel-codec2" <[email protected]>
Date: Sun, Sep 12, 2021 10:33 AM
To: "freetel-codec2" <[email protected]>
Subject: [Freetel-codec2] facebook speech codec at 365bps



https://speechbot.github.io/resynthesis/

https://ai.facebook.com/blog/textless-nlp-generating-expressive-speech-from-raw-audio/

The 365 bps figure is not entirely comparable to more
traditional codecs, because it presumes that a per-speaker
embedding is sent once.

This model need not be a barrier for amateur radio use. You could
easily imagine a scheme that sent the callsign in each frame, plus a
1 bit per frame output from a fountain code over the user's speaker
embedding. A receiver that hasn't recovered a particular speaker's
embedding yet could just use a dummy one until it's recovered. Bonus:
with this approach you get a good voice changer as a side effect,
increasing the appeal of ham radio to CB users! :P
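
To make the idea concrete, here is a rough sketch of what such a scheme
might look like. It is not anything from the linked work. It assumes the
speaker embedding has been quantized to a fixed number of bits, that the
receiver knows each frame's sequence number, and that a plain random
linear code over GF(2) stands in for the fountain code. All names
(frame_mask, encode_bit, Decoder) are made up for illustration.

```python
import random


def frame_mask(callsign: str, frame_no: int, n_bits: int) -> int:
    """Pseudo-random subset of embedding bits for this frame, as a bitmask.

    Sender and receiver derive the same subset from (callsign, frame_no),
    so no extra signalling is needed beyond the frame counter (assumed).
    """
    rng = random.Random(f"{callsign}:{frame_no}")
    return rng.getrandbits(n_bits) or 1  # skip the useless all-zero subset


def parity(x: int) -> int:
    """XOR of all bits of a non-negative integer."""
    return bin(x).count("1") & 1


def encode_bit(embedding: int, callsign: str, frame_no: int, n_bits: int) -> int:
    """The single fountain-code bit carried by one voice frame."""
    return parity(embedding & frame_mask(callsign, frame_no, n_bits))


class Decoder:
    """Collects one bit per received frame and solves for the embedding."""

    def __init__(self, callsign: str, n_bits: int):
        self.callsign = callsign
        self.n_bits = n_bits
        self.rows = {}  # highest set bit of mask -> (mask, parity bit)

    def add_frame(self, frame_no: int, bit: int) -> None:
        """Incremental Gaussian elimination over GF(2)."""
        mask = frame_mask(self.callsign, frame_no, self.n_bits)
        while mask:
            pivot = mask.bit_length() - 1
            if pivot not in self.rows:
                self.rows[pivot] = (mask, bit)
                return
            other_mask, other_bit = self.rows[pivot]
            mask ^= other_mask
            bit ^= other_bit
        # mask reduced to zero: this frame added no new information

    def embedding(self):
        """Return the recovered embedding, or None if more frames are needed."""
        if len(self.rows) < self.n_bits:
            return None
        value = 0
        for pivot in sorted(self.rows):  # back-substitute from the lowest bit up
            mask, bit = self.rows[pivot]
            lower = mask & ~(1 << pivot)
            if bit ^ parity(value & lower):
                value |= 1 << pivot
        return value
```

A receiver would call add_frame() for every frame it happens to hear and
keep synthesizing with a dummy embedding while embedding() returns None.
Because any sufficiently large set of frames works, it does not matter
which frames were lost or when the listener tuned in.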


_______________________________________________
Freetel-codec2 mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/freetel-codec2

