Is it speaker-independent?

As for amateur radio use, I would worry about the computational complexity and
the hardware requirements.




------------------ Original ------------------
From: "freetel-codec2" <gmaxw...@gmail.com>;
Date: Sun, Sep 12, 2021 10:33 AM
To: "freetel-codec2" <freetel-codec2@lists.sourceforge.net>;
Subject: [Freetel-codec2] facebook speech codec at 365bps



https://speechbot.github.io/resynthesis/

https://ai.facebook.com/blog/textless-nlp-generating-expressive-speech-from-raw-audio/

The 365 bps figure is not entirely comparable to the bitrates of more
traditional codecs because the authors presume a per-speaker
embedding is sent once.
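
To make the comparison concrete, here is a rough sketch of how the
one-time embedding cost amortizes over a transmission. The embedding
size used here (4096 bits) is purely an assumed placeholder, not a
figure from the linked pages, and effective_bps is a hypothetical
helper name.

```python
def effective_bps(codec_bps, embedding_bits, seconds):
    """Average bitrate when a one-time embedding is counted against the stream."""
    return codec_bps + embedding_bits / seconds

# Hypothetical embedding size; the real size is not stated in the announcement.
EMBEDDING_BITS = 4096

for seconds in (10, 60, 600):
    rate = effective_bps(365, EMBEDDING_BITS, seconds)
    print(f"{seconds:4d} s over: {rate:.1f} bps")
```

Under that assumption the overhead dominates a short over and becomes
negligible on a long one.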

This model need not be a barrier for amateur radio use. You could
easily imagine a scheme that sent the callsign in each frame, plus 1
bit per frame of output from a fountain code over the user's speaker
embedding. A receiver that hasn't yet recovered a particular speaker's
embedding could just use a dummy one until it's recovered. Bonus:
with this approach you get a good voice changer as a side effect,
increasing the appeal of ham radio to CB users! :P
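
A minimal sketch of what such a scheme might look like, using a random
linear fountain code over GF(2) as one possible choice of fountain code.
Everything here is an assumption for illustration: the 64-bit embedding
size, the use of the frame number as a shared seed (the receiver would
need to know it), and the names mask_for, encode_bit and Decoder. It is
not taken from the Facebook work or from Codec 2.

```python
import random

K = 64  # hypothetical embedding size in bits; a real embedding would be far larger

def parity(x):
    """Return 1 if x has an odd number of set bits, else 0."""
    return bin(x).count("1") & 1

def mask_for(frame_no, k=K):
    # Sender and receiver derive the same pseudo-random subset of
    # embedding bits from the frame number (assumed known to both).
    return random.Random(frame_no).getrandbits(k)

def encode_bit(embedding, frame_no, k=K):
    """One fountain-coded bit: XOR of the embedding bits selected by the mask."""
    return parity(embedding & mask_for(frame_no, k))

class Decoder:
    """Collects 1-bit equations and solves them by Gaussian elimination over GF(2)."""

    def __init__(self, k=K):
        self.k = k
        self.rows = {}  # pivot bit position -> (mask, bit)

    def add(self, frame_no, bit):
        mask = mask_for(frame_no, self.k)
        while mask:
            p = mask.bit_length() - 1  # highest set bit is the pivot
            if p not in self.rows:
                self.rows[p] = (mask, bit)
                break
            m, b = self.rows[p]
            mask ^= m  # eliminate the pivot using the stored row
            bit ^= b

    def done(self):
        return len(self.rows) == self.k

    def solve(self):
        if not self.done():
            raise ValueError("not enough independent frames yet")
        x = 0
        # Each row only involves bits at or below its pivot, so solving
        # from the lowest pivot upward is plain back-substitution.
        for p in sorted(self.rows):
            mask, bit = self.rows[p]
            if bit ^ parity(mask & x):
                x |= 1 << p
        return x

# One decoder per callsign; frames can be lost without any retransmission.
embedding = random.Random(1234).getrandbits(K)
decoders = {"N0CALL": Decoder()}
frame_no = 0
while not decoders["N0CALL"].done():
    if frame_no % 3 != 0:  # simulate losing every third frame
        decoders["N0CALL"].add(frame_no, encode_bit(embedding, frame_no))
    frame_no += 1
recovered = decoders["N0CALL"].solve()
```

Until done() returns true, the receiver would keep synthesizing with a
dummy embedding. Any sufficiently large set of received frames works,
which is the property that makes a fountain code attractive here.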


_______________________________________________
Freetel-codec2 mailing list
Freetel-codec2@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freetel-codec2