one blocker to continuing is being unsure how to generate novel voices with whisperspeech
(another is finding this intense). maybe i’ll see if there are speaker embeddings in there somewhere that i could feed random numbers to, or gently mutate, or something
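
rough sketch of the "random numbers or gentle mutation" idea, assuming the speaker embedding turns out to be a plain vector of floats. nothing here touches whisperspeech itself: `random_embedding`, `mutate_embedding`, the `amount` knob and the 192 size are all made up for illustration, and whether the model gives a usable voice for vectors like these is untested.

```python
import math
import random


def norm(vec):
    """Euclidean length of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in vec))


def random_embedding(dim, rng, target_norm=1.0):
    """Draw a random direction and scale it to target_norm.

    Assumption: the real embeddings live at some typical norm; check an
    actual extracted embedding and pass its norm here.
    """
    vec = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    scale = target_norm / norm(vec)
    return [x * scale for x in vec]


def mutate_embedding(emb, rng, amount=0.1):
    """Nudge an embedding with Gaussian noise, then restore its norm.

    amount is roughly the noise length as a fraction of the embedding
    length, so 0.1 is a gentle mutation and 1.0 is mostly a new vector.
    """
    original_norm = norm(emb)
    if original_norm == 0.0:
        raise ValueError("cannot mutate an all-zero embedding")
    sigma = amount * original_norm / math.sqrt(len(emb))
    noisy = [x + rng.gauss(0.0, sigma) for x in emb]
    scale = original_norm / norm(noisy)
    return [x * scale for x in noisy]


rng = random.Random(0)                 # fixed seed so a voice can be reproduced
base = random_embedding(192, rng)      # 192 is a placeholder size, not confirmed
variant = mutate_embedding(base, rng, amount=0.1)
```

starting from a real extracted embedding and mutating it is probably the safer first try, since a fully random vector may land far from anything the model saw in training.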
