Re: [Trisquel-users] Text to speech

tobias Thu, 09 Jan 2014 06:36:48 -0800

I'm currently working on a free software replacement for the non-free mbrola.

The hardest part of building a speech synthesis system is actually the creation of a voice library. I decided to use human speech recordings instead of formant synthesis. For me it began in 2011, when I was looking for singing synthesizer software. I found many nonfree programs, such as Myriad Virtual Singer, OGI Flinger, Vocaloid and UTAU. As I was unable to find a free replacement, I decided to write one.

In the meantime I found out that some plugins for UTAU are free software, but I still had to replace the nonfree GUI, which is also tied to Windows. One existing GPLv3 UTAU plugin is v.Connect-STAND [1], which is based on WORLD [2]. v.Connect-STAND has a more natural sound [3] than eCantorix [4], but it is limited to the Japanese language. I was able to compile it, but I do not know how to use it.

My free program will be based on WORLD, and it will allow speech/singing synthesis by Collaborative Creation. The algorithms used in WORLD are described in [5]. I chose a design that lets it be multilingual.

[1] http://hal-the-cat.music.coocan.jp/ritsu_e.html
[2] http://ml.cs.yamanashi.ac.jp/world/english/
[3] https://www.youtube.com/watch?v=to28rvoNYfY
[4] https://github.com/divVerent/ecantorix/wiki/Songs
[5] http://iwk.mdw.ac.at/lit_db_iwk/download.php?id=18114
