I use quite a bit of text-to-speech in my Blather commands, and eSpeak is by 
far the best tool for the job in terms of performance. I tried Festival and it 
was very laggy and tended to make the whole system drag. Espeak is so 
lightweight that every response is instantaneous. I love it. :)

Jon 

Jonathan Kulp
http://jonathankulp.org

> On Apr 15, 2019, at 9:34 PM, Klaatu <[email protected]> wrote:
> 
> An HPR episode on this topic would be amazing. If you're too busy, i'd be 
> happy to read your email into a recorder and release the show in your name.
> 
>> On 16 April 2019 1:10:29 PM NZST, Mike Ray <[email protected]> wrote:
>> 
>> I got an error when I tried to post a comment about the latest podcast
>> about TTS.
>> 
>> I don't have the error code now as I pasted it into an email to admin
>> and the email bounced.
>> 
>> I suspect it was because my comment was too long, here it is:
>> 
>> 
>> Condescending and sarcastic.
>> 
>> Oh isn't text-to-speech such a laugh?
>> 
>> I get really, really annoyed when people criticise eSpeak.
>> 
>> Anybody who complains about it sounding robotic obviously was not around
>> thirty years ago.
>> 
>> eSpeak's author, Jonathan Duddington, in my humble opinion, deserves a
>> Nobel prize.
>> 
>> He has probably done more for blind and visually impaired computer
>> users, like me, all over the world, than any other individual or
>> organisation.
>> 
>> In fact it is hard to name any single person who has had such an impact
>> on any group of users, apart perhaps from Linus Torvalds and Richard
>> Stallman.
>> 
>> 1. It is Open Source.
>> 2. It is tiny, the memory footprint is small.
>> 3. It is snappy and can speak really fast, which is what we (blind
>> people) use when we get used to it, speeds that would make your hair curl.
>> 4. It probably has more language support than any other free and Open
>> Source synthesiser.
>> 5. It can run in a mode where it can return rendered speech, in the form
>> of PCM, to a calling program, so it can be used in other programs. I
>> don't think any other synth can do this.
>> 6. It can even return phonemes, a mode which I have used more than once
>> to provide a kind of 'fuzzy search' in a database.
>> 
>> I regularly write and maintain library code, and application code, in C,
>> C++ and/or Python, as well as Perl, and many of these code libraries
>> have in excess of 100k lines.
>> 
>> Including, incidentally, a library which used a combination of eSpeak
>> and OMX to render TTS directly on the GPU of a Raspberry Pi when 'they'
>> broke the ALSA driver on the Pi, which made the speech stutter and crash
>> the kernel, and refused to fix it for about four years.
>> 
>> If I spent all my time bitching about how robotic eSpeak is I would
>> never get any work done.
>> 
>> How much time do you spend when you should be writing code, worrying
>> about your wallpaper or the colour of your screen's background?
>> 
>> Or do you just not notice it after a while?
>> 
>> Well, after spending years writing code when I can't even see the Sun
>> when I stare directly at it, I can tell you I never notice what eSpeak
>> sounds like.
>> 
>> I would probably be equally at home working with flite, festival, or
>> svox pico (which you missed).
>> 
>> In addition, eSpeak is in use in NVDA, the free and Open Source Windows
>> screen reader which is currently giving the multi-hundreds of pounds
>> commercial offerings a real problem, and providing cash-strapped blind
>> users a chance.  Although now the Windows Narrator is catching up, I
>> still prefer NVDA and eSpeak.
>> 
>> MaryTTS is bloated.  There was some excitement around it a few years
>> ago, but it has more or less faded away in the minds of the blind and VI
>> community, since it is so bloated and, as far as I know, nobody has ever
>> made a successful screen reader from it.
>> 
>> Even if there was one, it would probably make a Raspberry Pi choke.
>> Whereas eSpeak runs snappily and happily on a 256k Raspberry Pi first-gen.
>> 
>> The 'holy trinity' of the Linux GUI, as far as blind and VI users are
>> concerned, is:
>> 
>> 1. Orca, the GTK screen reader, written in Python, and a work of art.
>> 2. speech-dispatcher, written in C, a TTS 'server' program which Orca
>> connects to to send text and get speech from it.
>> 3. eSpeak, although there are speech-dispatcher modules also for flite
>> and festival, eSpeak is the best one IMHO.
>> 
>> In the console:
>> 
>> 1. SpeakUp, kernel modules including speakup and speakup_soft which make
>> a console mode screen reader.
>> 2. espeakup, the SpeakUp to eSpeak connector.
>> 3. eSpeak.
>> 
>> eSpeak is gold dust.
>> 
>> 
>> 
> _______________________________________________
> Hpr mailing list
> [email protected]
> http://hackerpublicradio.org/mailman/listinfo/hpr_hackerpublicradio.org
_______________________________________________
Hpr mailing list
[email protected]
http://hackerpublicradio.org/mailman/listinfo/hpr_hackerpublicradio.org

Reply via email to