Re: [Hpr] takov751

Mike Ray Tue, 11 Jan 2022 10:53:09 -0800


I can.

As can a lot of blind people who rely on tts.

Perhaps not for prose, but certainly for source code.







On 11/01/2022 18:49, Nigel Verity wrote:

Hi Ken

The third voice is head and shoulders above the other two to my ears. Above 
200% speed I struggle to take everything in. By the time you get to 500% it is 
just a joke. Surely nobody can follow speech at that speed.

Beeza

  LibreOffice - Free and open source office suite: LibreOffice 
Website<https://www.libreoffice.org>
  Respects your privacy, and gives you back control over your data

________________________________
From: Hpr <hpr-boun...@hackerpublicradio.org> on behalf of Ken Fallon 
<k...@fallon.ie>
Sent: 11 January 2022 18:09
To: m...@raspberryvi.org <m...@raspberryvi.org>; hpr@hackerpublicradio.org 
<hpr@hackerpublicradio.org>
Subject: Re: [Hpr] takov751

Hi Mike,

As a TTS engine for reading the screen back to me I am more than happy
with it and use it continually during the day. It's not just visually
impaired people that rely on TTS. It does that job and does it well.

The objection I have to using espeak as the voice of HPR is that it is
harsh, unfriendly and not welcoming, its so bad in fact that it makes
kids cry. I speak from personal experience. When my kids were small I
made a project based on espeak (in English) for them to interact with.
It was a disaster. When the espeak voice started speaking they got
scared, started to cry, ran away, and never wanted to have anything to
do with it again.

Over the years the biggest objection to the TTS on HPR has been the
espeak voice. It has also been the biggest point of negative feedback I
get when trying to promote HPR to potential interviewees or projects.

If those are not valid enough reasons then I don't know what will
convince you. I can also assure you my desktop wallpaper is the default
supplied with my distro.

In the past it has been argued that the more natural voices are
difficult to understand when sped up. So I took the two most natural
voices from the list and posted a side by side comparison to espeak at
150%, 200%, 250%,  300%, 350%, 400%, 450%, and 500%. In my opinion the
coqui-tts_en_en_ljspeech is more understandable than espeak at every speed.

Can everyone have a listen to this and tell me your preference
https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fhackerpublicradio.org%2Ftts-espeak-ljspeech-vctk-normal-150-200-250-300-350-400-450-500-percent.ogg&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=LU99WKYC1M8TCvMdKh8hM6N4h%2BalZ9gmdlGY%2FG65VtA%3D&amp;reserved=0

Ken.



On 2022-01-11 14:35, Mike Ray wrote:



And here was me thinking about posting to the list about how much
better  it is now with the softer music in the background and a nice
punchy eSpeak voice.

I still have no idea what the objection to the eSpeak voice is.

If you spend as many hours a day coding as I do, and rely on tts to
make this possible, then eSpeak is the way forward. Although I know
this may only be true for English speakers. Not sure how good eSpeak
is at other languages.

People who complain about eSpeak are probably the same people who
never get any work done because they are constantly fiddling with the
desktop wallpaper.

:-p





On 11/01/2022 10:44, Ken Fallon wrote:

Hi All,

We got a comment from takov751 via 
https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fmatrix.to%2F%23%2F%23hpr%3Amatrix.org&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=Sxnt0xVQ%2B1EpCxXi%2FtWfLHlgnsyM4FMpp9X42%2FQY0KY%3D&amp;reserved=0

<quote>

Greetings i am a long listener of the shows . And of course planing
to make my first show . I would like to ask question regarding tts at
the beginning of the show usually I hear the espeak robotic voice .
In the workflow  have you considered using mimic1 or opentts /
Mozillatts or something along those lines ? It’s seems like these
would be compatible with licensing as well and bir more realistic
voices . A few examples 
https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fhub.docker.com%2Fu%2Fsynesthesiam&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=AE86feZPVXgAFElFBdWJkO3f48W6FbaCChLwJd9jBpU%3D&amp;reserved=0

</quote>

Re-posted with permission

Sample voices are here: 
https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsynesthesiam.github.io%2Fopentts&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=AmlRQ9dHu3Es2TwBCFCDQO%2BAUFfWza0TmcABNESk2D8%3D&amp;reserved=0

@Mike Ray

I would like to try and get a happy balance between meeting your
needs and having a voice that is friendly. While I love espeak it is
not friendly - it literally put my kids in tears when they were
younger :-)

Could you have a listen to some of the other voices and see if any of
them come close to your requirements for TTS

FYI I find these two "friendly"

   * 
https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsynesthesiam.github.io%2Fopentts%2F%23coqui-tts_en_en_ljspeech&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=%2Fe%2F0wKykwaqOT%2BZY26xirz4ufyJoUmvU9CSogEl7L5Y%3D&amp;reserved=0
   * 
https://emea01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsynesthesiam.github.io%2Fopentts%2F%23coqui-tts_en_en_vctk&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=3RJ0LsgU6xIWhdYG5DjCHWlmkLwA3CQ4lhs4BBhir6U%3D&amp;reserved=0



_______________________________________________
Hpr mailing list
Hpr@hackerpublicradio.org
https://emea01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fhackerpublicradio.org%2Fmailman%2Flistinfo%2Fhpr_hackerpublicradio.org&amp;data=04%7C01%7C%7Caa9e5cf113f0411706b908d9d52db85a%7C84df9e7fe9f640afb435aaaaaaaaaaaa%7C1%7C0%7C637775214626255386%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000&amp;sdata=B1PV53RnsSgVJIN3HMGrYK909MTpLrMlFxu07cw3mtQ%3D&amp;reserved=0



--
Michael A. Ray
Analyst/Programmer
Witley, Surrey, South-east UK

He/him

"Perfection is achieved, not when there is nothing more to add, but whenthere is nothing left to take away." -- A. de Saint-Exupery


_______________________________________________
Hpr mailing list
Hpr@hackerpublicradio.org
http://hackerpublicradio.org/mailman/listinfo/hpr_hackerpublicradio.org

Re: [Hpr] takov751

Reply via email to