Re: [PD] speech recognition and ethics

Ivica Ico Bukvic Sat, 07 Feb 2015 10:30:58 -0800

There is still the access to computational power challenge, unless we make a seti@home-like speech recognition crawler, which in and of itself has similar ethical implications.


On 2/7/2015 12:55 PM, Spencer Russell wrote:

I saw a really interesting talk last year by Johan Schalkwyk, the head of the Google speech recognition group. One of the points he made was that while Google's algorithms are important, they got a lot more leverage from the sheer amount of data they have access to. It allows them to get away with much simpler algorithms. I think that's one of the biggest problems with trying to compete with Google and Apple on speech recognition, because OSS developers just don't have access to a huge corpus of data.
Even though a lot of that data is unlabeled (they don't know what the actual words are that correspond to the audio), they have a huge amount of interaction data, so they can for instance look at whether the user tried multiple times with a particular phrase or whether the user accepted a given transcription.
It seems like if we want an open-source speech recognition package we should focus on finding ways to get an accessible shared corpus. Unless there was some tricky licensing I think that corpus would also benefit the big guys though, so their corpus would remain a proper superset of what's available to OSS developers.
On Sat, Feb 7, 2015, at 11:39 AM, Jonathan Wilkes via Pd-list wrote:
Hi list,
Here's a fun thought-experiment: suppose you're doing a port of Pd, and the graphics toolkit you're using will include functionality to hook in to Google's speech recognition API. Such an API could make the software accessible to people who would otherwise find it very hard to write Pd patches.
However, the API works by shipping off your audio data to Google's servers, doing the computation on their machines, and sending you back the results.
Do you use the API in your port, or not?
I'm decidedly not going to use that API, for what I think are obvious security, privacy, and philosophical reasons. But I'm curious just how obvious the security and privacy implications are to others here. How many people would use a speech-patching mechanism that sends all your speech to Google?
I'm also increasingly worried by the apparent gap between the usability of Google and Apple's products, and the seemingly glacial pace at which _usable_ free software speech recognition is being developed. My position won't change, but I'm afraid it's becoming more symbolic than practical as these insecure tools become a natural part of most people's lives.
-Jonathan
_________________________________________________
[email protected] mailing list
UNSUBSCRIBE and account-management -> http://lists.puredata.info/listinfo/pd-list


--
Ivica Ico Bukvic, D.M.A.
Associate Professor
Computer Music
ICAT Senior Fellow
DISIS, L2Ork
Virginia Tech
School of Performing Arts – 0141
Blacksburg, VA 24061
(540) 231-6139
[email protected]
www.performingarts.vt.edu
disis.music.vt.edu
l2ork.music.vt.edu

