Hi all, At the Opencast meeting in June in LA, I presented some figures on baseline speech recognition performance with Sphinx4 for a set of lecture recordings from Open Yale Courses.
I've now put the complete data set for these online, described at: http://trulymadlywordly.blogspot.com/2011/12/sphinx4-speech-recognition-results-for.html The figures and resulting output don't look very encouraging, but keep in mind that a primary purpose of automated speech recognition is to produce searchable time-aligned transcripts rather than a readable result. I'd be interested in comparative results if anyone runs these recordings through another speech recognition engine, or one of the hosted services. Regards Stephen Stephen Marquard, Acting Director Centre for Educational Technology, University of Cape Town http://www.cet.uct.ac.za Email/IM/XMPP: [email protected] Phone: +27-21-650-5037 Cell: +27-83-500-5290 ### UNIVERSITY OF CAPE TOWN This e-mail is subject to the UCT ICT policies and e-mail disclaimer published on our website at http://www.uct.ac.za/about/policies/emaildisclaimer/ or obtainable from +27 21 650 9111. This e-mail is intended only for the person(s) to whom it is addressed. If the e-mail has reached you in error, please notify the author. If you are not the intended recipient of the e-mail you may not use, disclose, copy, redirect or print the content. If this e-mail is not related to the business of UCT it is sent by the sender in the sender's individual capacity. ### _______________________________________________ Community mailing list [email protected] http://lists.opencastproject.org/mailman/listinfo/community To unsubscribe please email [email protected] _______________________________________________
