Robert Frederking
Thu, 21 Jan 2010 15:09:07 -0800
Hi Vadim,Yes, French is the principal written language, but most of the population only speaks Creole (and is illiterate). We ourselves are indeed looking at making speech-based systems (and the rarest part of the data may be the speech data). There may also be unforeseen benefits to the data being available. For example, it appears that Doctors Without Borders (Médecins Sans Frontières) may make use of the bilingual medical phrases as-is, through Translators Without Borders (Traducteurs sans Frontières). So who knows how this may help. Cheers.
Bob // Vadim Berman wrote:
Hi Robert,These are commendable efforts, but isn't French the principal written language in Haiti? Or you are talking about a speech to speech system?Best regards, Vadim ----- Original Message ----- From: "Robert Frederking" <r...@cs.cmu.edu> To: <mt_l...@nist.gov>; <mt-list@eamt.org> Sent: Friday, January 22, 2010 6:49 AMSubject: [Mt-list] Public release of Haitian Creole language data byCarnegie MellonThe Language Technologies Institute (LTI) of Carnegie Mellon University'sSchool of Computer Science (CMU SCS) is making publicly available theHaitian Creole spoken and text data that we have collected or produced. Weare providing this data with minimal restrictions in order toallow others to develop language technology for Haiti, in parallel with our own efforts to help with this crisis. Since organizing the data in a useful fashion is not instantaneous, and more text data is currently being produced by collaborators, we will be publishing the data incrementally on the web,as it becomes available. To access the currently available data, please visit the website at http://www.speech.cs.cmu.edu/haitian/ _______________________________________________Mt-list mailing list
_______________________________________________ Mt-list mailing list