mt-list  

Re: [Mt-list] Public release of Haitian Creole language data byCarnegie Mellon

Robert Frederking
Thu, 21 Jan 2010 15:09:07 -0800

Hi Vadim,
Yes, French is the principal written language, but most of the population only speaks Creole (and is illiterate). We ourselves are indeed looking at making speech-based systems (and the rarest part of the data may be the speech data). There may also be unforeseen benefits to the data being available. For example, it appears that Doctors Without Borders (Médecins Sans Frontières) may make use of the bilingual medical phrases as-is, through Translators Without Borders (Traducteurs sans Frontières). So who knows how this may help. Cheers.

   Bob
//
Vadim Berman wrote:
Hi Robert,

These are commendable efforts, but isn't French the principal written language in Haiti? Or you are talking about a speech to speech system?

Best regards,
Vadim

----- Original Message ----- From: "Robert Frederking" <r...@cs.cmu.edu>
To: <mt_l...@nist.gov>; <mt-list@eamt.org>
Sent: Friday, January 22, 2010 6:49 AM
Subject: [Mt-list] Public release of Haitian Creole language data byCarnegie Mellon


The Language Technologies Institute (LTI) of Carnegie Mellon University's
School of Computer Science (CMU SCS) is making publicly available the
Haitian Creole spoken and text data that we have collected or produced. We
are providing this data with minimal restrictions in order to
allow others to develop language technology for Haiti, in parallel with our own efforts to help with this crisis. Since organizing the data in a useful fashion is not instantaneous, and more text data is currently being produced by collaborators, we will be publishing the data incrementally on the web,
as it becomes available.  To access the currently available data, please
visit the website at  http://www.speech.cs.cmu.edu/haitian/

_______________________________________________
Mt-list mailing list




_______________________________________________
Mt-list mailing list