On 26 September 2010 20:17, Kevin Brubeck Unhammer <[email protected]> wrote: > 2010/9/26 Jimmy O'Regan <[email protected]>: >> Sun Sep 26 15:42:44 IST 2010 >> * Initial release (0.1.0) >> * Caveats: >> - Functions only in an->es direction >> - Several closed category words missing from an analyser >> (including "ir") >> - "Cowboys, Ted!" >> This system has been put together in a very shoddy, MacGuyver-ish >> way: >> The majority of the lexicon has been composed on the basis >> of presumed cognates. For the most part, this has been >> restricted to Latin derivatives, but on more than one occasion, I >> simply went nuts and pulled in anything the Spanish analyser would >> recognise. >> The only bitexts available were the UN Declaration of Human Rights >> and the welcome message for new users of the Aragonese Wikipedia. >> Statistical methods were not widely employed. >> To deal with the spelling variations, I abused the heck out of sed, >> filtering unknowns repeatedly before passing the result through the >> analyser, to pluck out the results. Much of the ~8000 words in the >> bilingual lexicon are mere variations. (In a particularly ironic >> twist, it has 3 variations of 'normalización'). These variants will >> need to be sorted out to have es->an: the first translation made with >> this system before release was of the document on an.wikipedia >> describing the new spelling rules. >> Although I got some notes from Juan Pablo Martínez on the equivalents >> of ser and estar, I was not able to get further information. My >> "solution" is to ignore the issue and come back to it later. >> Also, Juan Pablo added some vocabulary to the analyser, most of which >> I have not been able to use for lack of translations. Hopefully, we >> can get these reinstated soon. >> A tagger has yet to be trained for Aragonese; during development, I found >> the Spanish tagger to be sufficient, and so have used that. This is a >> temporary measure. >> >> >> The release is a little premature, perhaps, but I want a release to >> mark the European Day of Languages. It's not bad for approximately 3 >> weeks' work :) >> > > Congrats! > > And happy Language Day, all :-)
Wszystkiego najlepszego! -- <Leftmost> jimregan, that's because deep inside you, you are evil. <Leftmost> Also not-so-deep inside you. ------------------------------------------------------------------------------ Start uncovering the many advantages of virtual appliances and start using them to simplify application deployment and accelerate your shift to cloud computing. http://p.sf.net/sfu/novell-sfdev2dev _______________________________________________ Apertium-stuff mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/apertium-stuff
