Sun Sep 26 15:42:44 IST 2010
* Initial release (0.1.0)
* Caveats:
- Functions only in an->es direction
- Several closed category words missing from an analyser
(including "ir")
- "Cowboys, Ted!"
This system has been put together in a very shoddy, MacGuyver-ish
way:
The majority of the lexicon has been composed on the basis
of presumed cognates. For the most part, this has been
restricted to Latin derivatives, but on more than one occasion, I
simply went nuts and pulled in anything the Spanish analyser would
recognise.
The only bitexts available were the UN Declaration of Human Rights
and the welcome message for new users of the Aragonese Wikipedia.
Statistical methods were not widely employed.
To deal with the spelling variations, I abused the heck out of sed,
filtering unknowns repeatedly before passing the result through the
analyser, to pluck out the results. Much of the ~8000 words in the
bilingual lexicon are mere variations. (In a particularly ironic
twist, it has 3 variations of 'normalización'). These variants will
need to be sorted out to have es->an: the first translation made with
this system before release was of the document on an.wikipedia
describing the new spelling rules.
Although I got some notes from Juan Pablo MartÃnez on the equivalents
of ser and estar, I was not able to get further information. My
"solution" is to ignore the issue and come back to it later.
Also, Juan Pablo added some vocabulary to the analyser, most of which
I have not been able to use for lack of translations. Hopefully, we
can get these reinstated soon.
A tagger has yet to be trained for Aragonese; during development, I found
the Spanish tagger to be sufficient, and so have used that. This is a
temporary measure.
The release is a little premature, perhaps, but I want a release to
mark the European Day of Languages. It's not bad for approximately 3
weeks' work :)
--
<Leftmost> jimregan, that's because deep inside you, you are evil.
<Leftmost> Also not-so-deep inside you.
------------------------------------------------------------------------------
Start uncovering the many advantages of virtual appliances
and start using them to simplify application deployment and
accelerate your shift to cloud computing.
http://p.sf.net/sfu/novell-sfdev2dev
_______________________________________________
Apertium-stuff mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/apertium-stuff