Hi Juan, $ echo "El Papa desea jubilarse" | apertium -d ./ spa-arg LO Papa deseya chubilar-se $ echo "Lo Papa deseya chubilar-se" | apertium -d ./ arg-spa El Papá desea jubilarse
I'm not reproducing the pronoun issue. As for the strange capitalization coming from the postgenerator, no one has yet come up with a way for it to behave correctly on overlapping matches, format handling, and capitalization simultaneously, so my general recommendation is to move capitalization to a separate post-processor (https://wiki.apertium.org/wiki/Capitalization_restoration), which I'm happy to help set up for anyone who's interested in trying it. Daniel On Thu, May 4, 2023 at 10:10 AM Juan Pablo <jpm...@unizar.es> wrote: > > Dear Apertiumers: > > I have the spa-arg, arg-cat pairs almost ready for a new release (See * > below the signature if you want more context on the new versions). > > I had been working on spa-arg with a previous version of the development > tools (the one installed by default in Apertium Virtual Box > https://wiki.apertium.org/wiki/Apertium_VirtualBox). But in order to > work with the arg-cat pair, I have needed to update apertium-all-dev to > the last version in https://apertium.projectjj.com/apt/install-nightly.sh. > > Without changing anything more, I have observed two changes in the > behaviour of the spa-arg pair after updating to the last version: > > 1) The verbs with enclitic pronoun se (as in: irse in Spanish, ir-se in > Aragonese) are not well generated. > 2) In spa->arg, when I have the article in capital initial "El" and the > next word also begins by capital, like "El Papa", the result is "LO > Papa" when it should be "Lo Papa". This might be related to the > postgenerator, which replaces ~O by LO (but it should be "Lo"). > > Using a sentence to illustrate both phenomena, I get: > > echo "El Papa desea jubilarse" | apertium -d ./ spa-arg > LO Papa deseya #chubilar > > ~/dev/apertium-spa-arg/$ echo "Lo Papa deseya chubilar-se" | apertium -d > ./ arg-spa > El Papá desea #jubilar > > This did not happen with the old version I had installed before. Do you > have a clue what may be happening and how to solve it? > > Thanks, > Juan Pablo > > *In early April, the Academia Aragonesa de la Lengua, the official > standardization/normativisation body for Aragonese language (created in > 2021 by the Government of Aragon, and to which I belong myself), > approved and published the official spelling rules for Aragonese: > https://www.boa.aragon.es/cgi-bin/EBOA/BRSCGI?CMD=VEROBJ&MLKOB=1272612550808. > > This is good news for Aragonese, as it puts an end to the situation of > different concurrent unofficial spelling norms. This required, of > course, an adaptation of the Aragonese translation pairs, so that they > will generate Aragonese according to the official spelling. I have kept, > though, compatibility with the previous reference spelling used by > Apertium. So spa-arg and arg-cat are almost ready to release (also > including some changes performed in the last couple of years). > > > > > > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff _______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff