Thanks for sharing this great news, Daniel! Is there anything special that needs to be done to leverage this new approach to capitalisation in new pairs created using apertium-init?
-- Jonathan

On Tue, 27 Dec 2022 at 10:33, Daniel Swanson <awesomeevildu...@gmail.com> wrote:
>
> Greetings Apertiumers!
>
> For anyone testing this, I've now also added -w/--dictionary-case to
> apertium-{transfer,interchunk,postchunk} which makes the
> capitalization instructions simply do nothing so we don't have two
> conflicting sets of rules trying to solve the same problem in opposite
> ways.
>
> Daniel
>
> On Tue, Dec 27, 2022 at 6:47 AM Marc Riera Irigoyen
> <marc.riera.irigo...@gmail.com> wrote:
> >
> > Thanks for the great work! I'll make sure to test it with apertium-eng-cat,
> > which has generation errors due to capitalization.
> >
> > Happy holidays!
> >
> > Marc Riera
> >
> >
> > On Sat, 24 Dec 2022 at 14:12, Hèctor Alòs i Font <hectora...@gmail.com>
> > wrote:
> >>
> >> Looks very good, Daniel. Thanks in advance. I'll try to test in the next
> >> days in the pairs I maintain.
> >> Merry Christmas/Hanukkah/New Year/*.
> >> Hèctor
> >>
> >> On Fri, 23 Dec 2022 at 0:41, Daniel Swanson <awesomeevildu...@gmail.com>
> >> wrote:
> >>>
> >>> Greetings Apertiumers!
> >>>
> >>> I have two updates to report:
> >>>
> >>> First, I have rewritten the postgenerator (again), this time as part
> >>> of apertium-separable (and so not breaking the old one, unlike last
> >>> time), and in such a way that postgenerator rules can both match on
> >>> lemma and tags in addition to surface forms and iteratively apply to
> >>> their own output.
> >>>
> >>> This is available as part of apertium-separable 0.7.0 and is
> >>> documented at https://wiki.apertium.org/wiki/Postgenerator
> >>>
> >>> Second, I just added a pair of modules which move capitalization
> >>> information into word-bound blanks at the beginning of the pipeline
> >>> and then reapply them according to LRX-like rules at the end of the
> >>> pipeline, allowing all intermediate modules to operate solely on
> >>> dictionary case.
> >>>
> >>> This should be available after the next nightly build (i.e. tomorrow)
> >>> in apertium 3.9.0, and is documented at
> >>> https://wiki.apertium.org/wiki/Capitalization_restoration
> >>>
> >>> If anyone has questions or would like help trying this out for a
> >>> language pair or if I missed something in the documentation, let me
> >>> know.
> >>>
> >>> Thanks to Kevin Unhammer and Marc Riera for helping me figure out what
> >>> the design of the capitalization module should be.
> >>>
> >>> Merry Christmas,
> >>> Daniel
> >>>
> >>> P.S. To anyone not interested in either of these developments: your
> >>> Christmas gift is that I accidentally made lexical selection quite a
> >>> bit faster while I was working on these.
> >>>
> >>>
> >>> _______________________________________________
> >>> Apertium-stuff mailing list
> >>> Apertium-stuff@lists.sourceforge.net
> >>> https://lists.sourceforge.net/lists/listinfo/apertium-stuff