Hi Greg, The file where you want to write rules for this is https://github.com/apertium/apertium-pol/blob/master/apertium-pol.pol.rlx
If you want something like "tacy is <det> before <n>", you could get that with SELECT DET IF (0 DET) (0 NOUN) (1 NOUN) ; Daniel On Mon, Dec 20, 2021 at 1:40 PM Grzegorz Kulik <gregorykku...@gmail.com> wrote: > > Hello all, > > I haven't contacted you for some time, I hope you are all well. I developed > the pol-szl pair and although the translation is quite reasonable, I decided > to make it better by improving the lexical selection. I've been reading the > documentation and managed to write several rules for forms that need > disambiguation and are the same parts of speech. However, I cannot find any > information anywhere about what to do if there is a form that can mean two > completely different things. Example in Polish: > > tacy (such) = taki<det><dem><mp><pl><nom> > tacy (of a tablet) = > taca<n><f><sg><gen>/taca<n><f><sg><dat>/taca<n><f><sg><loc> > > The first meaning is obviously much more frequent but the translator chooses > the second one, which is less than desirable. > > What can I do to remedy this? Can I write rules for that manually? Should I > train the tagger? If so, what method would be the best? There's multiple > training methods and I don't know which one to choose for my pair. Could you > recommend me the best approach? > > Thank you in advance > Greg > _______________________________________________ > Apertium-stuff mailing list > Apertium-stuff@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/apertium-stuff _______________________________________________ Apertium-stuff mailing list Apertium-stuff@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/apertium-stuff