Hi László,

Another question about affixes optimization:

First, here is the prefixation flags for elisions (nouns and adjectives only):


PFX L' Y 1
PFX L'   0    l'       [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ]      le|la+

PFX D' Y 1
PFX D'   0    d'       [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ]      de+

PFX Qu Y 1
PFX Qu   0    qu'      [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ]      que+

PFX Qq Y 1
PFX Qq   0    quoiqu'  [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ]      quoique+

PFX Qp Y 1
PFX Qp   0    puisqu'  [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ]      puisque+

PFX Ql Y 1
PFX Ql   0    lorsqu'  [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ]      lorsque+



For common plural, I use the following flag:

SFX S. Y 1
SFX S.   0          s                   [^sxz]              /pl

But for nouns and adjectives which begin by a vowel, I need to add elisions, so I created another flag:

SFX S* Y 2
SFX S*   0          0/D'L'QuQqQlQp      [^sxz]
SFX S*   0          s/D'QuQqQlQp        [^sxz]              /pl

In the dictionary, it looks like this:
bandage/S.                    [nm]
bande/S.                      [nf]
attention/S*()                [nf]
attentisme/S*()               [nm]

() is the NEEDAFFIX flag

S. is used ~24400 times
S* is used ~7400 times



This works also the same for adjectives and feminine/masculine nouns: one flag for simple plural forms and another one for the same thing + elisions.

SFX F. Y 64
SFX F.   0          0                   .                   .fem
SFX F.   0          s                   [eë]                .fem/pl
SFX F.   e          0                   [éiï]e              .mas
SFX F.   e          s                   [éiï]e              .mas/pl
SFX F.   rice       eur                 [dt]rice            .mas
SFX F.   rice       eurs                [dt]rice            .mas/pl
SFX F.   de         d                   de                  .mas
SFX F.   de         ds                  de                  .mas/pl
SFX F.   ffe        f                   ffe                 .mas
SFX F.   ffe        fs                  ffe                 .mas/pl
SFX F.   che        c                   [^è]che             .mas
SFX F.   che        cs                  [^è]che             .mas/pl
SFX F.   èche       ec                  èche                .mas
SFX F.   èche       ecs                 èche                .mas/pl
SFX F.   he         h                   [ut]he              .mas
SFX F.   he         hs                  [ut]he              .mas/pl
SFX F.   ke         k                   ke                  .mas
SFX F.   ke         ks                  ke                  .mas/pl
SFX F.   le         l                   [aiouû]le           .mas
SFX F.   le         ls                  [aiouû]le           .mas/pl
SFX F.   lle        l                   [eiu]lle            .mas
SFX F.   lle        ls                  [eiu]lle            .mas/pl
SFX F.   ne         n                   [aiou]ne            .mas
SFX F.   ne         ns                  [aiou]ne            .mas/pl
SFX F.   nne        n                   [aeo]nne            .mas
SFX F.   nne        ns                  [aeo]nne            .mas/pl
SFX F.   igne       in                  igne                .mas
SFX F.   igne       ins                 igne                .mas/pl
SFX F.   re         r                   [aiuûy]re           .mas
SFX F.   re         rs                  [aiuûy]re           .mas/pl
SFX F.   ère        er                  ère                 .mas
SFX F.   ère        ers                 ère                 .mas/pl
SFX F.   se         s                   [^us]se             .mas.inv
SFX F.   esse       e                   [^eo].esse          .mas
SFX F.   esse       es                  [^eo].esse          .mas/pl
SFX F.   eresse     eur                 eresse              .mas
SFX F.   eresse     eurs                eresse              .mas/pl
SFX F.   oresse     eur                 oresse              .mas
SFX F.   oresse     eurs                oresse              .mas/pl
SFX F.   sse        s                   [^e]sse             .mas.inv
SFX F.   use        us                  [^eo]use            .mas.inv
SFX F.   euse       eur                 euse                .mas
SFX F.   euse       eurs                euse                .mas/pl
SFX F.   te         t                   [^èt]te             .mas
SFX F.   te         ts                  [^èt]te             .mas/pl
SFX F.   tte        t                   tte                 .mas
SFX F.   tte        ts                  tte                 .mas/pl
SFX F.   ète        et                  ète                 .mas
SFX F.   ète        ets                 ète                 .mas/pl
SFX F.   ue         u                   [^gq]ue             .mas
SFX F.   ue         us                  [^gq]ue             .mas/pl
SFX F.   gue        g                   gue                 .mas
SFX F.   gue        gs                  gue                 .mas/pl
SFX F.   cque       c                   cque                .mas
SFX F.   cque       cs                  cque                .mas/pl
SFX F.   que        c                   [^c]que             .mas
SFX F.   que        cs                  [^c]que             .mas/pl
SFX F.   ève        ef                  ève                 .mas
SFX F.   ève        efs                 ève                 .mas/pl
SFX F.   ve         f                   [iïu]ve             .mas
SFX F.   ve         fs                  [iïu]ve             .mas/pl
SFX F.   uë         u                   uë                  .mas
SFX F.   uë         us                  uë                  .mas/pl
SFX F.   üe         u                   üe                  .mas
SFX F.   üe         us                  üe                  .mas/pl

SFX F* Y 64
SFX F*   0          0/D'L'QuQqQlQp      .                   .fem
SFX F*   0          s/D'QuQqQlQp        [eë]                .fem/pl
SFX F*   e          0/D'L'QuQqQlQp      [éiï]e              .mas
SFX F*   e          s/D'QuQqQlQp        [éiï]e              .mas/pl
SFX F*   rice       eur/D'L'QuQqQlQp    [dt]rice            .mas
SFX F*   rice       eurs/D'QuQqQlQp     [dt]rice            .mas/pl
SFX F*   de         d/D'L'QuQqQlQp      de                  .mas
SFX F*   de         ds/D'QuQqQlQp       de                  .mas/pl
SFX F*   ffe        f/D'L'QuQqQlQp      ffe                 .mas
SFX F*   ffe        fs/D'QuQqQlQp       ffe                 .mas/pl
SFX F*   che        c/D'L'QuQqQlQp      [^è]che             .mas
SFX F*   che        cs/D'QuQqQlQp       [^è]che             .mas/pl
SFX F*   èche       ec/D'L'QuQqQlQp     èche                .mas
SFX F*   èche       ecs/D'QuQqQlQp      èche                .mas/pl
SFX F*   he         h/D'L'QuQqQlQp      [ut]he              .mas
SFX F*   he         hs/D'QuQqQlQp       [ut]he              .mas/pl
SFX F*   ke         k/D'L'QuQqQlQp      ke                  .mas
SFX F*   ke         ks/D'QuQqQlQp       ke                  .mas/pl
SFX F*   le         l/D'L'QuQqQlQp      [aiouû]le           .mas
SFX F*   le         ls/D'QuQqQlQp       [aiouû]le           .mas/pl
SFX F*   lle        l/D'L'QuQqQlQp      [eiu]lle            .mas
SFX F*   lle        ls/D'QuQqQlQp       [eiu]lle            .mas/pl
SFX F*   ne         n/D'L'QuQqQlQp      [aiou]ne            .mas
SFX F*   ne         ns/D'QuQqQlQp       [aiou]ne            .mas/pl
SFX F*   nne        n/D'L'QuQqQlQp      [aeo]nne            .mas
SFX F*   nne        ns/D'QuQqQlQp       [aeo]nne            .mas/pl
SFX F*   gne        n/D'L'QuQqQlQp      igne                .mas
SFX F*   gne        ns/D'QuQqQlQp       igne                .mas/pl
SFX F*   re         r/D'L'QuQqQlQp      [aiuûy]re           .mas
SFX F*   re         rs/D'QuQqQlQp       [aiuûy]re           .mas/pl
SFX F*   ère        er/D'L'QuQqQlQp     ère                 .mas
SFX F*   ère        ers/D'QuQqQlQp      ère                 .mas/pl
SFX F*   se         s/D'L'QuQqQlQp      [^us]se             .mas.inv
SFX F*   esse       e/D'L'QuQqQlQp      [^eo].esse          .mas
SFX F*   esse       es/D'QuQqQlQp       [^eo].esse          .mas/pl
SFX F*   eresse     eur/D'L'QuQqQlQp    eresse              .mas
SFX F*   eresse     eurs/D'QuQqQlQp     eresse              .mas/pl
SFX F*   oresse     eur/D'L'QuQqQlQp    oresse              .mas
SFX F*   oresse     eurs/D'QuQqQlQp     oresse              .mas/pl
SFX F*   sse        s/D'L'QuQqQlQp      [^e]sse             .mas.inv
SFX F*   use        us/D'L'QuQqQlQp     [^eo]use            .mas.inv
SFX F*   euse       eur/D'L'QuQqQlQp    euse                .mas
SFX F*   euse       eurs/D'QuQqQlQp     euse                .mas/pl
SFX F*   te         t/D'L'QuQqQlQp      [^èt]te             .mas
SFX F*   te         ts/D'QuQqQlQp       [^èt]te             .mas/pl
SFX F*   tte        t/D'L'QuQqQlQp      tte                 .mas
SFX F*   tte        ts/D'QuQqQlQp       tte                 .mas/pl
SFX F*   ète        et/D'L'QuQqQlQp     ète                 .mas
SFX F*   ète        ets/D'QuQqQlQp      ète                 .mas/pl
SFX F*   ue         u/D'L'QuQqQlQp      [^gq]ue             .mas
SFX F*   ue         us/D'QuQqQlQp       [^gq]ue             .mas/pl
SFX F*   gue        g/D'L'QuQqQlQp      gue                 .mas
SFX F*   gue        gs/D'QuQqQlQp       gue                 .mas/pl
SFX F*   cque       c/D'L'QuQqQlQp      cque                .mas
SFX F*   cque       cs/D'QuQqQlQp       cque                .mas/pl
SFX F*   que        c/D'L'QuQqQlQp      [^c]que             .mas
SFX F*   que        cs/D'QuQqQlQp       [^c]que             .mas/pl
SFX F*   ève        ef/D'L'QuQqQlQp     ève                 .mas
SFX F*   ève        efs/D'QuQqQlQp      ève                 .mas/pl
SFX F*   ve         f/D'L'QuQqQlQp      [iïu]ve             .mas
SFX F*   ve         fs/D'QuQqQlQp       [iïu]ve             .mas/pl
SFX F*   uë         u/D'L'QuQqQlQp      uë                  .mas
SFX F*   uë         us/D'QuQqQlQp       uë                  .mas/pl
SFX F*   üe         u/D'L'QuQpQlQp      üe                  .mas
SFX F*   üe         us/D'QuQqQlQp       üe                  .mas/pl

In the dictionary:
courbatue/F.()                [adj]
coureuse/F.()                 [nom]
abasourdissante/F*()          [adj]
arrogante/F*()                [nom|adj]

F. is used ~8300 times
F* is used ~1500 times


HERE IS THE QUESTION:
Obviously, I could replace S. by S*() flags. That will do the same thing, for the prefixation flags would only apply for words beginning by a vowel.
I could also replace F.() by F*() flags.

2/3 of entries use one of these four flags.

When I created them, I wanted to prevent unnecessary prefixation tests. But I know now that doublon affixes generate also more tests.
So what is the best solution?


Best regards,
Olivier

--

== N'écrivez pas à cette adresse. Réservée aux listes de discussion. ==
** Do not reply at this address. Mailing-list only. **

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to