Hi László,
Another question about affixes optimization:
First, here is the prefixation flags for elisions (nouns and adjectives
only):
PFX L' Y 1
PFX L' 0 l' [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ] le|la+
PFX D' Y 1
PFX D' 0 d' [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ] de+
PFX Qu Y 1
PFX Qu 0 qu' [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ] que+
PFX Qq Y 1
PFX Qq 0 quoiqu' [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ] quoique+
PFX Qp Y 1
PFX Qp 0 puisqu' [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ] puisque+
PFX Ql Y 1
PFX Ql 0 lorsqu' [aàâeèéêiîoôuyhœæAÀÂEÈÉÊIÎOÔUYHŒÆ] lorsque+
For common plural, I use the following flag:
SFX S. Y 1
SFX S. 0 s [^sxz] /pl
But for nouns and adjectives which begin by a vowel, I need to add
elisions, so I created another flag:
SFX S* Y 2
SFX S* 0 0/D'L'QuQqQlQp [^sxz]
SFX S* 0 s/D'QuQqQlQp [^sxz] /pl
In the dictionary, it looks like this:
bandage/S. [nm]
bande/S. [nf]
attention/S*() [nf]
attentisme/S*() [nm]
() is the NEEDAFFIX flag
S. is used ~24400 times
S* is used ~7400 times
This works also the same for adjectives and feminine/masculine nouns:
one flag for simple plural forms and another one for the same thing +
elisions.
SFX F. Y 64
SFX F. 0 0 . .fem
SFX F. 0 s [eë] .fem/pl
SFX F. e 0 [éiï]e .mas
SFX F. e s [éiï]e .mas/pl
SFX F. rice eur [dt]rice .mas
SFX F. rice eurs [dt]rice .mas/pl
SFX F. de d de .mas
SFX F. de ds de .mas/pl
SFX F. ffe f ffe .mas
SFX F. ffe fs ffe .mas/pl
SFX F. che c [^è]che .mas
SFX F. che cs [^è]che .mas/pl
SFX F. èche ec èche .mas
SFX F. èche ecs èche .mas/pl
SFX F. he h [ut]he .mas
SFX F. he hs [ut]he .mas/pl
SFX F. ke k ke .mas
SFX F. ke ks ke .mas/pl
SFX F. le l [aiouû]le .mas
SFX F. le ls [aiouû]le .mas/pl
SFX F. lle l [eiu]lle .mas
SFX F. lle ls [eiu]lle .mas/pl
SFX F. ne n [aiou]ne .mas
SFX F. ne ns [aiou]ne .mas/pl
SFX F. nne n [aeo]nne .mas
SFX F. nne ns [aeo]nne .mas/pl
SFX F. igne in igne .mas
SFX F. igne ins igne .mas/pl
SFX F. re r [aiuûy]re .mas
SFX F. re rs [aiuûy]re .mas/pl
SFX F. ère er ère .mas
SFX F. ère ers ère .mas/pl
SFX F. se s [^us]se .mas.inv
SFX F. esse e [^eo].esse .mas
SFX F. esse es [^eo].esse .mas/pl
SFX F. eresse eur eresse .mas
SFX F. eresse eurs eresse .mas/pl
SFX F. oresse eur oresse .mas
SFX F. oresse eurs oresse .mas/pl
SFX F. sse s [^e]sse .mas.inv
SFX F. use us [^eo]use .mas.inv
SFX F. euse eur euse .mas
SFX F. euse eurs euse .mas/pl
SFX F. te t [^èt]te .mas
SFX F. te ts [^èt]te .mas/pl
SFX F. tte t tte .mas
SFX F. tte ts tte .mas/pl
SFX F. ète et ète .mas
SFX F. ète ets ète .mas/pl
SFX F. ue u [^gq]ue .mas
SFX F. ue us [^gq]ue .mas/pl
SFX F. gue g gue .mas
SFX F. gue gs gue .mas/pl
SFX F. cque c cque .mas
SFX F. cque cs cque .mas/pl
SFX F. que c [^c]que .mas
SFX F. que cs [^c]que .mas/pl
SFX F. ève ef ève .mas
SFX F. ève efs ève .mas/pl
SFX F. ve f [iïu]ve .mas
SFX F. ve fs [iïu]ve .mas/pl
SFX F. uë u uë .mas
SFX F. uë us uë .mas/pl
SFX F. üe u üe .mas
SFX F. üe us üe .mas/pl
SFX F* Y 64
SFX F* 0 0/D'L'QuQqQlQp . .fem
SFX F* 0 s/D'QuQqQlQp [eë] .fem/pl
SFX F* e 0/D'L'QuQqQlQp [éiï]e .mas
SFX F* e s/D'QuQqQlQp [éiï]e .mas/pl
SFX F* rice eur/D'L'QuQqQlQp [dt]rice .mas
SFX F* rice eurs/D'QuQqQlQp [dt]rice .mas/pl
SFX F* de d/D'L'QuQqQlQp de .mas
SFX F* de ds/D'QuQqQlQp de .mas/pl
SFX F* ffe f/D'L'QuQqQlQp ffe .mas
SFX F* ffe fs/D'QuQqQlQp ffe .mas/pl
SFX F* che c/D'L'QuQqQlQp [^è]che .mas
SFX F* che cs/D'QuQqQlQp [^è]che .mas/pl
SFX F* èche ec/D'L'QuQqQlQp èche .mas
SFX F* èche ecs/D'QuQqQlQp èche .mas/pl
SFX F* he h/D'L'QuQqQlQp [ut]he .mas
SFX F* he hs/D'QuQqQlQp [ut]he .mas/pl
SFX F* ke k/D'L'QuQqQlQp ke .mas
SFX F* ke ks/D'QuQqQlQp ke .mas/pl
SFX F* le l/D'L'QuQqQlQp [aiouû]le .mas
SFX F* le ls/D'QuQqQlQp [aiouû]le .mas/pl
SFX F* lle l/D'L'QuQqQlQp [eiu]lle .mas
SFX F* lle ls/D'QuQqQlQp [eiu]lle .mas/pl
SFX F* ne n/D'L'QuQqQlQp [aiou]ne .mas
SFX F* ne ns/D'QuQqQlQp [aiou]ne .mas/pl
SFX F* nne n/D'L'QuQqQlQp [aeo]nne .mas
SFX F* nne ns/D'QuQqQlQp [aeo]nne .mas/pl
SFX F* gne n/D'L'QuQqQlQp igne .mas
SFX F* gne ns/D'QuQqQlQp igne .mas/pl
SFX F* re r/D'L'QuQqQlQp [aiuûy]re .mas
SFX F* re rs/D'QuQqQlQp [aiuûy]re .mas/pl
SFX F* ère er/D'L'QuQqQlQp ère .mas
SFX F* ère ers/D'QuQqQlQp ère .mas/pl
SFX F* se s/D'L'QuQqQlQp [^us]se .mas.inv
SFX F* esse e/D'L'QuQqQlQp [^eo].esse .mas
SFX F* esse es/D'QuQqQlQp [^eo].esse .mas/pl
SFX F* eresse eur/D'L'QuQqQlQp eresse .mas
SFX F* eresse eurs/D'QuQqQlQp eresse .mas/pl
SFX F* oresse eur/D'L'QuQqQlQp oresse .mas
SFX F* oresse eurs/D'QuQqQlQp oresse .mas/pl
SFX F* sse s/D'L'QuQqQlQp [^e]sse .mas.inv
SFX F* use us/D'L'QuQqQlQp [^eo]use .mas.inv
SFX F* euse eur/D'L'QuQqQlQp euse .mas
SFX F* euse eurs/D'QuQqQlQp euse .mas/pl
SFX F* te t/D'L'QuQqQlQp [^èt]te .mas
SFX F* te ts/D'QuQqQlQp [^èt]te .mas/pl
SFX F* tte t/D'L'QuQqQlQp tte .mas
SFX F* tte ts/D'QuQqQlQp tte .mas/pl
SFX F* ète et/D'L'QuQqQlQp ète .mas
SFX F* ète ets/D'QuQqQlQp ète .mas/pl
SFX F* ue u/D'L'QuQqQlQp [^gq]ue .mas
SFX F* ue us/D'QuQqQlQp [^gq]ue .mas/pl
SFX F* gue g/D'L'QuQqQlQp gue .mas
SFX F* gue gs/D'QuQqQlQp gue .mas/pl
SFX F* cque c/D'L'QuQqQlQp cque .mas
SFX F* cque cs/D'QuQqQlQp cque .mas/pl
SFX F* que c/D'L'QuQqQlQp [^c]que .mas
SFX F* que cs/D'QuQqQlQp [^c]que .mas/pl
SFX F* ève ef/D'L'QuQqQlQp ève .mas
SFX F* ève efs/D'QuQqQlQp ève .mas/pl
SFX F* ve f/D'L'QuQqQlQp [iïu]ve .mas
SFX F* ve fs/D'QuQqQlQp [iïu]ve .mas/pl
SFX F* uë u/D'L'QuQqQlQp uë .mas
SFX F* uë us/D'QuQqQlQp uë .mas/pl
SFX F* üe u/D'L'QuQpQlQp üe .mas
SFX F* üe us/D'QuQqQlQp üe .mas/pl
In the dictionary:
courbatue/F.() [adj]
coureuse/F.() [nom]
abasourdissante/F*() [adj]
arrogante/F*() [nom|adj]
F. is used ~8300 times
F* is used ~1500 times
HERE IS THE QUESTION:
Obviously, I could replace S. by S*() flags. That will do the same
thing, for the prefixation flags would only apply for words beginning by
a vowel.
I could also replace F.() by F*() flags.
2/3 of entries use one of these four flags.
When I created them, I wanted to prevent unnecessary prefixation tests.
But I know now that doublon affixes generate also more tests.
So what is the best solution?
Best regards,
Olivier
--
== N'écrivez pas à cette adresse. Réservée aux listes de discussion. ==
** Do not reply at this address. Mailing-list only. **
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]