Hi,

I ran a check on apertium-eng and found about 300 duplicate entries, mostly 
verbs. Two years ago I removed all duplicate entries (same lemma + PoS with 
different pardefs, where one is unnecessary or wrong) from apertium-eng, so 
these were added since then.

Some of the duplicate entries look like this:

<e lm="remarry">         <i>remarr</i><par n="appl/y__vblex"/></e>
<e lm="remarry">         <i>remarry</i><par n="accept__vblex"/></e>
(The second, newer one is wrong because the past participle is "remarried" 
and not "remarryed", as far as I know.)

<e lm="swell">           <i>sw</i><par n="sw/ell__vblex"/></e>
<e lm="swell">           <i>swel</i><par n="spel/l__vblex"/></e>
(The second one is the newer one. My dictionary does not list "swelt" as a 
past participle of "swell", only "swelled" and "swollen".)
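
A check of this kind can be sketched in a few lines of Python. This is only a 
minimal sketch, not the script I actually used: the function name 
find_double_entries is made up for illustration, and it assumes that the PoS 
is the part of the pardef name after the last "__" (as in accept__vblex). It 
only lists candidates; which of the two pardefs is wrong still has to be 
decided by hand.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def find_double_entries(root):
    """Return {(lemma, pos): [pardef names]} for lemma + PoS pairs
    that are listed with more than one distinct pardef."""
    pardefs = defaultdict(set)
    for entry in root.iter("e"):
        lemma = entry.get("lm")
        if lemma is None:
            # Entries inside <pardef> definitions carry no lm attribute.
            continue
        for par in entry.iter("par"):
            name = par.get("n", "")
            if "__" not in name:
                continue
            # Assumption: the PoS is whatever follows the last "__".
            pos = name.rsplit("__", 1)[1]
            pardefs[(lemma, pos)].add(name)
    return {
        key: sorted(names)
        for key, names in pardefs.items()
        if len(names) > 1
    }
```

To run it on a dictionary, pass the parsed root element, for example 
find_double_entries(ET.parse(path).getroot()), where path points to the 
monolingual .dix file. Entries that are duplicated on purpose would also show 
up, so the output needs a manual review.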


I can remove all the duplicate entries again, but unfortunately apertium-eng 
now belongs to a released language pair. What is the best way to remove these 
duplicate entries now, and how can we ensure that no one adds them again in 
the future because they use such an entry in their own (unreleased) language 
pair?

Over the next few days I will try to remove a few of the wrong ones so we can 
watch what happens. I think that if we can find the source, we can correct 
the entries directly in the source dictionary so that they are not copied 
into apertium-eng again.

Best regards,
wolfgang

_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff
