2016-04-06 14:55 GMT+02:00 Juan Martorell <juan.martor...@gmail.com>:

> But more important are some derivatives, both suffixed and prefixed.
>

Hi Juan,

I can tell you my experience in these points.


> To transform one adjective into an adverb, in English you use the suffix
> `-ly` and in Spanish you use the suffix `-mente`:
>
> Equal --> equally
> Igual --> igualmente
>
> I found 18340 candidates for suffixation in the Spanish dictionary for this
> particular case.
>
>
This is fine. In Catalan it is done in the tagger: every feminine/neuter
adjective or participle + "-ment" is tagged as an adverb. A few malformed or
undesired words may slip through as a result.
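
A minimal sketch of that tagger rule, applied to Spanish "-mente". This is
not the actual LanguageTool code: the lexicon, the tag strings and the
function name are all assumptions made for illustration.

```python
# Hypothetical mini-lexicon: word form -> set of tags.
# The tag format (ADJ:<gender>:<number>) is invented for this sketch.
LEXICON = {
    "igual": {"ADJ:C:S"},    # common gender
    "rápida": {"ADJ:F:S"},   # feminine
    "rápido": {"ADJ:M:S"},   # masculine
}

def tag_mente_adverb(word, lexicon=LEXICON):
    """Return "ADV" if word is a feminine/common adjective + "-mente", else None."""
    suffix = "mente"
    if not word.endswith(suffix) or len(word) <= len(suffix):
        return None
    base = word[:-len(suffix)]
    tags = lexicon.get(base, set())
    # Only feminine or common-gender adjective forms can take the suffix.
    if any(t.startswith(("ADJ:F", "ADJ:C")) for t in tags):
        return "ADV"
    return None
```

With this lexicon, "igualmente" and "rápidamente" are tagged as adverbs,
while "rápidomente" is rejected because its base is masculine. The rule
says nothing about whether the resulting adverb is actually used, which is
where the undesired words come from.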


> Same for diminutives, augmentatives and superlatives. Depending on the
> zone these may vary, but if you want to be fully inclusive
> <https://es.wikipedia.org/wiki/Diminutivo>, you have to include 17
> diminutives, both genders; 9 augmentatives, both genders; 1 superlative,
> both genders excluding the irregular forms
> <https://es.wikipedia.org/wiki/Superlativo>. They apply to the same
> ~18000 candidates. They are widely used in writing, so it is worth
> including them.
>

You can try to implement it in the same way as the adverbs with the suffix
-mente. In Catalan it is not implemented this way, because it causes too
many problems. Instead, the most frequent words are added to the dictionary
one by one.


> It is quite common to attach some pronouns to the verb thus including
> information about direct and/or indirect object, or passive/impersonal
> voice.
>

As said before, this is implemented in the Catalan tokenizer. In Spanish
there are fewer combinations, but you need to remove the diacritics (and
check that they are correct).
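
A rough sketch of the Spanish case, assuming a plain set of known verb
forms. The names `PRONOUNS`, `VERB_FORMS`, `strip_acute` and
`split_enclitics` are hypothetical and do not come from LanguageTool. The
sketch only recovers the verb form by dropping the acute accent; it does
not verify that the accent in the original word was placed correctly.

```python
import unicodedata

# Enclitic pronouns that can attach to a Spanish verb form.
PRONOUNS = ("me", "te", "se", "lo", "la", "le", "los", "las", "les", "nos", "os")

# Hypothetical set of verb forms known to the dictionary.
VERB_FORMS = {"da", "cantando", "decir"}

def strip_acute(text):
    """Remove acute accents only, so that ñ and ü are preserved."""
    decomposed = unicodedata.normalize("NFD", text)
    return unicodedata.normalize("NFC", decomposed.replace("\u0301", ""))

def split_enclitics(word, verbs=VERB_FORMS, max_pronouns=2):
    """Return (verb_form, [pronouns]) or None if no split is found."""
    def search(stem, found):
        if found:
            # Try the stem as written, then without the stress accent.
            for candidate in (stem, strip_acute(stem)):
                if candidate in verbs:
                    return candidate, found
        if len(found) == max_pronouns:
            return None
        for pronoun in PRONOUNS:
            if stem.endswith(pronoun) and len(stem) > len(pronoun):
                result = search(stem[:-len(pronoun)], [pronoun] + found)
                if result:
                    return result
        return None
    return search(word, [])
```

For example, `split_enclitics("dámelo")` peels off "lo" and "me", then
matches "dá" against the lexicon as "da".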


> Other derivations imply prefixing like *re-* or *anti-*. These can be
> applied to verbs (all conjugations), adjectives and nouns: *reiniciar*,
> *antipersonal*, *recalificar*, *reincidente*, *reforestar*. Some of
> these are present in the dictionary, some are not. I will not take them
> into account, but it may be relevant.
>

I have tried this one, but I gave up on almost all prefixes because there
were too many unexpected problems. The only remaining prefix is auto- + verb.
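
As an illustration only, restricting prefixation to a short whitelist
could look like the sketch below. The tag strings, `VERB_TAGS` and
`tag_prefixed_verb` are assumptions, not the real implementation.

```python
# Only prefixes that proved safe are allowed; here just "auto".
ALLOWED_PREFIXES = ("auto",)

# Hypothetical verb lexicon: form -> tag (tag format invented for this sketch).
VERB_TAGS = {"evaluar": "VERB:INF", "evalúa": "VERB:3S"}

def tag_prefixed_verb(word, verb_tags=VERB_TAGS, prefixes=ALLOWED_PREFIXES):
    """Return the tag of the base verb if word is an allowed prefix + verb."""
    for prefix in prefixes:
        base = word[len(prefix):]
        if word.startswith(prefix) and base in verb_tags:
            return verb_tags[base]
    return None
```

Here "autoevaluar" inherits the tag of "evaluar", while "reevaluar" is left
untagged because "re" is not in the whitelist.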

Regards,
Jaume Ortolà
------------------------------------------------------------------------------
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel
