Thank you, Jaume, for sharing your valuable experience in this regard.

On 6 April 2016 at 16:22, Jaume Ortolà i Font <jaumeort...@gmail.com> wrote:

> 2016-04-06 14:55 GMT+02:00 Juan Martorell <juan.martor...@gmail.com>:
>
>
>> To transform one adjective into an adverb, in English you use the suffix
>> `-ly` and in Spanish you use the suffix `-mente`:
>>
>> Equal --> equally
>> Igual --> igualmente
>>
>> I found 18340 candidates for suffixation in the Spanish dictionary for
>> this particular case.
>>
>>
> This is fine. In Catalan it is done in the tagger. Every "feminine/neuter
> adjective/participle" + "-ment" is tagged as an adverb. A few malformed or
> undesired words could appear.
>
>

Have you solved synthesizing? Maybe there is no case for synthesizing here
and I can save the effort.
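
To make the question concrete, here is a minimal sketch of what I mean by
analysis versus synthesis for `-mente`. It assumes a toy set of feminine or
invariable adjective forms (hypothetical data, not the real dictionary), and
the function names are my own, not anything from the LanguageTool code base.

```python
# Hypothetical toy lexicon: feminine-singular or invariable adjective forms.
FEMININE_OR_INVARIABLE_ADJECTIVES = {"igual", "rápida", "clara", "feliz"}

SUFFIX = "mente"


def analyze_adverb(word):
    """Return the base adjective form if `word` is adjective + -mente, else None."""
    lowered = word.lower()
    if not lowered.endswith(SUFFIX):
        return None
    stem = lowered[: -len(SUFFIX)]
    # Only accept the word when the stem is a known adjective form.
    return stem if stem in FEMININE_OR_INVARIABLE_ADJECTIVES else None


def synthesize_adverb(adjective):
    """Return adjective + -mente for a known adjective form, else None."""
    lowered = adjective.lower()
    if lowered not in FEMININE_OR_INVARIABLE_ADJECTIVES:
        return None
    return lowered + SUFFIX
```

Analysis rejects words such as "demente", whose stem is not an adjective in
the lexicon. Synthesis is plain concatenation, which is why I suspect it may
not need separate treatment.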

>> Same for diminutives, augmentatives and superlatives. Depending on the zone
>> these may vary, but if you want to be fully inclusive
>> <https://es.wikipedia.org/wiki/Diminutivo>, you have to include 17
>> diminutives, both genders; 9 augmentatives, both genders; 1 superlative,
>> both genders excluding the irregular forms
>> <https://es.wikipedia.org/wiki/Superlativo>. They apply to the same
>> ~18000 candidates. They are widely used in writing, so it is worth
>> including them.
>>
>
> You can try to implement it in the same way as the adverbs with suffix
> -mente. In Catalan it is not implemented this way, because there are too many
> problems. Instead, the most frequent words, one by one, are added to the
> dictionary.
>

You may benefit from the hybrid statistical approach, so you can shortlist
them against a huge corpus like Wikisource. I'll share it should it be
useful.
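
As a rough illustration of the shortlisting idea (a sketch under my own
assumptions, not the actual implementation): count word frequencies in a
corpus and keep only the generated candidates that occur at least
`min_count` times. The function names and the threshold are hypothetical,
and the naive regex tokenizer stands in for a real one.

```python
import re
from collections import Counter


def corpus_frequencies(text):
    """Count lowercase word occurrences using a naive regex tokenizer."""
    return Counter(re.findall(r"\w+", text.lower()))


def shortlist(candidates, frequencies, min_count=2):
    """Keep the candidates attested at least `min_count` times in the corpus."""
    # Counter returns 0 for unseen words, so unattested candidates drop out.
    return sorted(c for c in candidates if frequencies[c] >= min_count)
```

For example, with generated candidates {"casita", "perrito", "mesita"} and a
corpus where only "casita" appears twice, only "casita" survives the default
threshold. The generated forms that nobody writes never reach the dictionary.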


>
>
>> It is quite common to attach some pronouns to the verb thus including
>> information about direct and/or indirect object, or passive/impersonal
>> voice.
>>
>
> As said before, this is implemented in the Catalan tokenizer. In Spanish
> there are fewer combinations, but you need to remove the diacritics (and
> check if they are correct).
>

I plan to start from verbal inflections closer to the final product,
so that I can sidestep the irregularities.
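
For reference, this is how I picture the diacritic removal Jaume mentions: a
minimal sketch that peels enclitic pronouns off the end of a word, strips the
acute accent from what remains, and checks it against known verb forms. The
clitic list, the tiny verb set and the function names are illustrative
assumptions, and it does not verify that the accent was correctly placed.

```python
import unicodedata

# Illustrative enclitic pronouns; a real list may differ.
CLITICS = ("me", "te", "se", "lo", "la", "le", "los", "las", "les", "nos", "os")

# Hypothetical toy set of verb forms, written without clitics.
KNOWN_VERB_FORMS = {"da", "comprando", "decir"}


def strip_acute(text):
    """Remove acute accents only, leaving characters such as ñ and ü intact."""
    decomposed = unicodedata.normalize("NFD", text)
    return unicodedata.normalize("NFC", decomposed.replace("\u0301", ""))


def split_clitics(word, max_clitics=2):
    """Return (verb_form, clitics) if `word` is a known verb form plus clitics."""

    def peel(rest, found):
        base = strip_acute(rest)
        if found and base in KNOWN_VERB_FORMS:
            return base, found
        if len(found) == max_clitics:
            return None
        for clitic in CLITICS:
            if rest.endswith(clitic) and len(rest) > len(clitic):
                result = peel(rest[: -len(clitic)], [clitic] + found)
                if result is not None:
                    return result
        return None

    return peel(word.lower(), [])
```

With this toy data, "dámelo" splits into "da" plus "me" and "lo", while a
word like "mesa" is left alone because no verb form remains after peeling.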


>
>
>> Other derivations imply prefixing like *re-* or *anti-*. These can be
>> applied to verbs (all conjugations), adjectives and nouns:
>> *reiniciar*, *antipersonal*, *recalificar*, *reincidente*, *reforestar*.
>> Some of these are present in the dictionary, some are not. I will not
>> include them in the count, but they may be relevant.
>>
>
> I have tried this one, but I gave up on almost all prefixes because there
> were too many unexpected problems. The only remaining prefix is auto- +
> verb.
>

That's a good one. If I see any other good prefix candidate I will share it,
since many languages (like ours) share the same Latin roots. Perhaps this is
another candidate for statistical shortlisting.