Thank you Jaume for sharing your valuable experience in this regard On 6 April 2016 at 16:22, Jaume Ortolà i Font <jaumeort...@gmail.com> wrote:
> 2016-04-06 14:55 GMT+02:00 Juan Martorell <juan.martor...@gmail.com>: > > >> To transform one adjective into an adverb, in English you use the suffix >> `-ly` and in Spanish you use the suffix `-mente`: >> >> Equal --> equally >> Igual --> igualmente >> > I found 18340 candidates for suffixation in the Spanish dictionary for >> this particular case. >> >> > This is fine. In Catalan it is done in the tagger. Every "femenine/neuter > adjective/participle" + "-ment" is tagged as an adverb. A few malformed or > undesired words could appear. > > Have you solved synthetising? Maybe there is not case for synthesizing here and I can save the dime. Same for diminutives, augmentatives and superlatives. Depending on the zone >> these may vary, but if you want to be fully inclusive >> <https://es.wikipedia.org/wiki/Diminutivo>, you have to include 17 >> diminutives, both genders; 9 augmentatives, both genders; 1 superlative, >> both genders excluding the irregular forms >> <https://es.wikipedia.org/wiki/Superlativo>. They apply to the same >> ~18000 candidates. They are widely used in writing, so it is worth to >> include them. >> > > You can try to implement it in the same way as the adverbs with suffix > -mente. In Catalan is not implemented this way, because there are too many > problems. Instead, the most frequent words, one by one, are added to the > dictionary. > You may benefit from the hybrid statistical approach so you can shortlist them angainst a huge corpus like the Wikisource. I'll share it should it is useful. > > >> It is quite common to attach some pronouns to the verb thus including >> information about direct and/or indirect object, or passive/impersonal >> voice. >> > > As said before, this is implemented in the Catalan tokenizer. In Spanish > there are fewer combinations, but you need to remove the diacritics (and > check if they are correct). > I plant to start from some verbal inflections closer to the final product, so I overcome the irregularities. > > >> Other derivations imply prefixing like *re-* or *anti-*. These can be >> applied to both verbs (all conjugations), adjetives and nouns: >> *reiniciar*, *antiperson**al*, *recalificar*, *reincidente*, *reforestar*. >> Some of this are present in the dictionary, some are not, l will not >> include them into the account but it may be relevant. >> > > I have tried this one, but I gave up almost all prefixes because there > were too many unexpected problemes. The only remaning prefix is auto- + > verb. > That's a good one. If I see some other good prefix candidate I will share since many languages (as ours) share the same latin roots. Perhaps this is another candidate for statistical shortlisting.
------------------------------------------------------------------------------
_______________________________________________ Languagetool-devel mailing list Languagetool-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/languagetool-devel