Hi,
In Catalan there are many pairs of words that differ only in an accent mark
(about 5000 excluding verb-verb pairs). With some lists of these words
combined with some pattern rules, we can detect a lot of "forgotten
accents".
Taking as a model some of the existing Java rules (AvsAnRule,
WordCoherencyRule, etc), I wrote a Java rule for this. Now I would like to
make some improvements.
1) I need to match POS tags with regular expressions, but I don't find any
method for this at this level. So I will try to write my own.
2) I need to make some new dictionary lookups.
Any tips, suggestions, examples for doing this?
Regards,
Jaume Ortolà
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel