On 28/12/2012 08:41, Paolo Bianchini wrote:
Actually what I wanted to try is to add a check in the grammar rule that takes into account also the person in which the verb is used so that we check the tense of verbs only if they are in the same person.

Therefore, instead of adding the exception as Dominique was suggesting (that btw is something that we could try), the rule would match

raccolse[raccogliere/VER:ind+past+3+s]

with

viaggio[viaggio/NOUN-M:s,viaggiare/VER:ind+pres+1+s]

because they are not in the same person. I assume that if someone was to make a verb tense mistake in writing a sentence they would, at least, use the same person.
I disagree.
You would restrict the usefulness of the rule to cover for a different deficiency. Here the right thing is to understand "viaggio" should NOT be interpreted as a VERb at all.

Any suggestion on how this could be achieved? I guess that it would be much easier in Java than with regexps
Unfortunately I am not a linguist and I have problems understanding what are the "right" rules to use for disambiguation.

What I can say is rule GR_10_001 is by far the foremost source of false positives in literature. I checked using random chapters of well-established novels. OTOH disabling it isn't an option because wrong tens concordance is a major cause of error (especially when subjunctive is involved!).

What I don't know is how to build a comparable corpus of *wrong* Italian sentences. I will try to devise something. Building wrong phrases myself should be last resort because those wouldn't be representative of real errors "in the wild".

Thanks

Paolo

My disambiguation rule needs updating, if someone can suggest how.

Mario gli chiese l'ora.

121 rules activated for language Italian

<S> Mario[Mario/NPR]  gli[gli/PRO-PERS-CLI-3-M-S,il/ART-M:p]  chiese[chiesa/NOUN-F:p]  
l[l]'['/PON]ora[orare/VER:impr+pres+2+s,orare/VER:ind+pres+3+s,ora/ADV,ora/NOUN-F:s].[./SENT,</S>]<P/>

Disambiguator log:

art-ver: chiese[chiesa/NOUN-F:p,chiedere/VER:ind+past+3+s] -> 
chiese[chiesa/NOUN-F:p]

1.) Line 1, column 7, Rule ID: GR_02_001[2]

Message: L'articolo non concorda: 'le chiese'.

Suggestion: le chiese

Mario gli chiese l'ora.

      ^^^^^^^^^^


Here problem is "gli" is not an ARTicle, but a PROnoun, thus the rule should not apply. In the same sentence "l'" is not recognized as ARTicle ("la") and thus the rule is not applied to the following "ora", while it should have.

Can someone suggest how to improve this disambiguation rule to cover the currently known cases?

TiA
Mauro
------------------------------------------------------------------------------
Master HTML5, CSS3, ASP.NET, MVC, AJAX, Knockout.js, Web API and
much more. Get web development skills now with LearnDevNow -
350+ hours of step-by-step video tutorials by Microsoft MVPs and experts.
SALE $99.99 this month only -- learn more at:
http://p.sf.net/sfu/learnmore_122812
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to