On 28/12/2012 08:41, Paolo Bianchini wrote:
Actually what I wanted to try is to add a check in the grammar rule
that takes into account also the person in which the verb is used so
that we check the tense of verbs only if they are in the same person.
Therefore, instead of adding the exception as Dominique was suggesting
(that btw is something that we could try), the rule would match
raccolse[raccogliere/VER:ind+past+3+s]
with
viaggio[viaggio/NOUN-M:s,viaggiare/VER:ind+pres+1+s]
because they are not in the same person. I assume that if someone was
to make a verb tense mistake in writing a sentence they would, at
least, use the same person.
I disagree.
You would restrict the usefulness of the rule to cover for a different
deficiency.
Here the right thing is to understand "viaggio" should NOT be
interpreted as a VERb at all.
Any suggestion on how this could be achieved? I guess that it would be
much easier in Java than with regexps
Unfortunately I am not a linguist and I have problems understanding what
are the "right" rules to use for disambiguation.
What I can say is rule GR_10_001 is by far the foremost source of false
positives in literature. I checked using random chapters of
well-established novels. OTOH disabling it isn't an option because wrong
tens concordance is a major cause of error (especially when subjunctive
is involved!).
What I don't know is how to build a comparable corpus of *wrong* Italian
sentences. I will try to devise something. Building wrong phrases myself
should be last resort because those wouldn't be representative of real
errors "in the wild".
Thanks
Paolo
My disambiguation rule needs updating, if someone can suggest how.
Mario gli chiese l'ora.
121 rules activated for language Italian
<S> Mario[Mario/NPR] gli[gli/PRO-PERS-CLI-3-M-S,il/ART-M:p] chiese[chiesa/NOUN-F:p]
l[l]'['/PON]ora[orare/VER:impr+pres+2+s,orare/VER:ind+pres+3+s,ora/ADV,ora/NOUN-F:s].[./SENT,</S>]<P/>
Disambiguator log:
art-ver: chiese[chiesa/NOUN-F:p,chiedere/VER:ind+past+3+s] ->
chiese[chiesa/NOUN-F:p]
1.) Line 1, column 7, Rule ID: GR_02_001[2]
Message: L'articolo non concorda: 'le chiese'.
Suggestion: le chiese
Mario gli chiese l'ora.
^^^^^^^^^^
Here problem is "gli" is not an ARTicle, but a PROnoun, thus the rule
should not apply.
In the same sentence "l'" is not recognized as ARTicle ("la") and thus
the rule is not applied to the following "ora", while it should have.
Can someone suggest how to improve this disambiguation rule to cover the
currently known cases?
TiA
Mauro
------------------------------------------------------------------------------
Master HTML5, CSS3, ASP.NET, MVC, AJAX, Knockout.js, Web API and
much more. Get web development skills now with LearnDevNow -
350+ hours of step-by-step video tutorials by Microsoft MVPs and experts.
SALE $99.99 this month only -- learn more at:
http://p.sf.net/sfu/learnmore_122812
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel